Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective

Citation

Everitt, T, Hutter, M, Kumar, R et al. 2021, 'Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective', Synthese, vol. 198, pp. 1-33.

Year

2021
