S. Bratus

Bachelor thesis (1)

1 records found

Generalisation Ability of Proper Value Equivalence Models in Model-Based Reinforcement Learning

Bachelor thesis (2024) - S. Bratus (author) , J. He (mentor) , M.M. de Weerdt (coach) , F.A. Oliehoek (graduation committee member)

We investigate the generalization performance of predictive models in model-based reinforcement learning when trained using maximum likelihood estimation (MLE) versus proper value equivalence (PVE) loss functions. While the more conventional MLE loss aims to fit models to predict ...