S.K. van Wolfswinkel

Bachelor thesis (1)

1 records found

Acting in the Face of Uncertainty

Pessimism in Offline Model-Based Reinforcement Learning

Bachelor thesis (2024) - S.K. van Wolfswinkel (author) , J. He (mentor) , F.A. Oliehoek (graduation committee member) , M.M. de Weerdt (graduation committee member)

Offline model-based reinforcement learning uses a model of the environment, learned from a static dataset of interactions, to guide policy generation. Sub-optimal planning decisions can be made when the agent explores states that are out-of-distribution, as the world model will h ...