S.K. van Wolfswinkel

1 record found

Acting in the Face of Uncertainty

Pessimism in Offline Model-Based Reinforcement Learning

Offline model-based reinforcement learning uses a model of the environment, learned from a static dataset of interactions, to guide policy generation. The agent can make sub-optimal planning decisions when it explores out-of-distribution states, as the world model will h ...