Despite the successes of deep reinforcement learning (RL), it is still challenging to obtain safe policies. Formal verification approaches ensure safety at all times, but usually overly restrict the agent's behaviors, since they assume adversarial behavior of the environment. Instead of assuming adversarial behavior, we suggest focusing on perceived safety, i.e., policies that avoid undesired behaviors while maintaining a desired level of conservativeness. To obtain policies that are perceived as safe, we propose a shield synthesis framework with two distinct loops: (1) an inner loop that trains policies with a set of actions constrained by shields whose conservativeness is parameterized, and (2) an outer loop that presents example rollouts of the policy to humans and collects their feedback to update the parameters of the shields in the inner loop. We demonstrate our approach on an RL benchmark of lunar landing and a scenario in which a mobile robot navigates around humans. For the latter, we conducted two user studies to obtain policies that were perceived as safe. Our results indicate that our framework converges to policies that are perceived as safe, is robust against noisy feedback, and can query feedback for multiple policies at the same time.
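To make the two-loop structure concrete, below is a minimal, self-contained Python sketch of the idea under simplifying assumptions: a toy 1-D action space, a scalar "conservativeness" parameter, and simulated human feedback. All names and the toy environment are hypothetical illustrations, not the authors' implementation.

```python
import random

def shield(action, conservativeness):
    """Inner-loop shield (toy): clip actions to a range that shrinks as conservativeness grows."""
    limit = 1.0 - conservativeness          # conservativeness assumed to lie in [0, 1]
    return max(-limit, min(limit, action))

def train_policy(conservativeness, episodes=200):
    """Inner loop (toy): search for the shielded action that maximizes a simple reward."""
    best_a, best_r = 0.0, float("-inf")
    for _ in range(episodes):
        a = shield(random.uniform(-1.0, 1.0), conservativeness)
        r = -(a - 0.6) ** 2                 # toy objective: reward prefers actions near 0.6
        if r > best_r:
            best_a, best_r = a, r
    return best_a

def human_feedback(action, preferred=0.5):
    """Outer loop (simulated): +1 if the behavior feels too risky, -1 if too timid."""
    return 1.0 if abs(action) > preferred else -1.0

def synthesize(theta=0.5, outer_iters=20, lr=0.05):
    """Alternate shielded training (inner loop) with feedback-driven shield updates (outer loop)."""
    for _ in range(outer_iters):
        policy_action = train_policy(theta)           # (1) train with shield-constrained actions
        theta += lr * human_feedback(policy_action)   # (2) feedback adjusts the shield parameter
        theta = min(max(theta, 0.0), 1.0)
    return theta, train_policy(theta)

if __name__ == "__main__":
    theta, action = synthesize()
    print(f"converged conservativeness={theta:.2f}, typical shielded action={action:.2f}")
```

In this sketch, feedback that the behavior appears too risky increases the conservativeness parameter (tightening the shield), while feedback that it appears too timid relaxes it, so the parameter settles near the level the (simulated) human perceives as safe.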