Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions

Conference paper (2023)

Authors

D. Du Robot Dynamics - Mechanical, Maritime and Materials Engineering, Harbin Institute of Technology

S. Han Student

Naiming Qi Harbin Institute of Technology

Haitham Bou Ammar Huawei Technologies, University College London

Jun Wang University College London

W. Pan The University of Manchester, Robot Dynamics - Mechanical, Maritime and Materials Engineering

Research Group

Robot Dynamics (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI: https://doi.org/10.1109/ICRA48891.2023.10160991

To reference this document use:

http://resolver.tudelft.nl/uuid:9e01529f-e133-4794-bb32-eb525b08ede2

More Info

Published Date

2023

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Robot Dynamics

Abstract

Reinforcement learning (RL) exhibits impressive performance when managing complicated control tasks for robots. However, its wide application to physical robots is limited by the absence of strong safety guarantees. To overcome this challenge, this paper explores the control Lyapunov barrier function (CLBF) to analyze safety and reachability based solely on data, without explicitly employing a dynamic model. We also propose the Lyapunov barrier actor-critic (LBAC), a model-free RL algorithm, to search for a controller that satisfies the data-based approximation of the safety and reachability conditions. The proposed approach is demonstrated through simulation and real-world robot control experiments, i.e., a 2D quadrotor navigation task. The experimental findings reveal this approach's effectiveness in reachability and safety, surpassing other model-free RL methods.

Files

Reinforcement_Learning_for_Saf... (pdf)

(pdf | 1.04 MB)

- Embargo expired on 04-01-2024

Unknown license