TH

Tianjiang Hu

1 records found

DACOOP-A

Decentralized Adaptive Cooperative Pursuit via Attention

Integrating rule-based policies into reinforcement learning promises to improve data efficiency and generalization in cooperative pursuit problems. However, most implementations do not properly distinguish the influence of neighboring robots in observation embedding or inter-robo ...