A.T. Czechowski | TU Delft Repository

Safety Guarantees in Multi-agent Learning via Trapping Regions

Journal article (2023) - A.T. Czechowski (author), F.A. Oliehoek (author)

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to m ...

Non-chaotic limit sets in multi-agent learning

Journal article (2023) - A.T. Czechowski (author), Georgios Piliouras (author)

Non-convergence is an inherent aspect of adaptive multi-agent systems, and even basic learning models, such as the replicator dynamics, are not guaranteed to equilibriate. Limit cycles, and even more complicated chaotic sets are in fact possible even in rather simple games, inclu ...

Safe Multi-agent Learning via Trapping Regions

Conference paper (2023) - A.T. Czechowski (author), F.A. Oliehoek (author)

One of the main challenges of multi-agent learning lies in establishing convergence of the algorithms, as, in general, a collection of individual, self-serving agents is not guaranteed to converge with their joint policy, when learning concurrently. This is in stark contrast to m ...

Influence-aware memory architectures for deep reinforcement learning in POMDPs

Journal article (2022) - M. Suau (author), J. He (author), E. Congeduti (author), R.A.N. Starre (author), A.T. Czechowski (author), F.A. Oliehoek (author)

Due to its perceptual limitations, an agent may have too little information about the environment to act optimally. In such cases, it is important to keep track of the action-observation history to uncover hidden state information. Recent deep reinforcement learning methods use r ...

Poincaré-Bendixson Limit Sets in Multi-Agent Learning

Conference paper (2022) - A.T. Czechowski (author), Georgios Piliouras (author)

A key challenge of evolutionary game theory and multi-agent learning is to characterize the limit behavior of game dynamics. Whereas convergence is often a property of learning algorithms in games satisfying a particular reward structure (e.g., zero-sum games), even basic learnin ...

Multi Robot Surveillance and Planning in Limited Communication Environments

Conference paper (2022) - V. Inna Kedege (author), A.T. Czechowski (author), Ludo Stellingwerff (author), F.A. Oliehoek (author)

Distributed robots that survey and assist with search & rescue operations usually deal with unknown environments with limited communication. This paper focuses on distributed & cooperative multi-robot area coverage strategies of unknown environments, having constrained co ...

Constraint Propagation and Reverse Multi-Agent Learning

Conference paper (2021) - A.T. Czechowski (author)

The development of multi-agent reinforcement learning has been largely driven by the question of how to design learning algorithms to reach some particular notion of optimality of strategies, e.g. Nash equilibria. The set of optimal strategies is not known before the execution of ...

Exploring the Effects of Conditioning Independent Q-Learners on the Sufficient Statistic for Dec-POMDPs

Conference paper (2020) - A.V. Mandersloot (author), F.A. Oliehoek (author), A.T. Czechowski (author)

In this study, we investigate the effects of conditioning Independent Q-Learners (IQL) not solely on the individual action-observation history, but additionally on the sufficient plan-time statistic for Decentralized Partially Observable Markov Decision Processes. In doing so, we ...

Decentralized MCTS via Learned Teammate Models

Conference paper (2020) - A.T. Czechowski (author), F.A. Oliehoek (author)

Decentralized online planning can be an attractive paradigm for cooperative multi-agent systems, due to improved scalability and robustness. A key difficulty of such approach lies in making accurate predictions about the decisions of other agents. In this paper, we present a trai ...

Alternating Maximization with Behavioral Cloning

Conference paper (2020) - A.T. Czechowski (author), F.A. Oliehoek (author)

The key difficulty of cooperative, decentralized planning lies in making accurate predictions about the behavior of one’s teammates. In this paper we introduce a planning method of Alternating maximization with Behavioural Cloning (ABC) – a trainable online decentralized planning ...

Influence-Based Abstraction in Deep Reinforcement Learning

Conference paper (2019) - M. Suau (author), E. Congeduti (author), R.A.N. Starre (author), A.T. Czechowski (author), F.A. Oliehoek (author)

thousands, or even millions of state variables. Unfortunately, applying reinforcement learning algorithms to handle complex tasks becomes more and more challenging as the number of state variables increases. In this paper, we build on the concept of influence-based abstraction wh ...