An Exploration of a Hierarchical Approach for MacDec-POMDPs

Master thesis (2021)

Authors

K. Fani (Electrical Engineering, Mathematics and Computer Science)

Contributors

F.A. Oliehoek (Interactive Intelligence, mentor)

R.A.N. Starre (Interactive Intelligence, mentor)

Faculty

Electrical Engineering, Mathematics and Computer Science

To reference this document use:

http://resolver.tudelft.nl/uuid:d702c98a-eeea-4013-b27f-d412dee41088

More Info

Published Date

14-07-2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

To do something useful, you need to know what to do and how to do it. The same goes for robots and other machines, which are also referred to as agents. Advances in technology and science have made such agents increasingly sophisticated and capable. This opens up a plethora of possibilities for solving problems and automating processes.
However, as agents and the problems they solve become more sophisticated, the software that controls these agents naturally becomes more complex as well, especially in systems of multiple cooperating agents. With rapidly increasing numbers of agents, observations and capabilities to take into account, the plans used to reach decisions become increasingly infeasible to create by hand. Automated methods are therefore necessary to create such plans.
The approach this thesis uses is inspired by the ability of human intelligence to abstract away fine-grained details when planning. For example, when we make a plan for picking up groceries, we do not include which muscles we will use. Deciding to get groceries is on a completely different level than deciding how many centimeters to move a limb in a particular direction. Abstracting away such details greatly reduces the size of any potential plan, which makes plans easier to construct while the scope of the problem remains the same.
This thesis therefore uses the notion of different levels of abstraction during planning to improve planning for agents. This is done by extending the Macro Decentralized Partially Observable Markov Decision Process (MacDec-POMDP) framework to allow for an arbitrary number of levels of abstraction.

Files

MSc_Thesis_Kees_Fani.pdf

(pdf | 4.24 Mb)

Unknown license