Global ETD Search

Return to search

Temporal Abstractions in Multi-agent Learning

<p dir="ltr">Learning, planning, and representing knowledge at multiple levels of temporal abstractions provide an agent with the ability to predict consequences of different courses of actions, which is essential for improving the performance of sequential decision making. However, discovering effective temporal abstractions, which the agent can use as skills, and adopting the constructed temporal abstractions for efficient policy learning can be challenging. Despite significant advancements in single-agent settings, temporal abstractions in multi-agent systems remains underexplored. This thesis addresses this research gap by introducing novel algorithms for discovering and employing temporal abstractions in both cooperative and competitive multi-agent environments. We first develop an unsupervised spectral-analysis-based discovery algorithm, aiming at finding temporal abstractions that can enhance the joint exploration of agents in complex, unknown environments for goal-achieving tasks. Subsequently, we propose a variational method that is applicable for a broader range of collaborative multi-agent tasks. This method unifies dynamic grouping and automatic multi-agent temporal abstraction discovery, and can be seamlessly integrated into the commonly-used multi-agent reinforcement learning algorithms. Further, for competitive multi-agent zero-sum games, we develop an algorithm based on Counterfactual Regret Minimization, which enables agents to form and utilize strategic abstractions akin to routine moves in chess during strategy learning, supported by solid theoretical and empirical analyses. Collectively, these contributions not only advance the understanding of multi-agent temporal abstractions but also present practical algorithms for intricate multi-agent challenges, including control, planning, and decision-making in complex scenarios.</p>

10.25394/pgs.26018863.v1

Autonomous agents and multiagent systems

Intelligent robotics

Planning and decision making

Reinforcement Learning

Counterfactual Regret Minimization

Multi-agent Reinforcement Learning

Hierarchical Learning

Option Discovery

Skill Discovery

Identifer	oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/26018863
Date	13 June 2024
Creators	Jiayu Chen (18396687)
Source Sets	Purdue University
Detected Language	English
Type	Text, Thesis
Rights	CC BY-NC-SA 4.0
Relation	https://figshare.com/articles/thesis/Temporal_Abstractions_in_Multi-agent_Learning/26018863

Page generated in 0.0019 seconds

Temporal Abstractions in Multi-agent Learning

Description

Links & Downloads

Tags

Additional Fields