Literatur vom gleichen Autor/der gleichen Autor*in
plus bei Google Scholar

Bibliografische Daten exportieren
 

Spatially Grouped Curriculum Learning for Multi-Agent Path Finding

Titelangaben

Phan, Thomy ; Koenig, Sven:
Spatially Grouped Curriculum Learning for Multi-Agent Path Finding.
In: Proceedings of the AAAI Conference on Artificial Intelligence. Bd. 40 (2026) Heft 35 . - S. 29642-29650.
ISSN 2159-5399
DOI: https://doi.org/10.1609/aaai.v40i35.40208

Volltext

Link zum Volltext (externe URL): Volltext

Angaben zu Projekten

Projekttitel:
Offizieller Projekttitel
Projekt-ID
AI Research Institute for Advances in Optimization
2112533
Causal Foundations for Decision Making and Learning
2321786

Projektfinanzierung: National Science Foundation
Amazon Robotics
Donald Bren Foundation

Abstract

Multi-agent path finding (MAPF) is the challenging problem of finding conflict-free paths with minimal costs for multiple agents. While traditional MAPF solvers are centralized using heuristic search, reinforcement learning (RL) is becoming increasingly popular due to its potential to learn decentralized and generalizing policies. RL-based MAPF must cope with spatial coordination, which is often addressed by combining independent training with ad hoc measures like replanning and communication. Such ad hoc measures often complicate the approach and require knowledge beyond the actual accessible information in RL, such as the full map occupation or broadcast communication channels, which limits generalizability, effectiveness, and sample efficiency. In this paper, we propose Partitioned Attention-based Reverse Curricula for Enhanced Learning (PARCEL), considering a bounding region for each agent. PARCEL trains all agents with overlapping regions jointly via self-attention to avoid potential conflicts. By employing a reverse curriculum, where the bounding regions grow as the policies improve, all agents will eventually merge into a single coordinated group. We evaluate PARCEL in two simple coordination tasks and four MAPF benchmark maps. Compared with state-of-the-art RL-based MAPF methods, PARCEL demonstrates better effectiveness and sample efficiency without ad hoc measures.

Weitere Angaben

Publikationsform: Artikel in einer Zeitschrift
Begutachteter Beitrag: Ja
Institutionen der Universität: Fakultäten > Fakultät für Mathematik, Physik und Informatik > Institut für Informatik > Juniorprofessur Künstliche Intelligenz und Maschinelles Lernen
Fakultäten > Fakultät für Mathematik, Physik und Informatik > Institut für Informatik > Juniorprofessur Künstliche Intelligenz und Maschinelles Lernen > Juniorprofessur Künstliche Intelligenz und Maschinelles Lernen - Juniorprof. Dr. Thomy Phan
Titel an der UBT entstanden: Ja
Themengebiete aus DDC: 000 Informatik,Informationswissenschaft, allgemeine Werke > 004 Informatik
Eingestellt am: 04 Mai 2026 06:57
Letzte Änderung: 04 Mai 2026 06:57
URI: https://eref.uni-bayreuth.de/id/eprint/96963