Literature by the same author
plus at Google Scholar

Bibliografische Daten exportieren
 

Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling

Title data

Phan, Thomy ; Belzner, Lenz ; Kiermeier, Marie ; Friedrich, Markus ; Schmid, Kyrill ; Linnhoff-Popien, Claudia:
Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling.
In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 33 (2019) Issue 1 . - pp. 7941-7948.
ISSN 2159-5399
DOI: https://doi.org/10.1609/aaai.v33i01.33017941

Official URL: Volltext

Project information

Project title:
Project's official title
Project's id
Innovationszentrum Mobiles Internet (InnoMI)
No information

Project financing: Bayerisches Staatsministerium für Wirtschaft, Infrastruktur, Verkehr und Technologie

Abstract in another language

State-of-the-art approaches to partially observable planning like POMCP are based on stochastic tree search. While these approaches are computationally efficient, they may still construct search trees of considerable size, which could limit the performance due to restricted memory resources. In this paper, we propose Partially Observable Stacked Thompson Sampling (POSTS), a memory bounded approach to openloop planning in large POMDPs, which optimizes a fixed size stack of Thompson Sampling bandits. We empirically evaluate POSTS in four large benchmark problems and compare its performance with different tree-based approaches. We show that POSTS achieves competitive performance compared to tree-based open-loop planning and offers a performancememory tradeoff, making it suitable for partially observable planning with highly restricted computational and memory resources.

Further data

Item Type: Article in a journal
Refereed: Yes
Keywords: Planning; Plannung under Uncertainty; POMDP
Institutions of the University: Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Computer Science
Faculties
Faculties > Faculty of Mathematics, Physics und Computer Science
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Computer Science > Junior Professor Artificial Intelligence and Machine Learning > Junior Professor Artificial Intelligence and Machine Learning - Juniorprof. Dr. Thomy Phan
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Computer Science > Junior Professor Artificial Intelligence and Machine Learning
Result of work at the UBT: No
DDC Subjects: 000 Computer Science, information, general works > 004 Computer science
Date Deposited: 17 Nov 2025 13:11
Last Modified: 25 Nov 2025 06:34
URI: https://eref.uni-bayreuth.de/id/eprint/95250