ERef Bayreuth

Anmelden

Projekte: Innovationszentrum Mobiles Internet (InnoMI)

Eine Ebene nach oben ...

Gruppieren nach: Jahr | Person

Springe zu: A | G | M | P

Anzahl der Einträge: 15.

A

Altmann, Philipp ; Ritz, Fabian ; Feuchtinger, Leonard ; Nüßlein, Jonas ; Linnhoff-Popien, Claudia ; Phan, Thomy:
CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing.
In: Elkind, Edith (Hrsg.): Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23). - Vienna, Austria : International Joint Conferences on Artificial Intelligence Organization, 2023 . - S. 3414-3422
DOI: https://doi.org/10.24963/ijcai.2023/380

G

Gabor, Thomas ; Sedlmeier, Andreas ; Kiermeier, Marie ; Phan, Thomy ; Henrich, Marcel ; Pichlmair, Monika ; Kempter, Bernhard ; Klein, Cornel ; Sauer, Horst ; Schmid, Reiner ; Wieghardt, Jan:
Scenario Co-Evolution for Reinforcement Learning on a Grid World Smart Factory Domain.
In: Proceedings of the Genetic and Evolutionary Computation Conference. - New York, NY, USA : Association for Computing Machinery, 2019 . - S. 898-906 . - (ACM Conferences )
DOI: https://doi.org/10.1145/3321707.3321831

Gabor, Thomas ; Peter, Jan ; Phan, Thomy ; Meyer, Christian ; Linnhoff-Popien, Claudia:
Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search.
In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19). - International Joint Conferences on Artificial Intelligence Organization, 2019 . - S. 5562-5568
DOI: https://doi.org/10.24963/ijcai.2019/772

M

Müller, Robert ; Illium, Steffen ; Phan, Thomy ; Haider, Tom ; Linnhoff-Popien, Claudia:
Towards Anomaly Detection in Reinforcement Learning.
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS '22). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2022 . - S. 1799-1803 . - (ACM Conferences )
DOI: https://doi.org/10.5555/3535850.3536113

P

Phan, Thomy ; Sommer, Felix ; Ritz, Fabian ; Altmann, Philipp ; Nüßlein, Jonas ; Kölle, Michael ; Belzner, Lenz ; Linnhoff-Popien, Claudia:
Emergent Cooperation from Mutual Acknowledgment Exchange in Multi-Agent Reinforcement Learning.
In: Autonomous Agents and Multi-Agent Systems. Bd. 38 (2024) . - 34.
DOI: https://doi.org/10.1007/s10458-024-09666-5

Phan, Thomy:
Emergence and Resilience in Multi-Agent Reinforcement Learning.
München : Ludwig-Maximilians-Universität , 2023 . - XIV, 69 S.
( Dissertation, 2023 , Ludwig-Maximilians-Universität München)
DOI: https://doi.org/10.5282/edoc.31981

Phan, Thomy ; Ritz, Fabian ; Altmann, Philipp ; Zorn, Maximilian ; Nüßlein, Jonas ; Kölle, Michael ; Gabor, Thomas ; Linnhoff-Popien, Claudia:
Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability.
In: Krause, Andreas ; Brunskill, Emma ; Cho, Kyunghyun ; Engelhardt, Barbara ; Sabato, Sivan ; Scarlett, Jonathan (Hrsg.): Proceedings of the 40th International Conference on Machine Learning. - Red Hook, NY : Curran Associates, Inc., 2023 . - S. 27840-27853 . - (Proceedings of Machine Learning Research ; 202 )

Phan, Thomy ; Sommer, Felix ; Altmann, Philipp ; Ritz, Fabian ; Belzner, Lenz ; Linnhoff-Popien, Claudia:
Emergent Cooperation from Mutual Acknowledgment Exchange.
In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS '22). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2022 . - S. 1047-1055 . - (ACM Conferences )
DOI: https://doi.org/10.5555/3535850.3535967

Phan, Thomy ; Belzner, Lenz ; Gabor, Thomas ; Sedlmeier, Andreas ; Ritz, Fabian ; Linnhoff-Popien, Claudia:
Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition.
In: Proceedings of the AAAI Conference on Artificial Intelligence. Bd. 35 (2021) Heft 13 . - S. 11308-11316.
DOI: https://doi.org/10.1609/aaai.v35i13.17348

Phan, Thomy ; Ritz, Fabian ; Belzner, Lenz ; Altmann, Philipp ; Gabor, Thomas ; Linnhoff-Popien, Claudia:
VAST: Value Function Factorization with Variable Agent Sub-Teams.
In: Ranzato, Marc'Aurelio ; Beygelzimer, A. ; Dauphin, Y. ; Liang, P. S. ; Vaughan, J. Wortman (Hrsg.): Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021). - Red Hook, NY : Curran Associates, Inc., 2021 . - S. 24018-24032 . - (Advances in Neural Information Processing Systems ; 34 )

Phan, Thomy ; Gabor, Thomas ; Sedlmeier, Andreas ; Ritz, Fabian ; Kempter, Bernhard ; Klein, Cornel ; Sauer, Horst ; Schmid, Reiner ; Wieghardt, Jan ; Zeller, Marc ; Linnhoff-Popien, Claudia:
Learning and Testing Resilience in Cooperative Multi-Agent Systems.
In: Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '20). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2020 . - S. 1055-1063 . - (ACM Conferences )
DOI: https://doi.org/10.5555/3398761.3398884

Phan, Thomy ; Gabor, Thomas ; Müller, Robert ; Roch, Christoph ; Linnhoff-Popien, Claudia:
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning.
In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19). - International Joint Conferences on Artificial Intelligence Organization, 2019 . - S. 5607-5613
DOI: https://doi.org/10.24963/ijcai.2019/778

Phan, Thomy ; Schmid, Kyrill ; Belzner, Lenz ; Gabor, Thomas ; Feld, Sebastian ; Linnhoff-Popien, Claudia:
Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies.
In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '19). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2019 . - S. 2162-2164 . - (ACM Conferences )
DOI: https://doi.org/10.5555/3306127.3332044

Phan, Thomy ; Belzner, Lenz ; Kiermeier, Marie ; Friedrich, Markus ; Schmid, Kyrill ; Linnhoff-Popien, Claudia:
Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling.
In: Proceedings of the AAAI Conference on Artificial Intelligence. Bd. 33 (2019) Heft 1 . - S. 7941-7948.
DOI: https://doi.org/10.1609/aaai.v33i01.33017941

Phan, Thomy ; Belzner, Lenz ; Gabor, Thomas ; Schmid, Kyrill:
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation.
In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '18). - Richland,SC : International Foundation for Autonomous Agents and Multiagent Systems, 2018 . - S. 730-738 . - (ACM Conferences )
DOI: https://doi.org/10.5555/3237383.3237491

Diese Liste wurde am Wed Jul 29 16:16:41 2026 CEST generiert.
[Zum Seitenanfang]

Zum Einbinden der Liste in das CMS beachten Sie bitte die Hinweise auf dieser Hilfeseite.
Das ERef-Team hilft Ihnen selbstverständlich auch gerne bei der Ermittlung der korrekten URL weiter.