Projects: Innovationszentrum Mobiles Internet (InnoMI)
Number of entries: 15.
A
Altmann, Philipp ; Ritz, Fabian ; Feuchtinger, Leonard ; Nüßlein, Jonas ; Linnhoff-Popien, Claudia ; Phan, Thomy:
CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing. In: Elkind, Edith (ed.): Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23). - Vienna, Austria : International Joint Conferences on Artificial Intelligence Organization, 2023. - pp. 3414-3422. DOI: https://doi.org/10.24963/ijcai.2023/380
G
Gabor, Thomas ; Sedlmeier, Andreas ; Kiermeier, Marie ; Phan, Thomy ; Henrich, Marcel ; Pichlmair, Monika ; Kempter, Bernhard ; Klein, Cornel ; Sauer, Horst ; Schmid, Reiner ; Wieghardt, Jan:
Scenario Co-Evolution for Reinforcement Learning on a Grid World Smart Factory Domain. In: Proceedings of the Genetic and Evolutionary Computation Conference. - New York, NY, USA : Association for Computing Machinery, 2019. - pp. 898-906. - (ACM Conferences) DOI: https://doi.org/10.1145/3321707.3321831
Gabor, Thomas ; Peter, Jan ; Phan, Thomy ; Meyer, Christian ; Linnhoff-Popien, Claudia:
Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19). - International Joint Conferences on Artificial Intelligence Organization, 2019. - pp. 5562-5568. DOI: https://doi.org/10.24963/ijcai.2019/772
M
Müller, Robert ; Illium, Steffen ; Phan, Thomy ; Haider, Tom ; Linnhoff-Popien, Claudia:
Towards Anomaly Detection in Reinforcement Learning. In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS '22). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2022. - pp. 1799-1803. - (ACM Conferences) DOI: https://doi.org/10.5555/3535850.3536113
P
Phan, Thomy ; Sommer, Felix ; Ritz, Fabian ; Altmann, Philipp ; Nüßlein, Jonas ; Kölle, Michael ; Belzner, Lenz ; Linnhoff-Popien, Claudia:
Emergent Cooperation from Mutual Acknowledgment Exchange in Multi-Agent Reinforcement Learning. In: Autonomous Agents and Multi-Agent Systems. Vol. 38 (2024). - 34. DOI: https://doi.org/10.1007/s10458-024-09666-5
Phan, Thomy:
Emergence and Resilience in Multi-Agent Reinforcement Learning. Munich : Ludwig-Maximilians-Universität, 2023. - XIV, 69 pp. (Dissertation, 2023, Ludwig-Maximilians-Universität München) DOI: https://doi.org/10.5282/edoc.31981
Phan, Thomy ; Ritz, Fabian ; Altmann, Philipp ; Zorn, Maximilian ; Nüßlein, Jonas ; Kölle, Michael ; Gabor, Thomas ; Linnhoff-Popien, Claudia:
Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability. In: Krause, Andreas ; Brunskill, Emma ; Cho, Kyunghyun ; Engelhardt, Barbara ; Sabato, Sivan ; Scarlett, Jonathan (eds.): Proceedings of the 40th International Conference on Machine Learning. - Red Hook, NY : Curran Associates, Inc., 2023. - pp. 27840-27853. - (Proceedings of Machine Learning Research ; 202)
Phan, Thomy ; Sommer, Felix ; Altmann, Philipp ; Ritz, Fabian ; Belzner, Lenz ; Linnhoff-Popien, Claudia:
Emergent Cooperation from Mutual Acknowledgment Exchange. In: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS '22). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2022. - pp. 1047-1055. - (ACM Conferences) DOI: https://doi.org/10.5555/3535850.3535967
Phan, Thomy ; Belzner, Lenz ; Gabor, Thomas ; Sedlmeier, Andreas ; Ritz, Fabian ; Linnhoff-Popien, Claudia:
Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition. In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 35 (2021), No. 13. - pp. 11308-11316. DOI: https://doi.org/10.1609/aaai.v35i13.17348
Phan, Thomy ; Ritz, Fabian ; Belzner, Lenz ; Altmann, Philipp ; Gabor, Thomas ; Linnhoff-Popien, Claudia:
VAST: Value Function Factorization with Variable Agent Sub-Teams. In: Ranzato, Marc'Aurelio ; Beygelzimer, A. ; Dauphin, Y. ; Liang, P. S. ; Vaughan, J. Wortman (eds.): Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021). - Red Hook, NY : Curran Associates, Inc., 2021. - pp. 24018-24032. - (Advances in Neural Information Processing Systems ; 34)
Phan, Thomy ; Gabor, Thomas ; Sedlmeier, Andreas ; Ritz, Fabian ; Kempter, Bernhard ; Klein, Cornel ; Sauer, Horst ; Schmid, Reiner ; Wieghardt, Jan ; Zeller, Marc ; Linnhoff-Popien, Claudia:
Learning and Testing Resilience in Cooperative Multi-Agent Systems. In: Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '20). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2020. - pp. 1055-1063. - (ACM Conferences) DOI: https://doi.org/10.5555/3398761.3398884
Phan, Thomy ; Gabor, Thomas ; Müller, Robert ; Roch, Christoph ; Linnhoff-Popien, Claudia:
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19). - International Joint Conferences on Artificial Intelligence Organization, 2019. - pp. 5607-5613. DOI: https://doi.org/10.24963/ijcai.2019/778
Phan, Thomy ; Schmid, Kyrill ; Belzner, Lenz ; Gabor, Thomas ; Feld, Sebastian ; Linnhoff-Popien, Claudia:
Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '19). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2019. - pp. 2162-2164. - (ACM Conferences) DOI: https://doi.org/10.5555/3306127.3332044
Phan, Thomy ; Belzner, Lenz ; Kiermeier, Marie ; Friedrich, Markus ; Schmid, Kyrill ; Linnhoff-Popien, Claudia:
Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling. In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 33 (2019), No. 1. - pp. 7941-7948. DOI: https://doi.org/10.1609/aaai.v33i01.33017941
Phan, Thomy ; Belzner, Lenz ; Gabor, Thomas ; Schmid, Kyrill:
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation. In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '18). - Richland, SC : International Foundation for Autonomous Agents and Multiagent Systems, 2018. - pp. 730-738. - (ACM Conferences) DOI: https://doi.org/10.5555/3237383.3237491