Proceedings Article10.1109/ICRA48891.2023.10160342
MPOGames: Efficient Multimodal Partially Observable Dynamic Games
Oswin So,Paul C. Drews,Thomas A. Balch,Velin Dimitrov,Guy Rosman,Evangelos A. Theodorou +5 more
- 19 Oct 2022
pp 3189-3196
3
TL;DR: This work proposes MPOGames, a method for efficiently solving MaxEnt dynamic games that captures the interactions between local Nash equilibria and shows the importance of uncertainty-aware game theoretic methods via a two-agent merge case study.
read more
Abstract: Game theoretic methods have become popular for planning and prediction in situations involving rich multi-agent interactions. However, these methods often assume the existence of a single local Nash equilibria and are hence unable to handle uncertainty in the intentions of different agents. While maximum entropy (MaxEnt) dynamic games try to address this issue, practical approaches solve for MaxEnt Nash equilibria using linear-quadratic approximations which are restricted to unimodal responses and unsuitable for scenarios with multiple local Nash equilibria. By reformulating the problem as a POMDP, we propose MPOGames, a method for efficiently solving MaxEnt dynamic games that captures the interactions between local Nash equilibria. We show the importance of uncertainty-aware game theoretic methods via a two-agent merge case study. Finally, we prove the real-time capabilities of our approach with hardware experiments on a 1/10th scale car platform.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Blending Data-Driven Priors in Dynamic Games
Justin M. Lidard,Haimin Hu,Asher Hancock,Zixu Zhang,Albert Gim'o Contreras,Vikash Modi,Jonathan A. DeCastro,Deepak E Gopinath,Guy Rosman,Naomi Leonard,Mar'ia Santos,J. F. Fisac +11 more
TL;DR: Through a series of simulated and real-world autonomous driving scenarios, it is demonstrated that KLGame policies can more effectively incorporate guidance from the reference policy and account for noisily-rational human behaviors versus non-regularized baselines.
2
Contingency Games for Multi-Agent Interaction
Lasse Peters,Andrea Bajcsy,Chih-Yuan Chiu,David Fridovich-Keil,Forrest Laine,Laura Ferranti,Javier Alonso-Mora +6 more
TL;DR: In this paper , the authors take a game-theoretic perspective on contingency planning which is tailored to multi-agent scenarios in which a robot's actions impact the decisions of other agents and vice versa.
Multi-Robot Communication-Aware Cooperative Belief Space Planning with Inconsistent Beliefs: An Action-Consistent Approach
Tanmoy Kundu,Moshe Rafaeli,Vadim Indelman +2 more
TL;DR: Multi-robot communication-aware cooperative belief space planning with inconsistent beliefs: An action-consistent approach finds a consistent joint action despite inconsistent beliefs, improving coordination and safety.
References
CasADi: a software framework for nonlinear optimization and optimal control
TL;DR: This article gives an up-to-date and accessible introduction to the CasADi framework, which has undergone numerous design improvements over the last 7 years.
•Proceedings Article
Maximum entropy inverse reinforcement learning
Brian D. Ziebart,Andrew L. Maas,J. Andrew Bagnell,Anind K. Dey +3 more
- 13 Jul 2008
TL;DR: A probabilistic approach based on the principle of maximum entropy that provides a well-defined, globally normalized distribution over decision sequences, while providing the same performance guarantees as existing methods is developed.
On the rationale of maximum-entropy methods
E. T. Jaynes
- 01 Sep 1982
TL;DR: The relations between maximum-entropy (MAXENT) and other methods of spectral analysis such as the Schuster, Blackman-Tukey, maximum-likelihood, Bayesian, and Autoregressive models are discussed, emphasizing that they are not in conflict, but rather are appropriate in different problems.
1.7K
Learning policies for partially observable environments: scaling up
Michael L. Littman,Anthony R. Cassandra,Leslie Pack Kaelbling +2 more
- 01 Oct 1997
TL;DR: This paper discusses several simple solution methods and shows that all are capable of finding near- optimal policies for a selection of extremely small POMDP'S taken from the learning literature, but shows that none are able to solve a slightly larger and noisier problem based on robot navigation.
820