MPOGames: Efficient Multimodal Partially Observable Dynamic Games

doi:10.1109/ICRA48891.2023.10160342

Proceedings Article10.1109/ICRA48891.2023.10160342

MPOGames: Efficient Multimodal Partially Observable Dynamic Games

Oswin So, +5 more

- 19 Oct 2022

pp 3189-3196

3

TL;DR: This work proposes MPOGames, a method for efﬁciently solving MaxEnt dynamic games that captures the interactions between local Nash equilibria and shows the importance of uncertainty-aware game theoretic methods via a two-agent merge case study.

Abstract: Game theoretic methods have become popular for planning and prediction in situations involving rich multi-agent interactions. However, these methods often assume the existence of a single local Nash equilibria and are hence unable to handle uncertainty in the intentions of different agents. While maximum entropy (MaxEnt) dynamic games try to address this issue, practical approaches solve for MaxEnt Nash equilibria using linear-quadratic approximations which are restricted to unimodal responses and unsuitable for scenarios with multiple local Nash equilibria. By reformulating the problem as a POMDP, we propose MPOGames, a method for efficiently solving MaxEnt dynamic games that captures the interactions between local Nash equilibria. We show the importance of uncertainty-aware game theoretic methods via a two-agent merge case study. Finally, we prove the real-time capabilities of our approach with hardware experiments on a 1/10th scale car platform.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arxiv.2402.14174

Blending Data-Driven Priors in Dynamic Games

Justin M. Lidard, +11 more

- 21 Feb 2024

- arXiv.org

TL;DR: Through a series of simulated and real-world autonomous driving scenarios, it is demonstrated that KLGame policies can more effectively incorporate guidance from the reference policy and account for noisily-rational human behaviors versus non-regularized baselines.

...read moreread less

2

Journal Article•10.48550/arXiv.2304.05483

Contingency Games for Multi-Agent Interaction

Lasse Peters, +6 more

- 11 Apr 2023

- arXiv.org

TL;DR: In this paper , the authors take a game-theoretic perspective on contingency planning which is tailored to multi-agent scenarios in which a robot's actions impact the decisions of other agents and vice versa.

...read moreread less

Journal Article•10.48550/arxiv.2403.05962

Multi-Robot Communication-Aware Cooperative Belief Space Planning with Inconsistent Beliefs: An Action-Consistent Approach

Tanmoy Kundu, +2 more

- 09 Mar 2024

- arXiv.org

TL;DR: Multi-robot communication-aware cooperative belief space planning with inconsistent beliefs: An action-consistent approach finds a consistent joint action despite inconsistent beliefs, improving coordination and safety.

...read moreread less

References

•Journal Article•10.1007/S12532-018-0139-4

CasADi: a software framework for nonlinear optimization and optimal control

Joel Andersson, +4 more

- 20 Mar 2019

- Mathematical Programming Computation

TL;DR: This article gives an up-to-date and accessible introduction to the CasADi framework, which has undergone numerous design improvements over the last 7 years.

...read moreread less

3.5K

•Proceedings Article

Maximum entropy inverse reinforcement learning

Brian D. Ziebart, +3 more

- 13 Jul 2008

TL;DR: A probabilistic approach based on the principle of maximum entropy that provides a well-defined, globally normalized distribution over decision sequences, while providing the same performance guarantees as existing methods is developed.

...read moreread less

3.1K

•Book

Individual Choice Behavior: A Theoretical Analysis

R. Duncan Luce

- 25 Sep 1979

2.5K

Journal Article•10.1109/PROC.1982.12425

On the rationale of maximum-entropy methods

E. T. Jaynes

- 01 Sep 1982

TL;DR: The relations between maximum-entropy (MAXENT) and other methods of spectral analysis such as the Schuster, Blackman-Tukey, maximum-likelihood, Bayesian, and Autoregressive models are discussed, emphasizing that they are not in conflict, but rather are appropriate in different problems.

...read moreread less

1.7K

Book Chapter•10.1016/B978-1-55860-377-6.50052-9

Learning policies for partially observable environments: scaling up

Michael L. Littman, +2 more

- 01 Oct 1997

TL;DR: This paper discusses several simple solution methods and shows that all are capable of finding near- optimal policies for a selection of extremely small POMDP'S taken from the learning literature, but shows that none are able to solve a slightly larger and noisier problem based on robot navigation.

...read moreread less

820

...

Expand