Model-based Validation as Probabilistic Inference

doi:10.48550/arXiv.2305.09930

Proceedings Article10.48550/arXiv.2305.09930

Model-based Validation as Probabilistic Inference

Harrison Delecki, +2 more

- 17 May 2023

pp 825-837

4

TL;DR: In this paper , the authors estimate the distribution over failure trajectories for sequential systems as Bayesian inference using rollouts of system dynamics and computes trajectory gradients using automatic differentiation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2206.06009

Relative Policy-Transition Optimization for Fast Policy Transfer

Lei Han, +4 more

- 13 Jun 2022

- arXiv.org

TL;DR: A lemma based on existing theoretical results in reinforcement learning is introduced to measure the relativity between two arbitrary MDPs, that is the difference between any two cumulative expected returns defined on different policies and environment dynamics.

...read moreread less

Journal Article•10.1109/metroxraine58569.2023.10405725

The Role of System Modeling on Artificial Intelligence: A Review of Emerging Trends

Saad Aldoihi, +3 more

- 25 Oct 2023

TL;DR: The significance of system modeling in AI is explored, along with examples of its use in robotics, machine learning, and natural language processing, and the difficulties and possible directions in system modeling for AI are discussed.

...read moreread less

Journal Article•10.48550/arxiv.2404.03412

RADIUM: Predicting and Repairing End-to-End Robot Failures using Gradient-Accelerated Sampling

Charles Dawson, +2 more

- 04 Apr 2024

- arXiv.org

TL;DR: Radium is a framework for predicting and repairing end-to-end robot failures using gradient-accelerated sampling. It efficiently handles high-dimensional environmental parameters, includes vision in the loop, and provides guidance on how to mitigate failures.

...read moreread less

Journal Article•10.1109/lra.2024.3455782

Learning-based Bayesian Inference for Testing of Autonomous Systems

Anjali Parashar, +4 more

- 06 Sep 2024

- IEEE robotics and automation letters

TL;DR: This research introduces a novel sampling-based testing framework for autonomous systems, utilizing a discretized gradient-based second-order Langevin algorithm and learning-based techniques for constrained sampling of failure modes, improving failure prediction and feasibility.

...read moreread less

References

•Posted Content

Proximal Policy Optimization Algorithms

John Schulman, +4 more

- 20 Jul 2017

- arXiv: Learning

TL;DR: A new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent, are proposed.

...read moreread less

18K

•Journal Article•10.1103/PHYSREVE.62.1805

Congested traffic states in empirical observations and microscopic simulations

Martin Treiber, +2 more

- 01 Aug 2000

- Physical Review E

TL;DR: It is shown that the results of the microscopic model can be understood by formulating the theoretical phase diagram for bottlenecks in a more general way, and a local drop of the road capacity induced by parameter variations has essentially the same effect as an on-ramp.

...read moreread less

4.6K

•Book•10.1201/B10905

MCMC using Hamiltonian dynamics

Radford M. Neal

- 09 Jun 2012

- arXiv: Computation

TL;DR: In this paper, the authors discuss theoretical and practical aspects of Hamiltonian Monte Carlo, and present some of its variations, including using windows of states for deciding on acceptance or rejection, computing trajectories using fast approximations, tempering during the course of a trajectory to handle isolated modes, and short-cut methods that prevent useless trajectories from taking much computation time.

...read moreread less

3.3K

•Posted Content

The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo

Matthew D. Hoffman, +1 more

- 18 Nov 2011

- arXiv: Computation

TL;DR: The No-U-Turn Sampler (NUTS) as discussed by the authors is an extension to HMC that eliminates the need to set a number of steps L. NUTS uses a recursive algorithm to build a set of likely candidate points that spans a wide swath of the target distribution, stopping automatically when it starts to double back and retrace its steps.

...read moreread less

2.7K

•Journal Article•10.1111/J.1467-9868.2009.00736.X

Particle Markov chain Monte Carlo methods

Christophe Andrieu, +2 more

- 01 Jun 2010

- Journal of The Royal Statistical Society...

TL;DR: It is shown here how it is possible to build efficient high dimensional proposal distributions by using sequential Monte Carlo methods, which allows not only to improve over standard Markov chain Monte Carlo schemes but also to make Bayesian inference feasible for a large class of statistical models where this was not previously so.

...read moreread less

2.4K

...

Expand