State Alignment-based Imitation Learning

Open Access•Proceedings Article

- 30 Apr 2020

26

TL;DR: This work proposes a state alignment-based imitation learning method that trains the imitator to follow the state sequences in the expert demonstrations as closely as possible, and combines local and global state alignment in a reinforcement learning framework through a regularized policy update objective.

Citations

•Posted Content

Transfer Learning in Deep Reinforcement Learning: A Survey

Zhuangdi Zhu, +2 more

- 16 Sep 2020

- arXiv: Learning

TL;DR: This survey reviews transfer learning in the reinforcement learning setting and provides a systematic categorization of its state-of-the-art techniques.

407

•Posted Content

State-Only Imitation Learning for Dexterous Manipulation

Ilija Radosavovic, +3 more

- 07 Apr 2020

- arXiv: Robotics

TL;DR: This paper trains an inverse dynamics model and uses it to predict actions for state-only demonstrations. The resulting method considerably outperforms RL alone and can learn from demonstrations with different dynamics, morphologies, and objects.

99

•Book Chapter•10.1007/978-3-031-19842-7_33

DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

Yuzhe Qin, +6 more

- 01 Jan 2022

TL;DR: This work proposes DexMV (Dexterous Manipulation from Videos), an imitation learning platform that pairs a simulation system for complex dexterous manipulation tasks with a multi-finger robot hand and a computer vision system that records large-scale demonstrations of a human hand performing the same tasks.

52

•Journal Article•10.1109/LRA.2021.3068912

Learning From Imperfect Demonstrations From Agents With Varying Dynamics

Zhangjie Cao, +1 more

- 25 Mar 2021

TL;DR: This article proposes a metric, composed of a feasibility score and an optimality score, that measures how useful a demonstration is for imitation learning, so that the learner can rely on the more informative demonstrations and disregard the less relevant ones.

28

•Posted Content

ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations

Tongzhou Mu, +8 more

- 30 Jul 2021

- arXiv: Learning

TL;DR: The SAPIEN Manipulation Skill Benchmark (ManiSkill) benchmarks 3D object manipulation skills in a full-physics simulator, using objects with large intra-class topological and geometric variations.

21

References

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

48.6K

•Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

- 01 Jan 2014

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

28.9K

•Posted Content

Proximal Policy Optimization Algorithms

John Schulman, +4 more

- 20 Jul 2017

- arXiv: Learning

TL;DR: A new family of policy gradient methods for reinforcement learning is proposed, which alternate between sampling data through interaction with the environment and optimizing a "surrogate" objective function using stochastic gradient ascent.

18K

•Book

Optimal Transport: Old and New

Cédric Villani

- 02 Jan 2013

TL;DR: This book provides a detailed description of the basic properties of optimal transport, including cyclical monotonicity and Kantorovich duality, along with three examples of coupling techniques.

7.4K

•Proceedings Article•10.1109/IROS.2012.6386109

MuJoCo: A physics engine for model-based control

Emanuel Todorov, +2 more

- 24 Dec 2012

TL;DR: A new physics engine tailored to model-based control. It is based on the modern velocity-stepping approach, which avoids the difficulties with spring-dampers, and it can compute both forward and inverse dynamics.

6.4K