View estimation learning based on value system

doi:10.1109/FUZZY.2009.5277378

Proceedings Article

Yasutake Takahashi, +2 more

- 20 Aug 2009

- pp 939-944

TL;DR: Experiments with simple humanoid robots show the validity of the method, and the parallel between this developmental process and young children's estimation of their own view while imitating behavior demonstrated by the caregiver is discussed.

Abstract: Estimating a caregiver's view is one of the most important capabilities a child needs in order to understand the behavior the caregiver demonstrates, that is, to infer the intention behind the behavior and/or to learn the observed behavior efficiently. We hypothesize that the child develops this ability in the same way as behavior learning motivated by an intrinsic reward: while imitating the behavior observed in the caregiver's demonstration, the child updates his/her own model of the estimated view so as to minimize the estimation error of the reward during the behavior. From this viewpoint, this paper presents a method for acquiring such a capability based on a value system, in which values are obtained by reinforcement learning. The parameters of the view estimation are updated based on the temporal difference error (hereafter TD error: the estimation error of the state value), analogous to the way the parameters of the state value of the behavior are updated based on the TD error. Experiments with simple humanoid robots show the validity of the method, and the parallel between this developmental process and young children's estimation of their own view while imitating the behavior demonstrated by the caregiver is discussed.
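The analogy in the abstract between updating state values and updating view-estimation parameters from the same TD error can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the five-state chain task, the single scalar view parameter `theta` (a hypothetical offset between observed and own-view positions), the linear interpolation of the value table, and the use of finite-difference gradient descent on the squared TD error as the parameter update.

```python
GAMMA = 0.9
N_STATES = 5  # toy chain: states 0..4, reward 1 on entering state 4 (terminal)


def learn_values(episodes=500, alpha=0.1):
    """Tabular TD(0): V[s] += alpha * (r + gamma * V[s'] - V[s])."""
    V = [0.0] * N_STATES
    for _ in range(episodes):
        for s in range(N_STATES - 1):
            s_next = s + 1
            r = 1.0 if s_next == N_STATES - 1 else 0.0
            td_error = r + GAMMA * V[s_next] - V[s]
            V[s] += alpha * td_error
    return V


def value_at(V, x):
    """Linearly interpolate the value table at a continuous position x."""
    x = min(max(x, 0.0), len(V) - 1.0)
    lo = min(int(x), len(V) - 2)
    frac = x - lo
    return (1.0 - frac) * V[lo] + frac * V[lo + 1]


def squared_td_loss(V, observed, rewards, theta):
    """Sum of squared TD errors along an observed trajectory.

    theta is a hypothetical view parameter: the estimated own-view
    position is the observed position minus theta.
    """
    est = [x - theta for x in observed]
    return sum(
        (r + GAMMA * value_at(V, est[t + 1]) - value_at(V, est[t])) ** 2
        for t, r in enumerate(rewards)
    )


def estimate_view_offset(V, observed, rewards, theta=0.0, lr=0.1,
                         steps=200, h=1e-4):
    """Adjust theta by gradient descent on the squared TD error.

    The gradient is approximated by central finite differences; this
    update rule is an illustrative stand-in, not the paper's rule.
    """
    for _ in range(steps):
        grad = (squared_td_loss(V, observed, rewards, theta + h)
                - squared_td_loss(V, observed, rewards, theta - h)) / (2 * h)
        theta -= lr * grad
    return theta


# Demonstration seen from a shifted viewpoint (assumed true offset 0.5).
TRUE_OFFSET = 0.5
observed = [s + TRUE_OFFSET for s in range(N_STATES)]
rewards = [0.0, 0.0, 0.0, 1.0]

V = learn_values()
theta_hat = estimate_view_offset(V, observed, rewards)
```

In this sketch the value table is learned first from the agent's own behavior, and the same TD error then drives the view parameter while the value table is held fixed. Whether the paper interleaves or separates these two updates is not stated in the abstract.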

Citations

•Dissertation

Cooperation and Interaction between Human and Humanoid Robots through Integration of Symbolic Expressions and Sensorimotor Patterns

敬丞奥野, +1 more

- 28 Sep 2012

References

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1998

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

39.7K

•Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

- 01 Mar 1998

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

7.7K

•Journal Article•10.1038/1124

Dopamine neurons report an error in the temporal prediction of reward during learning.

Jeffrey R. Hollerman, +1 more

- 01 Aug 1998

- Nature Neuroscience

TL;DR: Dopamine neuron responses reflected the changes in reward prediction during individual learning episodes; dopamine neurons were activated by rewards during early trials, but activation was progressively reduced as performance was consolidated and rewards became more predictable.

1.2K

•Journal Article•10.1111/J.1467-7687.2007.00574.X

'Like me': a foundation for social cognition.

Andrew N. Meltzoff

- 01 Jan 2007

- Developmental Science

TL;DR: The 'like me' nature of others is the starting point for social cognition, not its culmination.

793

•Journal Article•10.1098/RSTB.2002.1258

Computational approaches to motor learning by imitation

Stefan Schaal, +3 more

- 29 Mar 2003

- Philosophical Transactions of the Royal ...

TL;DR: This paper will primarily emphasize the motor side of imitation, assuming that a perceptual system has already identified important features of a demonstrated movement and created their corresponding spatial information.

711

Related Papers (5)

View Estimation Based on Value System

How does our motor system determine its learning rate

Control of the error signals in negative correlation learning

Temporal Coherence and Prediction Decay in TD Learning

An analysis of experience replay in temporal difference learning