Dan Xiao
Nanyang Technological University
5 Papers
21 Citations
Dan Xiao is an academic researcher from Nanyang Technological University. The author has contributed to research in topics: Reinforcement learning & Multi-agent system. The author has an hindex of 4, co-authored 5 publications.
Chat about Author
Papers
Integrating Temporal Difference Methods and Self-Organizing Neural Networks for Reinforcement Learning With Delayed Evaluative Feedback
Ah-Hwee Tan,Ning Lu,Dan Xiao +2 more
TL;DR: The proposed neural model, called TD fusion architecture for learning, cognition, and navigation (TD-FALCON), enables an autonomous agent to adapt and function in a dynamic environment with immediate as well as delayed evaluative feedback (reinforcement) signals.
140
Self-Organizing Neural Architectures and Cooperative Learning in a Multiagent Environment
Dan Xiao,Ah-Hwee Tan +1 more
- 01 Dec 2007
TL;DR: It is demonstrated that TD-FALCON agent teams are able to adapt and function well in a multiagent environment without an explicit mechanism of collaboration, and also outperforms the RPROP-based reinforcement learners in terms of both task completion rate and learning efficiency.
23
Cooperative reinforcement learning in topology-based multi-agent systems
Dan Xiao,Ah-Hwee Tan +1 more
TL;DR: This paper proposes a cooperative learning strategy, under which autonomous agents are assembled in a binary tree formation (BTF), and applies it to a class of reinforcement learning agents known as temporal difference-fusion architecture for learning and cognition (TD-FALCON).
14
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
Dan Xiao,Ah-Hwee Tan +1 more
- 09 Dec 2008
TL;DR: In this article, two strategies, i.e., policy sharing and neighboring-agent mechanism, are presented to further improve the learning efficiency of TD-FALCON in complex multi-agent domains.
8
Scaling Up Multi-agent Reinforcement Learning in Complex Domains.
Dan Xiao,Ah-Hwee Tan +1 more
- 01 Jan 2008
TL;DR: This paper presents two strategies, i.e. policy sharing and neighboring-agent mechanism, to further improve the learning efficiency of TD-FALCON in complex multi-agent domains.
5