Steven Yang
1 Papers
4 Citations
Steven Yang is an academic researcher. The author has contributed to research in topics: Stability (learning theory) & Q-learning. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
•Posted Content
Distributional Advantage Actor-Critic
TL;DR: This paper develops a new algorithm that combines advantage actor-critic with value distribution estimated by quantile regression, and evaluated this new algorithm, termed Distributional Advantage Actor-Critic (DA2C or QR-A2C), to achieve at least as good as baseline algorithms, and outperforming baseline in some tasks with smaller variance and increased stability.
11