6 Papers
25 Citations
Xiaobin Wang is an academic researcher from University of Electronic Science and Technology of China. The author has contributed to research in topics: Yen's algorithm & Recurrent neural network. The author has an hindex of 4, co-authored 4 publications.
Chat about Author
Papers
Switching analysis of 2-D neural networks with nonsaturating linear threshold transfer functions
Hong Qu,Zhang Yi,Xiaobin Wang +2 more
TL;DR: Dynamical properties of the equilibria of two-dimensional networks are analyzed theoretically and it is proved by mathematics that the given conditions can drive the network (two-dimensional) converging to the different global stable stationary points, or the different multistable stationary points.
10
A Winner-Take-All Neural Networks of N Linear Threshold Neurons without Self-Excitatory Connections
Hong Qu,Zhang Yi,Xiaobin Wang +2 more
TL;DR: It is proved by mathematics that the proposed model is Non-Divergent, which means this network performs a Winner-Take-All behavior, which has been recognized as a basic computational model done in brain.
8
A novel neural network method for shortest path tree computation
Hong Qu,Simon X. Yang,Zhang Yi,Xiaobin Wang +3 more
- 01 Oct 2012
TL;DR: An efficient modified continued pulse coupled neural network (MCPCNN) model for SPT computation in a large scale instance is presented and it proves that the generated wave in the network spreads outward with travel times proportional to the connection weight between neurons.
7
Solving Poker Games Efficiently: Adaptive Memory based Deep Counterfactual Regret Minimization
Shuqing Shi,Xiaobin Wang,Dong Hao,Zhi-Xuan Yang,Hong Qu +4 more
- 18 Jul 2022
TL;DR: The adaptive memory sampling method is proposed which aims to find the distribution of the sampling length by using posterior sampling to update it iteratively and performs better than the state-of-the-art algorithms.
1
Journal Article
Double Thompson Sampling in Finite stochastic Games
TL;DR: In this article , a double Thompson sampling reinforcement learning (DTS) algorithm was proposed to solve the trade-off problem between exploration and exploitation under finite discounted Markov Decision Process, where the state transition matrix of the underlying environment stays unknown.