The moving target function problem in multi-agent learning

doi:10.1109/ICMAS.1998.699075

Proceedings Article10.1109/ICMAS.1998.699075

The moving target function problem in multi-agent learning

José M. Vidal, +1 more

- 03 Jul 1998

- pp 317-324

34

TL;DR: A framework that can be used to model and predict the behavior of MASs with learning agents is described, which uses a difference equation for calculating the progression of an agent's error in its decision function to tell us how the agent is expected to fare in the MAS.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1007/S10458-005-2631-2

Cooperative Multi-Agent Learning: The State of the Art

Liviu Panait, +1 more

- 01 Nov 2005

- Autonomous Agents and Multi-Agent System...

TL;DR: This survey attempts to draw from multi-agent learning work in a spectrum of areas, including RL, evolutionary computation, game theory, complex systems, agent modeling, and robotics, and finds that this broad view leads to a division of the work into two categories.

...read moreread less

1.5K

Journal Article•10.1016/S1389-1286(00)00026-8

Dynamic pricing by software agents

Jeffrey O. Kephart, +2 more

- 30 May 2000

- Computer Networks

TL;DR: The potential impact of widespread shopbot usage on prices, the price dynamics that may ensue from various mixtures of automated pricing agents, the potential use of machine-learning algorithms to improve profits, and more generally the interplay among learning, optimization, and dynamics in agent-based information economies are studied.

...read moreread less

257

An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning

Michael Bowling, +1 more

- 01 Oct 2000

TL;DR: This paper contributes a comprehensive presentation of the relevant techniques for solving stochastic games from both the game theory community and reinforcement learning communities, and examines the assumptions and limitations of these algorithms.

...read moreread less

170

•Proceedings Article•10.1145/860575.860599

Resource allocation games with changing resource capacities

Aram Galstyan, +2 more

- 14 Jul 2003

TL;DR: The results indicate that for a certain range of parameters the system as a whole adapts effectively to the changing capacity levels and results in very little under- or over-utilization of the resources.

...read moreread less

68

•Journal Article•10.1023/A:1024133006761

Congregation Formation in Multiagent Systems

Christopher H. Brooks, +1 more

- 01 Jul 2003

- Autonomous Agents and Multi-Agent System...

TL;DR: This paper presents a formal model of a congregation and then applies Vidal and Durfee's CLRI framework to the affinity group domain, and shows that if agents are unable to describe congregations to each other, the problem of forming optimal congregations grows exponentially with the number of agents.

...read moreread less

62

...

Expand

References

•Proceedings Article

The dynamics of reinforcement learning in cooperative multiagent systems

Caroline Claus, +1 more

- 01 Jul 1998

TL;DR: This work distinguishes reinforcement learners that are unaware of (or ignore) the presence of other agents from those that explicitly attempt to learn the value of joint actions and the strategies of their counterparts, and proposes alternative optimistic exploration strategies that increase the likelihood of convergence to an optimal equilibrium.

...read moreread less

1.3K

•Book

An open agent architecture

Philip R. Cohen, +3 more

- 01 Oct 1997

TL;DR: The goal of this ongoing project is to develop an open agent architecture and accompanying user interface for networked desktop and handheld machines that support distributed execution of a user’s requests, interoperability of multiple application subsystems, addition of new agents, and incorporation of existing applications.

...read moreread less

577

•Journal Article•10.1016/S0004-3702(97)00028-3

On the emergence of social conventions: modeling, analysis, and simulations

Yoav Shoham, +1 more

- 15 Jul 1997

- Artificial Intelligence

TL;DR: This work introduces a simple and natural strategy-selection rule, called highest cumulative reward (HCR), and shows a class of games in which HCR guarantees eventual convergence to a rationally acceptable social convention.

...read moreread less

328

Journal Article•10.1049/IP-SEN:19971024