Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Multi-agent system
  4. 2012
  1. Home
  2. Topics
  3. Multi-agent system
  4. 2012
Showing papers on "Multi-agent system published in 2012"
Journal Article•10.1109/TAC.2011.2174666•
Distributed Event-Triggered Control for Multi-Agent Systems

[...]

Dimos V. Dimarogonas1, Emilio Frazzoli2, Karl Henrik Johansson1•
Royal Institute of Technology1, Massachusetts Institute of Technology2
01 May 2012-IEEE Transactions on Automatic Control
TL;DR: The controller updates considered here are event-driven, depending on the ratio of a certain measurement error with respect to the norm of a function of the state, and are applied to a first order agreement problem.
Abstract: Event-driven strategies for multi-agent systems are motivated by the future use of embedded microprocessors with limited resources that will gather information and actuate the individual agent controller updates. The controller updates considered here are event-driven, depending on the ratio of a certain measurement error with respect to the norm of a function of the state, and are applied to a first order agreement problem. A centralized formulation is considered first and then its distributed counterpart, in which agents require knowledge only of their neighbors' states for the controller implementation. The results are then extended to a self-triggered setup, where each agent computes its next update time at the previous one, without having to keep track of the state error that triggers the actuation between two consecutive update instants. The results are illustrated through simulation examples.

2,454 citations

Journal Article•10.1109/TAC.2011.2169618•
Cooperative Output Regulation of Linear Multi-Agent Systems

[...]

Youfeng Su1, Jie Huang1•
The Chinese University of Hong Kong1
01 Apr 2012-IEEE Transactions on Automatic Control
TL;DR: This technical note considers the cooperative output regulation of linear multi-agent systems and devising a distributed observer can solve the problem by a dynamic full information distributed control scheme.
Abstract: In this technical note, we consider the cooperative output regulation of linear multi-agent systems. The overall system consists of two groups of subsystems. While the first group of subsystems can access the exogenous signal, the second cannot. As a result, the problem cannot be solved by the decentralized approach. By devising a distributed observer, we can solve the problem by a dynamic full information distributed control scheme. The problem can also be viewed as a generalization of some results of the leader-following consensus problem of multi-agent systems.

588 citations

Journal Article•10.1109/TSMCB.2011.2179981•
Cooperative Output Regulation With Application to Multi-Agent Consensus Under Switching Network

[...]

Youfeng Su1, Jie Huang1•
The Chinese University of Hong Kong1
1 Jun 2012
TL;DR: This paper considers the cooperative output regulation of linear multi-agent systems under switching network and develops a distributed observer network that can solve the problem by both dynamic state feedback control and dynamic measurement output feedback control.
Abstract: In this paper, we consider the cooperative output regulation of linear multi-agent systems under switching network. The problem can be viewed as a generalization of the leader-following consensus problem of multi-agent systems. Due to the limited information exchanges of different subsystems, the problem cannot be solved by the decentralized approach and is not allowed to be solved by the centralized control. By devising a distributed observer network, we can solve the problem by both dynamic state feedback control and dynamic measurement output feedback control. As an application of our main result, we show that a special case of our results leads to the solution of the leader-following consensus problem of linear multi-agent systems.

485 citations

Journal Article•10.1109/TAC.2011.2158130•
Consensus Computation in Unreliable Networks: A System Theoretic Approach

[...]

Fabio Pasqualetti1, Antonio Bicchi, Francesco Bullo1•
University of California, Santa Barbara1
01 Jan 2012-IEEE Transactions on Automatic Control
TL;DR: In this article, the authors address the problem of ensuring trustworthy computation in a linear consensus network, where the authors model misbehaviors as unknown and unmeasurable inputs affecting the network, and cast the misbehavior detection and identification problem into an unknown-input system theoretic framework.
Abstract: This paper addresses the problem of ensuring trustworthy computation in a linear consensus network. A solution to this problem is relevant for several tasks in multi-agent systems including motion coordination, clock synchronization, and cooperative estimation. In a linear consensus network, we allow for the presence of misbehaving agents, whose behavior deviate from the nominal consensus evolution. We model misbehaviors as unknown and unmeasurable inputs affecting the network, and we cast the misbehavior detection and identification problem into an unknown-input system theoretic framework. We consider two extreme cases of misbehaving agents, namely faulty (non-colluding) and malicious (Byzantine) agents. First, we characterize the set of inputs that allow misbehaving agents to affect the consensus network while remaining undetected and/or unidentified from certain observing agents. Second, we provide worst-case bounds for the number of concurrent faulty or malicious agents that can be detected and identified. Precisely, the consensus network needs to be 2k+1 (resp. k+1) connected for k malicious (resp. faulty) agents to be generically detectable and identifiable by every well behaving agent. Third, we quantify the effect of undetectable inputs on the final consensus value. Fourth, we design three algorithms to detect and identify misbehaving agents. The first and the second algorithm apply fault detection techniques, and affords complete detection and identification if global knowledge of the network is available to each agent, at a high computational cost. The third algorithm is designed to exploit the presence in the network of weakly interconnected subparts, and provides local detection and identification of misbehaving agents whose behavior deviates more than a threshold, which is quantified in terms of the interconnection structure.

418 citations

Book Chapter•10.1007/978-3-642-27645-3_14•
Game Theory and Multi-agent Reinforcement Learning

[...]

Ann Nowé1, Peter Vrancx1, Yann-Michaël De Hauwere1•
Vrije Universiteit Brussel1
1 Jan 2012
TL;DR: A basic learning framework based on the economic research into game theory is described, and a representative selection of algorithms for the different areas of multi-agent reinforcement learning research is described.
Abstract: Reinforcement Learning was originally developed for Markov Decision Processes (MDPs). It allows a single agent to learn a policy that maximizes a possibly delayed reward signal in a stochastic stationary environment. It guarantees convergence to the optimal policy, provided that the agent can sufficiently experiment and the environment in which it is operating is Markovian. However, when multiple agents apply reinforcement learning in a shared environment, this might be beyond the MDP model. In such systems, the optimal policy of an agent depends not only on the environment, but on the policies of the other agents as well. These situations arise naturally in a variety of domains, such as: robotics, telecommunications, economics, distributed control, auctions, traffic light control, etc. In these domains multi-agent learning is used, either because of the complexity of the domain or because control is inherently decentralized. In such systems it is important that agents are capable of discovering good solutions to the problem at hand either by coordinating with other learners or by competing with them. This chapter focuses on the application reinforcement learning techniques in multi-agent systems. We describe a basic learning framework based on the economic research into game theory, and illustrate the additional complexity that arises in such systems. We also described a representative selection of algorithms for the different areas of multi-agent reinforcement learning research.

247 citations

Journal Article•10.1109/TSG.2011.2178044•
Intelligent Multiagent Control System for Energy and Comfort Management in Smart and Sustainable Buildings

[...]

Lingfeng Wang1, Zhu Wang1, Rui Yang1•
University of Toledo1
06 Feb 2012-IEEE Transactions on Smart Grid
TL;DR: A hierarchical multiagent control system with an intelligent optimizer is proposed in this study, which aims to minimize the power consumption without compromising the customers comfort in smart and energy-efficient buildings.
Abstract: Smart and energy-efficient buildings have recently become a trend for future building industry. The major challenge in the control system design for such a building is to minimize the power consumption without compromising the customers comfort. For this purpose, a hierarchical multiagent control system with an intelligent optimizer is proposed in this study. Four types of agents, which are switch agent, central coordinator-agent, local controller-agent, and load agent, cooperate with each other to achieve the overall control goals. Particle swarm optimization (PSO) is utilized to optimize the overall system and enhance the intelligence of the integrated building and microgrid system. A Graphical User Interface (GUI) based platform is designed for customers to input their preferences and monitor the results. Two sets of case studies are carried out and corresponding simulation results are presented in this paper.

238 citations

Journal Article•10.1016/J.AUTOMATICA.2012.02.032•
Brief paper: Target containment control of multi-agent systems with random switching interconnection topologies

[...]

Youcheng Lou1, Yiguang Hong1•
Chinese Academy of Sciences1
01 May 2012-Automatica
TL;DR: The multi-leader control problem is investigated via a combination of convex analysis and stochastic process and a necessary and sufficient condition is obtained to make all the mobile agents almost surely asymptotically converge to the static convex leader set.

235 citations

Journal Article•10.1109/TSG.2012.2208658•
Demand Response in Smart Distribution System With Multiple Microgrids

[...]

H. S. V. S. K. Nunna1, Suryanarayana Doolla1•
Indian Institute of Technology Bombay1
04 Sep 2012-IEEE Transactions on Smart Grid
TL;DR: Based on extensive simulation results of the system developed using Java Agent DEvelopment framework, it has been found that multi-agent based demand response is successful in reducing the system peak in addition to cost benefit for the customers with high priority index.
Abstract: In this paper, an agent based intelligent energy management system is proposed to facilitate power trading among microgrids and allow customers to participate in demand response. The proposed intelligence uses demand response, and diversity in electricity consumption patterns of the customers and availability of power from distributed generators as the vital means in managing power in the system. A new priority index is proposed for customers participating in the market based on frequency and size of load participating in demand response. In order to validate the proposed method, a case study with two interconnected microgrids is simulated. Based on extensive simulation results of the system developed using Java Agent DEvelopment framework (JADE), it has been found that multi-agent based demand response is successful in reducing the system peak in addition to cost benefit for the customers with high priority index.

231 citations

Journal Article•10.1007/S10515-011-0088-X•
Model checking agent programming languages

[...]

Louise A. Dennis1, Michael Fisher1, Matt Webster2, Rafael H. Bordini3•
University of Liverpool1, Daresbury Laboratory2, Universidade Federal do Rio Grande do Sul3
1 Mar 2012
TL;DR: This is the first comprehensive approach to the verification of programs developed using programming languages based on the BDI (belief-desire-intention) model of agency, and develops a specific layer of abstraction that maps the semantics of agent programs into the relevant model-checking framework.
Abstract: In this paper we describe a verification system for multi-agent programs. This is the first comprehensive approach to the verification of programs developed using programming languages based on the BDI (belief-desire-intention) model of agency. In particular, we have developed a specific layer of abstraction, sitting between the underlying verification system and the agent programming language, that maps the semantics of agent programs into the relevant model-checking framework. Crucially, this abstraction layer is both flexible and extensible; not only can a variety of different agent programming languages be implemented and verified, but even heterogeneous multi-agent programs can be captured semantically. In addition to describing this layer, and the semantic mapping inherent within it, we describe how the underlying model-checker is driven and how agent properties are checked. We also present several examples showing how the system can be used. As this is the first system of its kind, it is relatively slow, so we also indicate further work that needs to be tackled to improve performance.

225 citations

Journal Article•10.1016/J.SYSCONLE.2011.10.011•
An iterative learning approach to formation control of multi-agent systems

[...]

Yang Liu1, Yingmin Jia1•
Beihang University1
01 Jan 2012-Systems & Control Letters
TL;DR: A distributed D-type iterative learning scheme is developed for the multi-agent system with switching topology, whose switching time and sequence are allowed to be varied at different iterations according to the actual trajectories of agents.

220 citations

Journal Article•10.1016/J.AUTOMATICA.2012.03.017•
Brief paper: High-order consensus of heterogeneous multi-agent systems with unknown communication delays

[...]

Yu-Ping Tian1, Ya Zhang1•
Southeast University1
01 Jun 2012-Automatica
TL;DR: A necessary and sufficient condition is given for the existence of a high-order consensus solution to heterogeneous multi-agent systems with unknown communication delays and the condition shows that, for systems with diverse communication delays, high-orders does not require the self-delay of each agent to be equal to the corresponding communication delay.
Journal Article•10.1016/J.AUTOMATICA.2012.03.029•
Two consensus problems for discrete-time multi-agent systems with switching network topology

[...]

Youfeng Su1, Jie Huang1•
The Chinese University of Hong Kong1
01 Sep 2012-Automatica
TL;DR: Under the assumption that the system matrix is marginally stable, it is shown that these two consensus problems can be solved via the state feedback protocols, provided that the dynamic graph is jointly connected.
Journal Article•10.1016/J.AUTOMATICA.2012.08.037•
State based potential games

[...]

Jason R. Marden
01 Dec 2012-Automatica
TL;DR: A new framework, termed state based potential games, is proposed, which introduces an underlying state space into the framework of potential games that provides a system designer with an additional degree of freedom to help coordinate group behavior and overcome limitations.
Journal Article•10.1109/TAC.2011.2178883•
Network Connectivity Preserving Formation Stabilization and Obstacle Avoidance via a Decentralized Controller

[...]

Zhen Kan1, Ashwin P. Dani1, John M. Shea1, Warren E. Dixon1•
University of Florida1
01 Jul 2012-IEEE Transactions on Automatic Control
TL;DR: To guide the agents to a desired configuration while avoiding obstacles, a decentralized controller is developed based on the navigation function formalism by proving that the proposed controller is a qualified navigation function, convergence to the desired formation is guaranteed.
Abstract: A decentralized control method is developed to enable a group of agents to achieve a desired global configuration while maintaining global network connectivity and avoiding obstacles, using only local feedback and no radio communication between the agents for navigation. By modeling the interaction among the agents as a graph, and given a connected initial graph with a desired neighborhood between agents, the developed method ensures the desired communication links remain connected for all time. To guide the agents to a desired configuration while avoiding obstacles, a decentralized controller is developed based on the navigation function formalism. By proving that the proposed controller is a qualified navigation function, convergence to the desired formation is guaranteed.
Journal Article•10.1109/TSMCC.2012.2218596•
Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch

[...]

Yinliang Xu1, Wei Zhang1, Wenxin Liu1, Frank Ferrese2•
New Mexico State University1, Naval Surface Warfare Center2
1 Nov 2012
TL;DR: This paper proposes a fully distributed multiagent-based reinforcement learning method for optimal reactive power dispatch that can significantly speed up the learning process and decrease the occurrences of undesirable disturbances.
Abstract: This paper proposes a fully distributed multiagent-based reinforcement learning method for optimal reactive power dispatch. According to the method, two agents communicate with each other only if their corresponding buses are electrically coupled. The global rewards that are required for learning are obtained with a consensus-based global information discovery algorithm, which has been demonstrated to be efficient and reliable. Based on the discovered global rewards, a distributed Q-learning algorithm is implemented to minimize the active power loss while satisfying operational constraints. The proposed method does not require accurate system model and can learn from scratch. Simulation studies with power systems of different sizes show that the method is very computationally efficient and able to provide near-optimal solutions. It can be observed that prior knowledge can significantly speed up the learning process and decrease the occurrences of undesirable disturbances. The proposed method has good potential for online implementation.
Journal Article•10.1016/J.ENGAPPAI.2011.12.003•
A multi-agent system for distributed multi-project scheduling: An auction-based negotiation approach

[...]

Sunil Adhau1, M.L. Mittal1, Abhinav Mittal1•
Malaviya National Institute of Technology, Jaipur1
01 Dec 2012-Engineering Applications of Artificial Intelligence
TL;DR: A novel distributed multi-agent system using auctions based negotiation (DMAS/ABN) approach for resolving the resource conflicts and allocating multiple different types of shared resources amongst multiple competing projects and winner determination problem is solved by efficient new heuristic.
Journal Article•10.1080/00207179.2011.654264•
Consensus and its ℒ 2 -gain performance of multi-agent systems with intermittent information transmissions

[...]

Guanghui Wen1, Zhisheng Duan1, Zhongkui Li2, Guanrong Chen3•
Peking University1, Beijing Institute of Technology2, City University of Hong Kong3
27 Feb 2012-International Journal of Control
TL;DR: By combining tools from switching systems and Lyapunov stability theory, some sufficient conditions are established for consensus of multi-agent systems without any external disturbances under a fixed strongly connected topology.
Abstract: This article addresses the consensus problem for cooperative multiple agents with nonlinear dynamics on a fixed directed information network, where each agent can only communicate with its neighbours intermittently. A class of control algorithms is first introduced, using only intermittent relative local information. By combining tools from switching systems and Lyapunov stability theory, some sufficient conditions are established for consensus of multi-agent systems without any external disturbances under a fixed strongly connected topology. Theoretical analyses are further provided for consensus of multi-agent systems in the presence of external disturbances. It is shown that a finite ℒ2-gain performance index for the closed-loop multi-agent systems can be guaranteed if the coupling strength of the network is larger than a threshold value determined by the average communication rate and the generalised algebraic connectivity of the strongly connected topology. The results are then extended to consensus ...
Journal Article•10.1137/100800324•
Mean Field Games for Large-Population Multiagent Systems with Markov Jump Parameters

[...]

Bing-Chang Wang, Ji-Feng Zhang
23 Aug 2012-Siam Journal on Control and Optimization
TL;DR: In this paper, distributed games for large-population multiagent systems with random time-varying parameters are investigated, where the agents are coupled via their individual costs and the structure parameters are a family of independent Markov chains with identical generators.
Abstract: In this paper, distributed games for large-population multiagent systems with random time-varying parameters are investigated, where the agents are coupled via their individual costs and the structure parameters are a family of independent Markov chains with identical generators. The cost function of each agent is a long-run average tracking-type functional with an unknown mean field coupling nonlinear term as “reference signal.” To reduce the computational complexity, the mean field approach is applied to construct distributed strategies. The population statistics effect (PSE) is used to approximate the average effect of all the agents, and the distributed strategies are given through solving a Markov jump tracking problem. Here the PSE is a deterministic quantity and can be obtained by solving the Stackelberg equilibrium of an auxiliary two-player game. It is shown that the closed-loop system is uniformly stable, and the distributed strategies are asymptotically optimal in the sense of Nash equilibrium,...
Journal Article•10.1016/J.ASOC.2012.02.001•
A multi-agent based approach to dynamic scheduling of machines and automated guided vehicles in manufacturing systems

[...]

Rızvan Erol1, Cenk Sahin1, Adil Baykasoğlu2, Vahit Kaplanoglu3•
Çukurova University1, Dokuz Eylül University2, University of Gaziantep3
1 Jun 2012
TL;DR: The proposed multi-agent based approach works under a real-time environment and generates feasible schedules using negotiation/bidding mechanisms between agents and is tested on off-line scheduling problems from the literature.
Abstract: In real manufacturing environments, the control of system elements such as automated guided vehicles has some difficulties when planning operations dynamically. Multi agent-based systems, a newly maturing area of distributed artificial intelligence, provide some effective mechanisms for the management of such dynamic operations in manufacturing environments. This paper proposes a multi-agent based scheduling approach for automated guided vehicles and machines within a manufacturing system. The proposed multi-agent based approach works under a real-time environment and generates feasible schedules using negotiation/bidding mechanisms between agents. This approach is tested on off-line scheduling problems from the literature. The results show that our approach is capable of generating good schedules in real time comparable with the optimization algorithms and the frequently used dispatching rules.
Journal Article•10.1109/TAC.2012.2225539•
An Optimal Control Approach to the Multi-Agent Persistent Monitoring Problem

[...]

Christos G. Cassandras1, Xuchao Lin1, Xu Chu Ding•
Boston University1
18 Oct 2012
TL;DR: This work presents an optimal control framework for persistent monitoring problems where the objective is to control the movement of multiple cooperating agents to minimize an uncertainty metric in a given mission space and shows that the solution is robust with respect to the uncertainty model used.
Abstract: We propose an optimal control framework for persistent monitoring problems where the objective is to control the movement of multiple cooperating agents to minimize an uncertainty metric in a given mission space. In a one-dimensional mission space, we show that the optimal solution is for each agent to move at maximal speed from one switching point to the next, possibly waiting some time at each point before reversing its direction. Thus, the solution is reduced to a simpler parametric optimization problem: determining a sequence of switching locations and associated waiting times at these switching points for each agent. This amounts to a hybrid system which we analyze using Infinitesimal Perturbation Analysis (IPA) to obtain a complete on-line solution through a gradient-based algorithm. We also show that the solution is robust with respect to the uncertainty model used. This establishes the basis for extending this approach to a two-dimensional mission space.
Journal Article•10.1016/J.SYSCONLE.2012.06.003•
Leaders in multi-agent controllability under consensus algorithm and tree topology

[...]

Zhijian Ji1, Hai Lin2, Haisheng Yu1•
Qingdao University1, University of Notre Dame2
01 Sep 2012-Systems & Control Letters
TL;DR: The main objective of this paper aims to characterize the virtue that leaders should have from the perspective of algebraic and graphical conditions and shows for path topologies that controllability completely depends on the leaders’ location.
Journal Article•10.1016/J.ESWA.2012.01.086•
Turist@: Agent-based personalised recommendation of tourist activities

[...]

Montserrat Batet, Antonio Moreno, David Sánchez, David Isern, Aida Valls 
01 Jun 2012-Expert Systems With Applications
TL;DR: A novel recommendation system, [email protected], which incorporates a mixture of content-based and collaborative recommendation strategies, thus avoiding the drawbacks of each individual method, and is able to perform recommendations in heterogeneous scenarios.
Abstract: Recommender systems in e-Tourism normally focus on helping tourists to select appropriate destinations. A related problem that has been less explored in the literature is how to provide personalised recommendations of cultural and leisure activities when the tourist has already arrived at the destination. This paper presents a novel recommendation system, [email protected], which addresses this issue. Its agent-based modular design permits to model different kinds of activities in a flexible way, and allows the implementation of a location-aware front-end in the mobile device of the user. Special care has been put in the recommendation engine, implemented via a specialised Recommender Agent. It incorporates a mixture of content-based and collaborative recommendation strategies, thus avoiding the drawbacks of each individual method, and is able to perform recommendations in heterogeneous scenarios. Recommendations take into account user profiles which are implicitly updated after the analysis of user actions (e.g., queries, evaluations). The system has been successfully deployed and tested in the World Heritage-listed city of Tarragona.
Journal Article•10.1080/00207179.2012.674644•
Distributed robust control of linear multi-agent systems with parameter uncertainties

[...]

Zhongkui Li1, Zhisheng Duan2, Lihua Xie3, Xiangdong Liu1•
Beijing Institute of Technology1, Peking University2, Nanyang Technological University3
12 Jun 2012-International Journal of Control
TL;DR: It is shown for both the continuous- and discrete-time cases that the distributed robust control problems under such controllers in the sense of quadratic stability are equivalent to the H ∞ control problems of a set of decoupled linear systems having the same dimensions as a single agent.
Abstract: This article considers the distributed robust control problems of uncertain linear multi-agent systems with undirected communication topologies. It is assumed that the agents have identical nominal dynamics while subject to different norm-bounded parameter uncertainties, leading to weakly heterogeneous multi-agent systems. Distributed controllers are designed for both continuous- and discrete-time multi-agent systems, based on the relative states of neighbouring agents and a subset of absolute states of the agents. It is shown for both the continuous- and discrete-time cases that the distributed robust control problems under such controllers in the sense of quadratic stability are equivalent to the H ∞ control problems of a set of decoupled linear systems having the same dimensions as a single agent. A two-step algorithm is presented to construct the distributed controller for the continuous-time case, which does not involve any conservatism and meanwhile decouples the feedback gain design from the commun...
Proceedings Article•
Sampled-data based average consensus with logarithmic quantizers

[...]

Shuai Liu1, Tao Li2, Lihua Xie1, Minyue Fu3, Ji-Feng Zhang2 •
Nanyang Technological University1, Chinese Academy of Sciences2, University of Newcastle3
25 Jul 2012
TL;DR: In this article, a distributed consensus protocol is proposed based on sampled measurements, which is robust to the logarithmic quantization, i.e. all the states of the agents are uniformly bounded and the gap between the state of each agent and the average value of the initial conditions converges to zero as the density of quantization levels goes to infinity.
Abstract: This paper considers the sampled-data average consensus problem for multi-agent systems with first order continuous dynamics. The communication channels among the agents are constrained in which the exchanged information is digital rather than analogue. In this paper, the logarithmic quantizer is applied to the communication channels. A distributed consensus protocol is proposed based on sampled measurements. It is proved that as long as the quantization levels are dense enough, the proposed protocol is robust to the logarithmic quantization, i.e. all the states of the agents are uniformly bounded and the gap between the state of each agent and the average value of the initial conditions converges to zero as the density of quantization levels goes to infinity. An example is given to demonstrate the effectiveness of the protocol.
Journal Article•10.1016/J.SYSCONLE.2012.04.007•
Tracking control over a finite interval for multi-agent systems with a time-varying reference trajectory☆

[...]

Deyuan Meng1, Yingmin Jia1, Junping Du2, Fashan Yu•
Beihang University1, Beijing University of Posts and Telecommunications2
01 Jul 2012-Systems & Control Letters
TL;DR: A unified algorithm is presented for agents described by both discrete-time and continuous-time models through using the iterative learning approach to achieve the formation control for multi-agent systems.
Journal Article•10.1109/TAMD.2011.2160943•
Autonomous Learning of High-Level States and Actions in Continuous Environments

[...]

Jonathan Mugan, Benjamin Kuipers1•
University of Michigan1
01 Mar 2012-IEEE Transactions on Autonomous Mental Development
TL;DR: This paper proposes attacking the problem of learning high-level states and actions in continuous environments by using a qualitative representation to bridge the gap between continuous and discrete variable representations and shows that the agent was able to use this method to autonomously learn to perform the tasks.
Abstract: How can an agent bootstrap up from a low-level representation to autonomously learn high-level states and actions using only domain-general knowledge? In this paper, we assume that the learning agent has a set of continuous variables describing the environment. There exist methods for learning models of the environment, and there also exist methods for planning. However, for autonomous learning, these methods have been used almost exclusively in discrete environments. We propose attacking the problem of learning high-level states and actions in continuous environments by using a qualitative representation to bridge the gap between continuous and discrete variable representations. In this approach, the agent begins with a broad discretization and initially can only tell if the value of each variable is increasing, decreasing, or remaining steady. The agent then simultaneously learns a qualitative representation (discretization) and a set of predictive models of the environment. These models are converted into plans to perform actions. The agent then uses those learned actions to explore the environment. The method is evaluated using a simulated robot with realistic physics. The robot is sitting at a table that contains a block and other distractor objects that are out of reach. The agent autonomously explores the environment without being given a task. After learning, the agent is given various tasks to determine if it learned the necessary states and actions to complete them. The results show that the agent was able to use this method to autonomously learn to perform the tasks.
Journal Article•10.1609/AIMAG.V33I3.2429•
Distributed Problem Solving

[...]

William Yeoh1, Makoto Yokoo2•
Singapore Management University1, Kyushu University2
20 Sep 2012-Ai Magazine
TL;DR: This article provides an overview of two distributed problem solving models, namely distributed constraint satisfaction problems ( DCSPs) and distributed constraint optimization problems (DCOPs), and some of their algorithms.
Abstract: Distributed problem solving is a subfield within multiagent systems, where agents are assumed to be part of a team and collaborate with each other to reach a common goal. In this article, we illustrate the motivations for distributed problem solving and provide an overview of two distributed problem solving models, namely distributed constraint satisfaction problems (DCSPs) and distributed constraint optimization problems (DCOPs), and some of their algorithms.
Journal Article•10.1016/J.SIMPAT.2011.12.011•
TAPAS: A multi-agent-based model for simulation of transport chains

[...]

Johan Holmgren1, Paul Davidsson1, Paul Davidsson2, Jan A. Persson2, Jan A. Persson1, Linda Ramstedt •
Blekinge Institute of Technology1, Malmö University2
01 Apr 2012-Simulation Modelling Practice and Theory
TL;DR: TAPAS is more powerful than traditional approaches to freight transport analysis, as it explicitly models production and customer demand, and it captures the interaction between individual transport chain actors, their heterogeneity and decision making processes, as well as time aspects.
Proceedings Article•10.1109/HPCC.2012.79•
Exploitation of High Performance Computing in the FLAME Agent-Based Simulation Framework

[...]

Simon Coakley1, Marian Gheorghe1, Mike Holcombe1, Shawn Chin, David Worth, Chris Greenough •
University of Sheffield1
25 Jun 2012
TL;DR: The framework presented allows modellers to concentrate on the model while the framework handles the efficient execution of simulations, and a state machine based representation of agents that allows a statically calculated optimal ordering of agent execution and parallel communication routines.
Abstract: This paper describes the design of an agent-based modelling framework for high performance computing. Rather than a collection of methods that require parallel programming expertise the framework presented allows modellers to concentrate on the model while the framework handles the efficient execution of simulations. The framework uses a state machine based representation of agents that allows a statically calculated optimal ordering of agent execution and parallel communication routines. Some experiments with the current implementation and the results of using a simple communication dominant model for benchmarking performance are reported. The model with half a million agents is used to show that a parallel efficiency of above 80% is achievable when distributed over 432 processors. Future improvements are discussed including data dependency analysis, vector operations over agents, and dynamic task scheduling.
Book Chapter•10.1002/9780470828229.CH7•
Consensus in Heterogeneous Multi-Agent Systems

[...]

Yu-Ping Tian1•
Southeast University1
22 Aug 2012
TL;DR: This chapter uses the frequency-domain method to develop variable scalable consensus conditions for low-order agent systems with diverse communication and input delays andHigh-order consensus is defined and investigated for a class of high-order heterogeneous agent systems.
Abstract: In this chapter, we use the frequency-domain method to develop variable scalable consensus conditions for low-order agent systems with diverse communication and input delays. High-order consensus is also defined and investigated for a class of high-order heterogeneous agent systems.
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve