Top 1168 papers published in the topic of Multi-agent system in 2017

Showing papers on "Multi-agent system published in 2017"

Journal Article•10.1371/JOURNAL.PONE.0172395•

Multiagent cooperation and competition with deep reinforcement learning

[...]

Ardi Tampuu¹, Tambet Matiisen¹, Dorian Kodelja¹, Ilya Kuzovkin¹, Kristjan Korjus¹, Juhan Aru², Jaan Aru¹, Raul Vicente¹ - Show less +4 more•Institutions (2)

University of Tartu¹, ETH Zurich²

05 Apr 2017-PLOS ONE

TL;DR: The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments and describes the progression from competitive to collaborative behavior when the incentive to cooperate is increased.

...read moreread less

Abstract: Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments.

...read moreread less

982 citations

Proceedings Article•10.1109/ICRA.2017.7989037•

Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning

[...]

Yu Fan Chen¹, Miao Liu², Michael Everett¹, Jonathan P. How¹•Institutions (2)

Massachusetts Institute of Technology¹, IBM²

1 May 2017

TL;DR: In this article, a value network is proposed to estimate the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors, and the value network not only admits efficient (i.e., realtime implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion.

...read moreread less

Abstract: Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26% improvement in paths quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy.

...read moreread less

720 citations

Journal Article•10.1109/TIE.2017.2701778•

Distributed Formation Control of Networked Multi-Agent Systems Using a Dynamic Event-Triggered Communication Mechanism

[...]

Xiaohua Ge¹, Qing-Long Han¹•Institutions (1)

Swinburne University of Technology¹

05 May 2017-IEEE Transactions on Industrial Electronics

TL;DR: An event-triggered formation protocol is delicately proposed by using only locally triggered sampled data in a distributed manner and the state formation control problem is cast into an asymptotic stability problem of a reduced-order closed-loop system.

...read moreread less

Abstract: This paper addresses the distributed formation control problem of a networked multi-agent system (MAS) subject to limited communication resources. First, a dynamic event-triggered communication mechanism (DECM) is developed to schedule inter-agent communication such that some unnecessary data exchanges among agents can be reduced so as to achieve better resource efficiency. Different from most of the existing event-triggered communication mechanisms, wherein threshold parameters are fixed all the time, the threshold parameter in the developed event triggering condition is dynamically adjustable in accordance with a dynamic rule. It is numerically shown that the proposed DECM can achieve a better tradeoff between reducing inter-agent communication frequency and preserving an expected formation than some existing ones. Second, an event-triggered formation protocol is delicately proposed by using only locally triggered sampled data in a distributed manner. Based on the formation protocol, it is shown that the state formation control problem is cast into an asymptotic stability problem of a reduced-order closed-loop system. Then, criteria for designing desired formation protocol and communication mechanism are derived. Finally, the effectiveness and advantages of the proposed approach are demonstrated through a comparative study in multirobot formation control.

...read moreread less

662 citations

Proceedings Article•10.1109/ICCV.2017.321•

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

[...]

Abhishek Das¹, Satwik Kottur², Jose M. F. Moura², Stefan Lee³, Dhruv Batra¹ - Show less +1 more•Institutions (3)

Georgia Institute of Technology¹, Carnegie Mellon University², Virginia Tech³

20 Mar 2017

TL;DR: This work poses a cooperative ‘image guessing’ game between two agents who communicate in natural language dialog so that Q-BOT can select an unseen image from a lineup of images and shows the emergence of grounded language and communication among ‘visual’ dialog agents with no human supervision.

...read moreread less

Abstract: We introduce the first goal-driven training for visual question answering and dialog agents. Specifically, we pose a cooperative ‘image guessing’ game between two agents – Q-BOT and A-BOT– who communicate in natural language dialog so that Q-BOT can select an unseen image from a lineup of images. We use deep reinforcement learning (RL) to learn the policies of these agents end-to-end – from pixels to multi-agent multi-round dialog to game reward.,,We demonstrate two experimental results.,,First, as a ‘sanity check’ demonstration of pure RL (from scratch), we show results on a synthetic world, where the agents communicate in ungrounded vocabularies, i.e., symbols with no pre-specified meanings (X, Y, Z). We find that two bots invent their own communication protocol and start using certain symbols to ask/answer about certain visual attributes (shape/color/style). Thus, we demonstrate the emergence of grounded language and communication among ‘visual’ dialog agents with no human supervision.,,Second, we conduct large-scale real-image experiments on the VisDial dataset [5], where we pretrain on dialog data with supervised learning (SL) and show that the RL finetuned agents significantly outperform supervised pretraining. Interestingly, the RL Q-BOT learns to ask questions that A-BOT is good at, ultimately resulting in more informative dialog and a better team.

...read moreread less

509 citations

Journal Article•10.1109/TIE.2016.2542134•

Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method

[...]

Huaguang Zhang¹, He Jiang¹, Yanhong Luo¹, Geyang Xiao¹•Institutions (1)

Northeastern University (China)¹

01 May 2017-IEEE Transactions on Industrial Electronics

TL;DR: A data-based adaptive dynamic programming method is presented using the current and past system data rather than the accurate system models also instead of the traditional identification scheme which would cause the approximation residual errors.

...read moreread less

Abstract: This paper investigates the optimal consensus control problem for discrete-time multi-agent systems with completely unknown dynamics by utilizing a data-driven reinforcement learning method. It is known that the optimal consensus control for multi-agent systems relies on the solution of the coupled Hamilton–Jacobi–Bellman equation, which is generally impossible to be solved analytically. Even worse, most real-world systems are too complicated to obtain accurate mathematical models. To overcome these deficiencies, a data-based adaptive dynamic programming method is presented using the current and past system data rather than the accurate system models also instead of the traditional identification scheme which would cause the approximation residual errors. First, we establish a discounted performance index and formulate the optimal consensus problem via Bellman optimality principle. Then, we introduce the policy iteration algorithm which motivates this paper. To implement the proposed online action-dependent heuristic dynamic programming method, two neural networks (NNs), 1) critic NN and 2) actor NN, are employed to approximate the iterative performance index functions and control policies, respectively, in real time. Finally, two simulation examples are provided to demonstrate the effectiveness of the proposed method.

...read moreread less

401 citations

Journal Article•10.1109/TCYB.2016.2602327•

Output Consensus of Heterogeneous Linear Multi-Agent Systems by Distributed Event-Triggered/Self-Triggered Strategy

[...]

Wenfeng Hu¹, Lu Liu¹, Gang Feng¹•Institutions (1)

City University of Hong Kong¹

01 Aug 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: This paper addresses the output consensus problem of heterogeneous linear multi-agent systems by introducing a fixed timer into both event- and self-triggered control schemes, so that Zeno behavior can be ruled out for each agent.

...read moreread less

Abstract: This paper addresses the output consensus problem of heterogeneous linear multi-agent systems. We first propose a novel distributed event-triggered control scheme. It is shown that, with the proposed control scheme, the output consensus problem can be solved if two matrix equations are satisfied. Then, we further propose a novel self-triggered control scheme, with which continuous monitoring is avoided. By introducing a fixed timer into both event- and self-triggered control schemes, Zeno behavior can be ruled out for each agent. The effectiveness of the event- and self-triggered control schemes is illustrated by an example.

...read moreread less

336 citations

Journal Article•10.1109/TAC.2016.2610945•

A Multi-Agent System With a Proportional-Integral Protocol for Distributed Constrained Optimization

[...]

Shaofu Yang¹, Qingshan Liu², Jun Wang¹•Institutions (2)

City University of Hong Kong¹, Huazhong University of Science and Technology²

01 Jul 2017-IEEE Transactions on Automatic Control

TL;DR: It is proved that all agents with any initial state can reach output consensus at an optimal solution to the given constrained optimization problem, provided that the graph describing the communication links among agents is undirected and connected.

...read moreread less

Abstract: This technical note presents a continuous-time multi-agent system for distributed optimization with an additive objective function composed of individual objective functions subject to bound, equality, and inequality constraints. Each individual objective function is assumed to be convex in the region defined by its local bound constraints only without the need to be globally convex. All agents in the system communicate using a proportional-integral protocol with their output information instead of state information to reduce communication bandwidth. It is proved that all agents with any initial state can reach output consensus at an optimal solution to the given constrained optimization problem, provided that the graph describing the communication links among agents is undirected and connected. It is further proved that the system with only integral protocol is also convergent to the unique optimal solution if each individual objective function is strictly convex. Simulation results are presented to substantiate the theoretical results.

...read moreread less

331 citations

Journal Article•10.1016/J.INS.2015.06.005•

Network-based leader-following consensus of nonlinear multi-agent systems via distributed impulsive control ☆

[...]

Wangli He¹, Guanrong Chen², Qing-Long Han³, Feng Qian¹•Institutions (3)

East China University of Science and Technology¹, City University of Hong Kong², Griffith University³

20 Feb 2017-Information Sciences

TL;DR: This paper investigates the problem of network-based leader-following consensus of nonlinear multi-agent systems via distributed impulsive control by taking network-induced delays into account and derives a general consensus criterion.

...read moreread less

328 citations

Journal Article•10.1109/TCYB.2015.2514119•

Leader-Following Consensus of Nonlinear Multiagent Systems With Stochastic Sampling

[...]

Wangli He¹, Biao Zhang¹, Qing-Long Han², Feng Qian¹, Jürgen Kurths³, Jinde Cao⁴ - Show less +2 more•Institutions (4)

East China University of Science and Technology¹, Griffith University², Potsdam Institute for Climate Impact Research³, Southeast University⁴

01 Feb 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: It is shown that the leader-following consensus problem with stochastic sampling can be transferred into a master-slave synchronization problem with only one master system and two slave systems.

...read moreread less

Abstract: This paper is concerned with sampled-data leader-following consensus of a group of agents with nonlinear characteristic. A distributed consensus protocol with probabilistic sampling in two sampling periods is proposed. First, a general consensus criterion is derived for multiagent systems under a directed graph. A number of results in several special cases without transmittal delays or with the deterministic sampling are obtained. Second, a dimension-reduced condition is obtained for multiagent systems under an undirected graph. It is shown that the leader-following consensus problem with stochastic sampling can be transferred into a master–slave synchronization problem with only one master system and two slave systems. The problem solving is independent of the number of agents, which greatly facilitates its application to large-scale networked agents. Third, the network design issue is further addressed, demonstrating the positive and active roles of the network structure in reaching consensus. Finally, two examples are given to verify the theoretical results.

...read moreread less

309 citations

Journal Article•10.1016/J.ENVSOFT.2016.10.008•

Theoretical foundations of human decision-making in agent-based land use models A review

[...]

Jürgen Groeneveld¹, B. Mller¹, Carsten M. Buchmann², Gunnar Dressler¹, Cheng Guo¹, Niklas Hase¹, Falk Hoffmann¹, F. John¹, Christian Klassert³, T. Lauf¹, Veronika Liebelt¹, Henning Nolzen¹, N. Pannicke¹, Jule Schulze¹, Hanna Weise⁴, Nina Schwarz¹ - Show less +12 more•Institutions (4)

Helmholtz Centre for Environmental Research - UFZ¹, University of Hohenheim², Leipzig University³, Free University of Berlin⁴

01 Jan 2017-Environmental Modelling and Software

TL;DR: It is shown that most human decision sub-models are not explicitly based on a specific theory and if so they are mostly based on economic theories, such as the rational actor, and mainly ignoring other relevant disciplines.

...read moreread less

Abstract: Recent reviews stated that the complex and context-dependent nature of human decision-making resulted in ad-hoc representations of human decision in agent-based land use change models (LUCC ABMs) and that these representations are often not explicitly grounded in theory. However, a systematic survey on the characteristics (e.g. uncertainty, adaptation, learning, interactions and heterogeneities of agents) of representing human decision-making in LUCC ABMs is missing. Therefore, the aim of this study is to inform this debate by reviewing 134 LUCC ABM papers. We show that most human decision sub-models are not explicitly based on a specific theory and if so they are mostly based on economic theories, such as the rational actor, and mainly ignoring other relevant disciplines. Consolidating and enlarging the theoretical basis for modelling human decision-making may be achieved by using a structural framework for modellers, re-using published decision models, learning from other disciplines and fostering collaboration with social scientists. Review on human decisions in agent-based models of land use change.Most human decision models are not based on theory.Used theories are mainly from economics and not from psychology.A framework to guide modellers how to implement human decision is needed.

...read moreread less

304 citations

Journal Article•10.1016/J.RSER.2017.03.107•

Towards the next generation of smart grids: semantic and holonic multi-agent management of distributed energy resources

[...]

Shaun Howell¹, Yacine Rezgui¹, Jean-Laurent Hippolyte¹, Bejay Jayan¹, Haijiang Li¹ - Show less +1 more•Institutions (1)

Cardiff University¹

01 Sep 2017-Renewable & Sustainable Energy Reviews

TL;DR: In this article, the authors argue that a new generation of holonic energy systems is required to orchestrate the interplay between these dense, diverse and distributed energy components, which promotes the systemic features of autonomy, belonging, connectivity, diversity and emergence, and balances global and local system objectives.

...read moreread less

Abstract: The energy landscape is experiencing accelerating change; centralized energy systems are being decarbonized, and transitioning towards distributed energy systems, facilitated by advances in power system management and information and communication technologies. This paper elaborates on these generations of energy systems by critically reviewing relevant authoritative literature. This includes a discussion of modern concepts such as ‘smart grid’, ‘microgrid’, ‘virtual power plant’ and ‘multi-energy system’, and the relationships between them, as well as the trends towards distributed intelligence and interoperability. Each of these emerging urban energy concepts holds merit when applied within a centralized grid paradigm, but very little research applies these approaches within the emerging energy landscape typified by a high penetration of distributed energy resources, prosumers (consumers and producers), interoperability, and big data. Given the ongoing boom in these fields, this will lead to new challenges and opportunities as the status-quo of energy systems changes dramatically. We argue that a new generation of holonic energy systems is required to orchestrate the interplay between these dense, diverse and distributed energy components. The paper therefore contributes a description of holonic energy systems and the implicit research required towards sustainability and resilience in the imminent energy landscape. This promotes the systemic features of autonomy, belonging, connectivity, diversity and emergence, and balances global and local system objectives, through adaptive control topologies and demand responsive energy management. Future research avenues are identified to support this transition regarding interoperability, secure distributed control and a system of systems approach.

...read moreread less

Journal Article•10.1007/S11192-011-0468-9•

Agent-based computing from multi-agent systems to agent-based Models: a visual survey

[...]

Muaz A. Niazi¹, Amir Hussain¹•Institutions (1)

University of Stirling¹

19 Aug 2017-arXiv: Social and Information Networks

TL;DR: In this article, the authors used scientometric analysis to analyze all sub-domains of agent-based computing, including Life Sciences, Ecological Sciences and Social Sciences, using 1,064 journal articles indexed in the ISI web of knowledge published during a twenty-year period: 1990-2010.

...read moreread less

Abstract: Agent-Based Computing is a diverse research domain concerned with the building of intelligent software based on the concept of "agents". In this paper, we use Scientometric analysis to analyze all sub-domains of agent-based computing. Our data consists of 1,064 journal articles indexed in the ISI web of knowledge published during a twenty year period: 1990-2010. These were retrieved using a topic search with various keywords commonly used in sub-domains of agent-based computing. In our proposed approach, we have employed a combination of two applications for analysis, namely Network Workbench and CiteSpace - wherein Network Workbench allowed for the analysis of complex network aspects of the domain, detailed visualization-based analysis of the bibliographic data was performed using CiteSpace. Our results include the identification of the largest cluster based on keywords, the timeline of publication of index terms, the core journals and key subject categories. We also identify the core authors, top countries of origin of the manuscripts along with core research institutes. Finally, our results have interestingly revealed the strong presence of agent-based computing in a number of non-computing related scientific domains including Life Sciences, Ecological Sciences and Social Sciences.

...read moreread less

Journal Article•10.1109/TAC.2016.2535102•

Decentralized Robust Synchronization of Unknown High Order Nonlinear Multi-Agent Systems With Prescribed Transient and Steady State Performance

[...]

Charalampos P. Bechlioulis¹, George A. Rovithakis²•Institutions (2)

National Technical University of Athens¹, Aristotle University of Thessaloniki²

01 Jan 2017-IEEE Transactions on Automatic Control

TL;DR: A robust decentralized control law of minimal complexity is proposed that achieves prescribed, arbitrarily fast and accurate synchronization of the following agents with the leader.

...read moreread less

Abstract: In this paper, we consider the synchronization control problem for uncertain high-order nonlinear multi-agent systems in a leader-follower scheme, under a directed communication protocol. A robust decentralized control law of minimal complexity is proposed that achieves prescribed, arbitrarily fast and accurate synchronization of the following agents with the leader. The control protocol is decentralized in the sense that the control signal of each agent is calculated based solely on local relative state information from its neighborhood set. Additionally, no information regarding the agents' dynamic model is employed in the design procedure. Moreover, provided that the communication graph is connected and contrary to the related works on multi-agent systems, the controller-imposed transient and steady state performance bounds are fully decoupled from: 1) the underlying graph topology, 2) the control gains selection, and 3) the agents' model uncertainties, and are solely prescribed by certain designer-specified performance functions. Extensive simulation results clarify and verify the approach.

...read moreread less

Journal Article•10.1109/TIE.2016.2613929•

Event-Based Set-Membership Leader-Following Consensus of Networked Multi-Agent Systems Subject to Limited Communication Resources and Unknown-But-Bounded Noise

[...]

Xiaohua Ge¹, Qing-Long Han¹, Fuwen Yang²•Institutions (2)

Swinburne University of Technology¹, Griffith University²

01 Jun 2017-IEEE Transactions on Industrial Electronics

TL;DR: A delicate convex optimization algorithm in terms of recursive linear matrix inequalities is proposed to design desired consensus protocol and event-based mechanism for networked multi-agent systems subject to limited communication resources and unknown-but-bounded process and measurement noise.

...read moreread less

Abstract: This paper addresses the problem of leader-following consensus for networked multi-agent systems subject to limited communication resources and unknown-but-bounded process and measurement noise. First, a new distributed event-based communication mechanism on the basis of a time-varying threshold parameter is developed to schedule transmission of each sensor's measurement through a communication network so as to alleviate consecutive occupancy of communication resources. Second, a novel concept of set-membership leader-following consensus is put forward, through which the true states of all followers are guaranteed to always reside in a bounding ellipsoidal set of the leader's state. Third, in the case that full information of followers’ states are not measurable, a distributed observer-based consensus protocol is presented to provide a set-membership estimation of each follower's state. Then, based on a recursive computation of confidence state estimation ellipsoids and leader state ellipsoid, a delicate convex optimization algorithm in terms of recursive linear matrix inequalities is proposed to design desired consensus protocol and event-based mechanism. Finally, an illustrative example is given to show the effectiveness and advantage of the developed approach.

...read moreread less

Journal Article•10.1109/TCYB.2016.2570860•

Consensus of Multiagent Systems Subject to Partially Accessible and Overlapping Markovian Network Topologies

[...]

Xiaohua Ge¹, Qing-Long Han²•Institutions (2)

Griffith University¹, Swinburne University of Technology²

01 Aug 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: This paper addresses the consensus problem for a continuous-time multiagent system (MAS) with Markovian network topologies and external disturbance with a proposed consensus protocol that relies only on group and partial modes and eliminates the need for complete knowledge of global modes.

...read moreread less

Abstract: This paper addresses the consensus problem for a continuous-time multiagent system (MAS) with Markovian network topologies and external disturbance. Different from some existing results, global jumping modes of the Markovian network topologies are not required to be completely available for consensus protocol design. A network topology mode regulator (NTMR) is first developed to decompose unavailable global modes into several overlapping groups, where overlapping groups refer to the scenario that there exist commonly shared local modes between any two distinct groups. The NTMR schedules which group modes each agent may access at every time step. Then a new group mode-dependent distributed consensus protocol on the basis of relative measurement outputs of neighboring agents is delicately constructed. In this sense, the proposed consensus protocol relies only on group and partial modes and eliminates the need for complete knowledge of global modes. Sufficient conditions on the existence of desired distributed consensus protocols are derived to ensure consensus of the MAS with a prescribed $H_{\infty }$ performance level. Two examples are provided to show the effectiveness of the proposed consensus protocol.

...read moreread less

Journal Article•10.1109/TAC.2016.2628204•

Adaptive Consensus of Nonlinear Multi-Agent Systems With Non-Identical Partially Unknown Control Directions and Bounded Modelling Errors

[...]

Ci Chen¹, Changyun Wen², Zhi Liu¹, Kan Xie¹, Yun Zhang¹, C. L. Philip Chen³ - Show less +2 more•Institutions (3)

Guangdong University of Technology¹, Nanyang Technological University², University of Macau³

01 Sep 2017-IEEE Transactions on Automatic Control

TL;DR: This note proposes an adaptive method to relax such a requirement to allow non-identical control directions, under the condition that some control directions are known.

...read moreread less

Abstract: Existing Nussbaum function based results on consensus of multi-agent systems require that the unknown control directions of all the agents should be the same. This note proposes an adaptive method to relax such a requirement to allow non-identical control directions, under the condition that some control directions are known. Technically, a novel idea is proposed to construct a new Nussbaum function, from which a conditional inequality is developed to handle time-varying input gains. Then, the inequality is integrated with adaptive control technique such that the proposed Nussbaum function for each agent is adaptively updated. Moreover, in addition to parametric uncertainties, each agent has non-parametric bounded modelling errors which may include external disturbances and approximation errors of static input nonlinearities. Even in the presence of such uncertainties, the proposed control scheme is still able to ensure the states of all the agents asymptotically reach perfect consensus. Finally, simulation study is performed to show the effectiveness of the proposed approach.

...read moreread less

Journal Article•10.1109/TAC.2016.2614486•

Event-Triggered Mean-Square Consensus Control for Time-Varying Stochastic Multi-Agent System With Sensor Saturations

[...]

Lifeng Ma¹, Zidong Wang², Hak-Keung Lam³•Institutions (3)

Nanjing University of Science and Technology¹, Brunel University London², King's College London³

01 Jul 2017-IEEE Transactions on Automatic Control

TL;DR: The aim of the proposed problem is to design time-varying output-feedback controllers such that, at each time step, the mean-square consensus index of the closed-loop multi-agent system satisfies the pre-specified upper bound constraints subject to certain triggering mechanism.

...read moreread less

Abstract: In this technical note, the consensus control problem is investigated for a class of discrete time-varying stochastic multi-agent system subject to sensor saturations. An event-based mechanism is adopted where each agent updates the control input signal only when the pre-specified triggering condition is violated. To reflect the time-varying manner and characterize the transient consensus behavior, a new index for mean-square consensus is put forward to quantify the deviation level from individual agent to the average value of all agents’ states. For a fixed network topology, the aim of the proposed problem is to design time-varying output-feedback controllers such that, at each time step, the mean-square consensus index of the closed-loop multi-agent system satisfies the pre-specified upper bound constraints subject to certain triggering mechanism. Both the existence conditions and the explicit expression of the desired controllers are established by resorting to the solutions to a set of recursive matrix inequalities. An illustrative simulation example is utilized to demonstrate the usefulness of the proposed algorithms.

...read moreread less

Journal Article•10.1016/J.JFRANKLIN.2016.12.021•

Distributed MPC for formation of multi-agent systems with collision avoidance and obstacle avoidance

[...]

Li Dai¹, Qun Cao¹, Yuanqing Xia¹, Yulong Gao¹•Institutions (1)

Beijing Institute of Technology¹

01 Mar 2017-Journal of The Franklin Institute-engineering and Applied Mathematics

TL;DR: The proposed algorithm for formation of multiple linear second-order agents with collision avoidance and obstacle avoidance with recursive feasibility of the resulting optimization problem is guaranteed and closed-loop stability of the whole system is ensured.

...read moreread less

Abstract: The paper is concerned with the problem of distributed model predictive control (DMPC) for formation of multiple linear second-order agents with collision avoidance and obstacle avoidance. All the agents are permitted to implement optimization simultaneously at each time step. The assumed input trajectory and state trajectory are introduced to obtain a computationally tractable optimization problem in a distributed manner. As a result, a compatibility constraint is required to ensure the consistency between each agent׳s real operation and its plan and to establish the agreement among agents. The terminal ingredients are tailored by making use of the specific form of the system model and the control objective. The terminal set is ensured to be positively invariant with the designed terminal controller. The collision avoidance constraint and the obstacle avoidance constraint are satisfied for any state in the terminal set. The weighted matrix of the terminal cost is determined by solving a Lyapunov equation. Moreover, recursive feasibility of the resulting optimization problem is guaranteed and closed-loop stability of the whole system is ensured. Finally, a numerical example is given to illustrate the effectiveness of the proposed algorithm.

...read moreread less

Journal Article•10.1109/TCYB.2015.2508561•

Cooperative Output Regulation of Heterogeneous Linear Multi-Agent Systems by Event-Triggered Control

[...]

Wenfeng Hu¹, Lu Liu¹•Institutions (1)

City University of Hong Kong¹

01 Jan 2017-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: A novel distributed event-triggered control scheme is developed to solve the cooperative output regulation problem of heterogeneous MASs and an internal reference model for each agent is proposed, such that continuous monitoring of measurement errors can be avoided.

...read moreread less

Abstract: In this paper, we consider the cooperative output regulation problem of heterogeneous linear multi-agent systems (MASs) by event-triggered control. We first develop an event-triggering mechanism for leader-following consensus of homogeneous MASs. Then by proposing an internal reference model for each agent, a novel distributed event-triggered control scheme is developed to solve the cooperative output regulation problem of heterogeneous MASs. Furthermore, a novel self-triggered control scheme is also proposed, such that continuous monitoring of measurement errors can be avoided. The feasibility of both proposed control schemes is studied by excluding Zeno behavior for each agent. An example is finally provided to demonstrate the effectiveness of the control schemes.

...read moreread less

Journal Article•10.1016/J.APENERGY.2016.10.056•

Multi-agent systems applied for energy systems integration: State-of-the-art applications and trends in microgrids

[...]

Vitor Nazário Coelho¹, Vitor Nazário Coelho², Miri Weiss Cohen, Igor Machado Coelho³, Nian Liu⁴, Frederico Gadelha Guimarães² - Show less +2 more•Institutions (4)

Federal Fluminense University¹, Universidade Federal de Minas Gerais², Rio de Janeiro State University³, North China Electric Power University⁴

01 Feb 2017-Applied Energy

TL;DR: A review of state-of-the-art applications and trends in multi-agent system and smart microgrids and several combinatorial optimization problems opened to be improved and discussed along the next coming years are presented.

...read moreread less

Journal Article•10.1109/TAC.2017.2680602•

A Differential Game Approach to Multi-agent Collision Avoidance

[...]

Thulasi Mylvaganam¹, Mario Sassano², Alessandro Astolfi¹•Institutions (2)

Imperial College London¹, University of Rome Tor Vergata²

24 Apr 2017-IEEE Transactions on Automatic Control

TL;DR: In this article, a multi-agent system consisting of $N$ agents is considered and the problem of steering each agent from its initial position to a desired goal while avoiding collisions with obstacles and other agents is studied.

...read moreread less

Abstract: A multi-agent system consisting of $N$ agents is considered. The problem of steering each agent from its initial position to a desired goal while avoiding collisions with obstacles and other agents is studied. This problem, referred to as the multi-agent collision avoidance problem , is formulated as a differential game. Dynamic feedback strategies that approximate the feedback Nash equilibrium solutions of the differential game are constructed and it is shown that, provided certain assumptions are satisfied, these guarantee that the agents reach their targets while avoiding collisions.

...read moreread less

Journal Article•10.1109/TNNLS.2016.2609500•

Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games

[...]

Jinna Li¹, Hamidreza Modares², Tianyou Chai³, Frank L. Lewis⁴, Lihua Xie⁵ - Show less +1 more•Institutions (5)

Shenyang University of Chemical Technology¹, Missouri University of Science and Technology², Northeastern University (China)³, University of Texas at Arlington⁴, Nanyang Technological University⁵

17 Apr 2017-IEEE Transactions on Neural Networks

TL;DR: This paper develops an off-policy reinforcement learning (RL) algorithm to solve optimal synchronization of multiagent systems by using the framework of graphical games and shows that the optimal distributed policies found by the proposed algorithm satisfy the global Nash equilibrium and synchronize all agents to the leader.

...read moreread less

Abstract: This paper develops an off-policy reinforcement learning (RL) algorithm to solve optimal synchronization of multiagent systems. This is accomplished by using the framework of graphical games. In contrast to traditional control protocols, which require complete knowledge of agent dynamics, the proposed off-policy RL algorithm is a model-free approach, in that it solves the optimal synchronization problem without knowing any knowledge of the agent dynamics. A prescribed control policy, called behavior policy, is applied to each agent to generate and collect data for learning. An off-policy Bellman equation is derived for each agent to learn the value function for the policy under evaluation, called target policy, and find an improved policy, simultaneously. Actor and critic neural networks along with least-square approach are employed to approximate target control policies and value functions using the data generated by applying prescribed behavior policies. Finally, an off-policy RL algorithm is presented that is implemented in real time and gives the approximate optimal control policy for each agent using only measured data. It is shown that the optimal distributed policies found by the proposed algorithm satisfy the global Nash equilibrium and synchronize all agents to the leader. Simulation results illustrate the effectiveness of the proposed method.

...read moreread less

Journal Article•10.1016/J.SYSCONLE.2017.01.004•

Adaptive finite-time bipartite consensus for second-order multi-agent systems with antagonistic interactions

[...]

Lin Zhao¹, Lin Zhao², Yingmin Jia¹, Jinpeng Yu²•Institutions (2)

Beihang University¹, Qingdao University²

01 Apr 2017-Systems & Control Letters

TL;DR: The proposed algorithm is extended to solve the adaptive finite-time bipartite consensus tracking problem for leader–follower case by designing distributed finite- time estimator.

...read moreread less

Journal Article•10.1109/TAC.2016.2637005•

Distributed Velocity-Constrained Consensus of Discrete-Time Multi-Agent Systems With Nonconvex Constraints, Switching Topologies, and Delays

[...]

Peng Lin¹, Wei Ren², Huijun Gao•Institutions (2)

Central South University¹, University of California, Riverside²

01 Nov 2017-IEEE Transactions on Automatic Control

TL;DR: It is shown that the algorithm is robust to arbitrarily bounded communication delays and arbitrarily switching communication graphs provided that the union of the graphs has directed spanning trees among each certain time interval.

...read moreread less

Abstract: In this technical note, a distributed velocity-constrained consensus problem is studied for discrete-time multi-agent systems, where each agent's velocity is constrained to lie in a nonconvex set. A distributed constrained control algorithm is proposed to enable all agents to converge to a common point using only local information. The gains of the algorithm for all agents need not to be the same or predesigned and can be adjusted by each agent itself based on its own and neighbors' information. It is shown that the algorithm is robust to arbitrarily bounded communication delays and arbitrarily switching communication graphs provided that the union of the graphs has directed spanning trees among each certain time interval. The analysis approach is based on multiple novel model transformations, proper control parameter selections, boundedness analysis of state-dependent stochastic matrices1, exploitation of the convexity of stochastic matrices, and the joint connectivity of the communication graphs. Numerical examples are included to illustrate the theoretical results.

...read moreread less

Journal Article•10.1109/TIE.2017.2668983•

Distributed Multi-Agent System-Based Load Frequency Control for Multi-Area Power System in Smart Grid

[...]

Vijay P. Singh¹, Nand Kishor¹, Paulson Samuel¹•Institutions (1)

Motilal Nehru National Institute of Technology Allahabad¹

14 Feb 2017-IEEE Transactions on Industrial Electronics

TL;DR: The results of this study validate the feasibility of the proposed control, as well as the capability of the MAS for the operation of LFC in SG with changes in CT through a multi-agent system (MAS) technology.

...read moreread less

Abstract: This paper presents an intelligent controller for “load frequency control (LFC)” application in “smart grid (SG)”environment having changes in communication topology (CT) via a multi-agent system (MAS) technology. In this study, network-induced effects, time delay, and change in CT have been addressed to examine the system performance in a closed loop. An event-triggered control method is used to reduce the communication burden in a network. An intelligent controller based on reinforcement learning consists of two levels, estimator agent and controller agent, in each multi-area system. Particle swarm optimization is used to tune the controller parameters. Furthermore, the proposed control strategy and system architecture as MAS for LFC in SG are analyzed in detail, verified for various load conditions and different network configurations. In addition, mean-square error of the power system states with CT is also analyzed. The results of this study validate the feasibility of the proposed control, as well as the capability of the MAS for the operation of LFC in SG with changes in CT.

...read moreread less

Journal Article•10.1109/TSG.2016.2587741•

Multiagent-Based Optimal Microgrid Control Using Fully Distributed Diffusion Strategy

[...]

Ricardo de Azevedo¹, Mehmet H. Cintuglu¹, Tan Ma¹, Osama A. Mohammed¹•Institutions (1)

Florida International University¹

03 Apr 2017-IEEE Transactions on Smart Grid

TL;DR: The proposed algorithm is superior over consensus algorithms in terms of convergence speed and utilizes reduced communication infrastructure compared to centralized controllers, and can be deployed in real-world microgrids and offer superior decision making on optimal microgrid control.

...read moreread less

Abstract: This paper proposes a multiagent-based optimal microgrid control scheme using a fully distributed diffusion strategy. A two-level cooperative optimization multiagent system is adapted for distributed energy resources economic dispatch. The lower level implements an adaptive droop scheme based on online no-load frequency adjustments. The upper level implements distributed communication using diffusion between neighboring agents for optimal microgrid management. The proposed control scheme enables peer-to-peer communication among the agents without the necessity of a centralized controller, and simultaneously performs resource optimization while regulating the system frequency. The results are compared with centralized and consensus-based optimization algorithms. We have concluded that the proposed algorithm is superior over consensus algorithms in terms of convergence speed and utilizes reduced communication infrastructure compared to centralized controllers. Simulation demonstrations were conducted along with experimental results from a hardware-based microgrid using an industrial multiagent framework. The simulation and experimental results show that the proposed method and the agent framework can be deployed in real-world microgrids and offer superior decision making on optimal microgrid control.

...read moreread less

Journal Article•10.1109/TII.2016.2612645•

Multi-Agent Approach for Enhancing Security of Protection Schemes in Cyber-Physical Energy Systems

[...]

Shihanur Rahman¹, Apel Mahmud¹, Aman Maung Than Oo¹, Hemanshu R. Pota²•Institutions (2)

Deakin University¹, University of New South Wales²

01 Apr 2017-IEEE Transactions on Industrial Informatics

TL;DR: The problem of secure communication protocols is introduced and the comparative studies for enhancing the security of the protection systems are highlighted, using a benchmark power system under various fault and cyber attack scenarios.

...read moreread less

Abstract: This paper presents a distributed multiagent scheme to detect and identify cyber threats on the protection systems of power grids. The integration of information and communication technologies into existing power grids builds critical cyber-physical energy systems, in which digital relays are networked cyber-physical components subject to various cyber threats. Cyber attacks on protection systems may mimic real faults, cause component failure, and disable the communication links. Agents utilize both cyber and physical properties to reinforce the detection technique and further distinguish cyber attacks from physical faults. This paper also introduces the problem of secure communication protocols and highlights the comparative studies for enhancing the security of the protection systems. The proposed scheme is validated using a benchmark power system under various fault and cyber attack scenarios.

...read moreread less

Journal Article•10.1109/TAC.2016.2579264•

Linear Quadratic Risk-Sensitive and Robust Mean Field Games

[...]

Jun Moon¹, Tamer Basar²•Institutions (2)

Ulsan National Institute of Science and Technology¹, University of Illinois at Urbana–Champaign²

01 Mar 2017-IEEE Transactions on Automatic Control

TL;DR: It is proved that as the parameter characterizing this robustness becomes infinite, the two Nash equilibria become identical and equivalent to that of the risk-neutral case, as in the one-agent risk-sensitive and robust control theory.

...read moreread less

Abstract: This paper considers two classes of large population stochastic differential games connected to optimal and robust decentralized control of large-scale multiagent systems. The first problem ( P1 ) is one where each agent minimizes an exponentiated cost function, capturing risk-sensitive behavior, whereas in the second problem ( P2 ) each agent minimizes a worst-case risk-neutral cost function, where the “worst case” stems from the presence of an adversary entering each agent’s dynamics characterized by a stochastic differential equation. In both problems, the individual agents are coupled through the mean field term included in each agent’s cost function, which captures the average or mass behavior of the agents. We solve both P1 and P2 via mean field game theory. Specifically, we first solve a generic risk-sensitive optimal control problem and a generic stochastic zero-sum differential game, where the corresponding optimal controllers are applied by each agent to construct the mean field systems of P1 and P2 . We then characterize an approximated mass behavior effect on an individual agent via a fixed-point analysis of the mean field system. For each problem, P1 and P2 , we show that the approximated mass behavior is in fact the best estimate of the actual mass behavior in various senses as the population size, $N$ , goes to infinity. Moreover, we show that for finite $N$ , there exist $\epsilon$ - Nash equilibria for both P1 and P2 , where the corresponding individual Nash strategies are decentralized in terms of local state information and the approximated mass behavior. We also show that $\epsilon$ can be taken to be arbitrarily small when $N$ is sufficiently large. We show that the $\epsilon$ - Nash equilibria of P1 and P2 are partially equivalent in the sense that the individual Nash strategies share identical control laws, but the approximated mass behaviors for P1 and P2 are different, since in P2 , the mass behavior is also affected by the associated worst-case disturbance. Finally, we prove that the Nash equilibria for P1 and P2 both feature robustness, and as the parameter characterizing this robustness becomes infinite, the two Nash equilibria become identical and equivalent to that of the risk-neutral case, as in the one-agent risk-sensitive and robust control theory.

...read moreread less

Proceedings Article•10.1109/RED-UAS.2017.8101648•

Blockchain-based protocol of autonomous business activity for multi-agent systems consisting of UAVs

[...]

Aleksandr Kapitonov¹, Sergey Lonshakov, Aleksandr Krupenkin, Ivan Berman•Institutions (1)

Saint Petersburg State University of Information Technologies, Mechanics and Optics¹

1 Oct 2017

TL;DR: The article shows how to organize a communication system between agents in a peer-to-peer network using the decentralized Ethereum Blochchain technology and smart contracts and the experience of implementation an autonomous economic system with unmanned aerial vehicles (UAV).

...read moreread less

Abstract: This article describes a method of organizing the communication protocol, which allows agents of the multiagent system (MAS) to make decisions about their actions. Plan activities and interact with each other to perform tasks of modern industrial and business processes based on cyber-physical systems. The main attention is paid to those multi-agent systems, where autonomous agents — robots or smart things — participate in business processes among people, and their activities are organized in an unreliable and unknown environment. The article shows, how to organize a communication system between agents in a peer-to-peer network using the decentralized Ethereum Blochchain technology and smart contracts. The architecture of protocol of autonomous business activity, based on this communication method is given. As a result, the experience of implementation an autonomous economic system with unmanned aerial vehicles (UAV) is described.

...read moreread less

Journal Article•10.1109/TFUZZ.2016.2554151•

Adaptive Fuzzy Leader-Following Consensus Control for Stochastic Multiagent Systems with Heterogeneous Nonlinear Dynamics

[...]

Chang-E Ren¹, Long Chen¹, C. L. Philip Chen¹•Institutions (1)

University of Macau¹

01 Feb 2017-IEEE Transactions on Fuzzy Systems

TL;DR: This paper presents a new distributed consensus controller for each follower agent only based on local information that is measured or received from its neighbors and itself, and proves that all the follower agents can keep consensus with the leader, even though only a very small part ofollower agents can measure or receive the state information of the leader.

...read moreread less

Abstract: This paper focuses on the leader-following consensus control problem of multiagent systems in random vibration environment. The Ito stochastic systems with heterogeneous unknown dynamics and external disturbances are established to describe the agents in random vibration environment. The fuzzy logic systems are applied to approximate the unknown nonlinear dynamics, and one adaptive parameter is designed to decay the effect of external disturbances. We present a new distributed consensus controller for each follower agent only based on local information that is measured or received from its neighbors and itself. Under the consensus controller, we prove that all the follower agents can keep consensus with the leader, even though only a very small part of follower agents can measure or receive the state information of the leader. Furthermore, the states of all the follower agents are bounded in probability. Finally, the simulation results are provided to illustrate the effectiveness of the designed algorithm.

...read moreread less

...

Expand