-
Authors: Jing, Gangshan; Bai, He; George, Jemin; Chakrabortty, Aranya; Sharma, Piyush K.
Affiliations: Chongqing University; Oklahoma State University System; Oklahoma State University - Stillwater; United States Department of Defense; US Army Research, Development & Engineering Command (RDECOM); US Army Research Laboratory (ARL); North Carolina State University
Abstract: Achieving distributed reinforcement learning (RL) for large-scale cooperative multiagent systems (MASs) is challenging because: 1) each agent has access to only limited information and 2) issues of scalability and sample efficiency emerge due to the curse of dimensionality. In this article, we propose a general distributed framework for sample-efficient cooperative multiagent reinforcement learning (MARL) by utilizing the structures of graphs involved in this problem. We introduce three coupli...
-
Authors: Yu, Xin; Lin, Wei
Affiliations: Jiangsu Normal University; University System of Ohio; Case Western Reserve University
Abstract: For stochastic nonlinear systems that are only continuous, and not necessarily locally Lipschitz or of linear growth, we study the problem of asymptotic stabilization in mean square (AS-in-MS) via sampled-data feedback. We begin by establishing the existence of solutions for a class of hybrid stochastic systems. With the aid of weighted homogeneity, we then prove that for stochastic homogeneous systems of degree zero, asymptotic stabilizability in mean square by homogeneous feedback implies asympt...
-
Authors: Rickenbach, Rahel; Kohler, Johannes; Scampicchio, Anna; Zeilinger, Melanie N.; Carron, Andrea
Affiliations: Swiss Federal Institutes of Technology Domain; ETH Zurich
Abstract: The problem of coverage control, i.e., of coordinating multiple agents to optimally cover an area, arises in various applications. However, coverage applications face two major challenges: 1) dealing with nonlinear dynamics while respecting system and safety-critical constraints and 2) performing the task in an initially unknown environment. We solve the coverage problem by using a hierarchical framework, in which references are calculated at a central server and passed to the agents' local mo...
-
Authors: Zhang, Weihai; Zhong, Shiyu; Jiang, Xiushan
Affiliations: Shandong University of Science & Technology; China University of Petroleum
Abstract: This article investigates the stochastic finite-time annular domain stability (SFTADS) and asynchronous H-infinity control for nonlinear stochastic switching Markov jump systems (SMJSs). First, the criterion of SFTADS of the system is given by the mode-dependent average dwell time method, and two results, which consider particular cases with no switching signal and no Markov jump, are obtained. Second, when there are asynchronous phenomena in both deterministic switching and Markov jump...
-
Authors: Fisher, Michael W.; Hug, Gabriela; Dorfler, Florian
Affiliations: University of Waterloo; Swiss Federal Institutes of Technology Domain; ETH Zurich
Abstract: Optimal linear feedback control design is a valuable but challenging problem due to the nonconvexity of the underlying optimization and the infinite dimensionality of the Hardy space of stabilizing controllers. A powerful class of techniques for solving optimal control problems involves using reparameterization to transform the control design into a convex but infinite-dimensional optimization. To make the problem tractable, historical work focuses on Galerkin-type finite-dimensional approxima...
-
Authors: Li, Zhengcai; Shi, Yang; Xu, Shengyuan; Xu, Huiling; Dong, Lewei
Affiliations: Nanjing University of Science & Technology; University of Victoria
Abstract: This article provides a secure distributed output feedback model predictive control (DOFMPC) solution for the leader-following consensus problems of homogeneous linear disturbed multi-agent systems against multiple cyber attacks. The false data injection (FDI) attacks on the sensor-controller communication channel and denial-of-service (DoS) attacks on the controller-actuator communication channel occur simultaneously. To defend against dual-channel multiple attacks, we propose a secure DOFMPC...
-
Authors: Wu, Jenq-Lang
Affiliations: National Taiwan Ocean University
Abstract: On the basis of the barrier method, a unified approach to the static output feedback sliding mode control of linear control systems with matched uncertainty is presented. Without coordinate transformations, a structured static output feedback sliding surface matrix is obtained by solving a constrained minimization problem, and then a globally stabilizing static output feedback sliding mode controller can be constructed directly. The Lagrange multiplier method is used to derive the necessary ...
-
Authors: He, Zhiyu; Bolognani, Saverio; He, Jianping; Dorfler, Florian; Guan, Xinping
Affiliations: Swiss Federal Institutes of Technology Domain; ETH Zurich; Shanghai Jiao Tong University
Abstract: Feedback optimization is a control paradigm that enables physical systems to autonomously reach efficient operating points. Its central idea is to interconnect optimization iterations in a closed loop with the physical plant. Since iterative gradient-based methods are extensively used to achieve optimality, feedback optimization controllers typically require knowledge of the steady-state sensitivity of the plant, which may not be easily accessible in some applications. In contrast, in this art...
-
Authors: Kharitenko, Andrey; Scherer, Carsten W.
Affiliations: University of Stuttgart; Eindhoven University of Technology
Abstract: In this note, it is shown that the famous multiplier absolute stability test of R. O'Shea, G. Zames, and P. Falb is necessary and sufficient if the set of Lur'e interconnections is lifted to a Kronecker structure. An explicit method to construct the destabilizing static nonlinearity is presented.
-
Authors: Haddad, Jack; Mirkin, Boris
Affiliations: Technion Israel Institute of Technology
Abstract: In the context of an event-based control paradigm, when the controller-to-plant channels are subject to event-driven time sampling, a new scheme of feedback model reference adaptive control is developed. The developed scheme can cope with multiple distinct input and state time delays in the dynamics description and mitigate the effect of uncertainties due to both the traditional and network-induced phenomena. The underlying idea is based on a suitable equivalent reformulation of the plant mode...