-
作者:Zehfroosh, Ashkan; Tanner, Herbert G.
作者单位:University of Delaware
摘要:This article presents a theoretical framework for probably approximately correct (PAC) multi-agent reinforcement learning (MARL) algorithms for Markov games. Using the idea of delayed Q-learning, this article extends the well-known Nash Q-learning algorithm to build a new PAC MARL algorithm for general-sum Markov games. In addition to guiding the design of a provably PAC MARL algorithm, the framework enables checking whether an arbitrary MARL algorithm is PAC. Comparative numerical results dem...
-
作者:Chen, Ci; Xie, Lihua; Jiang, Yi; Xie, Kan; Xie, Shengli
作者单位:Guangdong University of Technology; Nanyang Technological University; City University of Hong Kong; Guangdong University of Technology
摘要:In this article, we investigate the optimal output tracking problem for linear discrete-time systems with unknown dynamics using reinforcement learning (RL) and robust output regulation theory. This output tracking problem only allows to utilize the outputs of the reference system and the controlled system, rather than their states, and differs from most existing works that depend on the state of the system. The optimal tracking problem is formulated into a linear quadratic regulation problem ...
-
作者:Huang, Xiucai; Song, Yongduan
作者单位:Chongqing University
摘要:In this article, we investigate the distributed output tracking problem for networked uncertain nonlinear multi-inputs-multi-outputs (MIMO) strict-feedback systems with intermittent actuator faults under a directed protocol. By embedding some user-designed performance functions into a backstepping-like design procedure, a distributed robust control scheme is developed that exhibits several salient features: 1) relaxing the system controllability conditions by inserting some differentiable comp...
-
作者:Li, Pengfei; Kang, Yu; Wang, Tao; Zhao, Yun-Bo
作者单位:Chinese Academy of Sciences; University of Science & Technology of China, CAS
摘要:A disturbance prediction-based adaptive event-triggered model predictive control scheme is proposed for nonlinear systems in the presence of slowly varying disturbance. The optimal control problem in the model predictive control scheme is formulated by taking advantage of a proposed central path-based disturbance prediction approach, and the event-triggered mechanism is designed to be adaptive to the triggering interval. As a result, the proposed scheme improves the state prediction precision ...
-
作者:Wakaiki, Masashi
作者单位:Kobe University
摘要:We study the self-triggered stabilization of discrete-time linear systems with quantized state measurements. In the networked control system we consider, sensors may be spatially distributed and be connected to a self-triggering mechanism through finite data-rate channels. Each sensor independently encodes its measurements and sends them to the self-triggering mechanism. The self-triggering mechanism integrates quantized measurement data and then computes sampling times. Assuming that the clos...
-
作者:Kao, Yonggui; Ma, Suriguga; Xia, Hongwei; Wang, Changhong; Liu, Yunlong
作者单位:Harbin Institute of Technology; Inner Mongolia Normal University; Qufu Normal University; Harbin Institute of Technology
摘要:This article discusses the integral sliding mode control problem for a kind of periodically impulsive uncertain reaction-diffusion systems (IURDSs). A novel integral sliding surface containing impulsive effects and reaction-diffusion terms is constructed, such that the impulsive effects for IURDSs can be removed. A novel sliding mode controller with impulsive effects is designed to ensure the reachability of the specified sliding surface in a finite time interval. By means of linear matrix ine...
-
作者:Xie, Siyu; Wang, Le Yi
作者单位:Wayne State University; University of Electronic Science & Technology of China
摘要:Real-time optimization in cyber-physical network systems with unknown system parameters must integrate optimization and parameter estimation, leading to adaptive optimization problems. Such problems encounter fundamental conflict between optimization and system identifiability. Recently, a new method of employing a stochastic or periodic dither has been introduced to resolve this conflict and achieve convergence toward optimal solutions. However, adding a dither introduces persistent disturban...
-
作者:Zou, Suli; Lygeros, John
作者单位:Beijing Institute of Technology; Swiss Federal Institutes of Technology Domain; ETH Zurich
摘要:In this article, we address the problem of stochastic generalized Nash equilibrium (SGNE) seeking, where a group of noncooperative heterogeneous players aim at minimizing their expected cost under some unknown stochastic effects. Each player's strategy is constrained to a convex and compact set and should satisfy some global affine constraints. In order to decouple players' strategies under the global constraints, an extra player is introduced aiming at minimizing the violation of the coupling...
-
作者:Chacon, Juan; Chen, Mo; Fetecau, Razvan C. C.
作者单位:Simon Fraser University; Simon Fraser University
摘要:Autonomous coverage of a specified area by robots operating in close proximity with each other has many potential applications such as real-time monitoring of rapidly changing environments, and search and rescue; however, coordination and safety are two fundamental challenges. For coordination, we propose a distributed controller for covering moving, compact domains which consists in a double integrator with bounded input forces. This control policy is based on artificial potentials and alignm...
-
作者:Rong, Lina; Jiang, Guo-Ping; Xu, Shengyuan
作者单位:Nanjing University of Posts & Telecommunications; Nanjing University of Science & Technology
摘要:This article studies the distributed nonrecursive averaging filter design for the quantized consensus of discrete-time first-order multiagent systems over undirected and connected networks. The quantized consensus problem is cast into the robust consensus problem with sector bound uncertainties associated with the edges of communication graphs, and then an edge sensitivity design approach is utilized for the filter parameter design. Necessary and sufficient conditions for the filter parameter ...