-
Authors: Zheng, Xin; Guo, Lei
Affiliations: Chinese Academy of Sciences; Chinese Academy of Sciences; University of Chinese Academy of Sciences, CAS
Abstract: It is well known that saturated output observations are prevalent in various practical systems and that the ℓ1-norm is more robust than the ℓ2-norm-based parameter estimation. Unfortunately, adaptive identification based on both saturated observations and the ℓ1-optimization turns out to be a challenging nonlinear problem, and has rarely been explored in the literature. Motivated by this and the need to fit with the ℓ1-based index of prediction accuracy in, e.g., ju...
-
Authors: Jacquet, Quentin; van Ackooij, Wim; Alasseur, Clemence; Gaubert, Stephane
Affiliations: Electricite de France (EDF); Institut Polytechnique de Paris; Ecole Polytechnique; Inria; Centre National de la Recherche Scientifique (CNRS)
Abstract: We consider a control problem for a heterogeneous population composed of agents able to switch at any time between different options. The controller aims to maximize an average gain per time unit, supposing that the population is of infinite size. This leads to an ergodic control problem for a mean-field Markov decision process in which the state space is a product of simplices, and the population evolves according to controlled linear dynamics. By exploiting contraction properties of the dyna...
-
Authors: Zhang, Xu; Vasconcelos, Marcos M.
Affiliations: Xidian University; Xidian University; State University System of Florida; Florida A&M University; Florida State University
Abstract: Collecting the most informative data from a large dataset distributed over a network is a fundamental problem in many fields, including control, signal processing, and machine learning. In this article, we establish a connection between selecting the most informative data and finding the top-k elements of a multiset. The top-k selection in a network can be formulated as a distributed nonsmooth convex optimization problem known as quantile estimation. Unfortunately, the lack of smoothness in th...
-
Authors: Zhao, Feiran; Sha, Xingyu; You, Keyou
Affiliations: Tsinghua University; Tsinghua University
Abstract: Learning policies in an asynchronous parallel way is essential to numerous successes of reinforcement learning for solving complex problems. However, their convergence has not been rigorously evaluated. To improve the theoretical understanding, we adopt the asynchronous parallel zero-order policy gradient (AZOPG) method to solve the continuous-time linear quadratic regulation problem. Specifically, multiple workers independently perform system rollouts to estimate zero-order policy gradients (...
-
Authors: Behrendt, Gabriel; Longmire, Matthew; Bell, Zachary I.; Hale, Matthew
Affiliations: University System of Georgia; Georgia Institute of Technology; State University System of Florida; University of Florida
Abstract: In this article, we present an algorithm that drives the outputs of a network of agents to jointly track the solution of a time-varying, strongly convex optimization problem. This algorithm is robust to asynchrony in the agents' operations, namely, first, computations of control inputs, second, linear measurements of network outputs, and third, communications of agents' inputs and outputs. We first show that our distributed asynchronous algorithm converges to the solution of a time-invariant f...
-
Authors: Arbelaiz, Juncal; Bamieh, Bassam; Hosoi, Anette E.; Jadbabaie, Ali
Affiliations: Massachusetts Institute of Technology (MIT); Princeton University; University of California System; University of California Santa Barbara; Massachusetts Institute of Technology (MIT); Massachusetts Institute of Technology (MIT); Massachusetts Institute of Technology (MIT)
Abstract: In this article, we consider the centralized optimal estimation problem in spatially distributed systems. We use the setting of spatially invariant systems as an idealization for which concrete and detailed results are given. Such estimators are known to have a degree of spatial localization in the sense that the estimator gains decay in space, with the spatial decay rates serving as a proxy for how far measurements need to be shared in an optimal distributed estimator. In particular, we exami...
-
Authors: Li, Shuai; Wang, Chen; Sun, Jinan; Zhang, Shikun; Xie, Guangming
Affiliations: Peking University; Peking University; Peking University
Abstract: In the domain of multiplayer pursuit-evasion games, it is crucial to address the practical aspects of the players' heterogeneity, the distributed control manner, and the pursuers' goal of minimum makespan. However, the three topics have received limited attention in existing literature, both separately and in combination. In this article, we address the multiplayer pursuit-evasion game integrating these key topics, where the pursuers with simple motions strive to capture as many evaders, chara...
-
Authors: Liu, Fengjiao; Rapakoulias, George; Tsiotras, Panagiotis
Affiliations: State University System of Florida; Florida A&M University; Florida State University; University System of Georgia; Georgia Institute of Technology
Abstract: In this article, we study the optimal control problem for steering the state covariance of a discrete-time linear stochastic system over a finite time horizon. First, we establish the existence and uniqueness of the optimal control law for a quadratic cost function. Then, we show the separation of the optimal mean and the covariance steering problems. We also develop efficient computational methods to solve for the optimal control law, which is identified as the solution to a semidefinite prog...
-
Authors: Simard, Joel D.; Nielsen, Christopher; Miller, Daniel E.
Affiliations: Imperial College London; University of Waterloo
Abstract: Linear parameter-varying (LPV) systems, which have dynamics that vary according to a scheduling parameter, are capable of representing a wide variety of nonlinear and time-varying dynamics. The LPV paradigm preserves well-understood linear design methods, although the stability analysis of these systems has remained difficult. In a recent paper, it is shown that under some stringent conditions, a linear continuous-time gain-scheduled output feedback controller can be designed to provide closed...
-
Authors: Guan, Tao; Li, Bin; Song, Yongduan; Duan, Guang-Ren
Affiliations: Sichuan University; Chongqing University; Harbin Institute of Technology
Abstract: This note studies the problem of rapid attitude control with high accuracy for rigid spacecraft undergoing unwinding-free performance. By designing a novel nonsingular sliding function, fixed-time convergence and unwinding-free performance are rigorously established on the sliding surface. In addition, by introducing a novel potential function to design the controller, unwinding-free performance is also ensured outside of the sliding surface. Furthermore, the chattering problem is tackled by p...