-
Authors: Roy, Arghyadip; Shakkottai, Sanjay; Srikant, R.
Affiliations: Indian Institute of Technology System (IIT System); Indian Institute of Technology (IIT) - Guwahati; University of Texas System; University of Texas Austin; University of Illinois System; University of Illinois Urbana-Champaign
Abstract: In the regret-based formulation of multi-armed bandit (MAB) problems, except in rare instances, much of the literature focuses on arms with independent and identically distributed (i.i.d.) rewards. In this article, we consider the problem of obtaining regret guarantees for MAB problems in which the rewards of each arm form a Markov chain that may not belong to a single-parameter exponential family. Achieving logarithmic regret in such problems is not difficult: a variation of standard Kull...
-
Authors: Sebastian, Eduardo; Montijano, Eduardo; Sagues, Carlos
Affiliations: University of Zaragoza
Abstract: This article presents ECO-DKF, the first Event-Triggered and Certifiable Optimal Distributed Kalman Filter. Our algorithm addresses two major issues inherent to distributed Kalman filters: fully distributed and scalable optimal estimation, and reduction of communication bandwidth usage. The first requires solving an NP-hard optimization problem, forcing relaxations that lose optimality guarantees over the original problem. Using only information from one-hop neighbors, we propose a tight s...
-
Authors: Zhu, Yanzheng; Xu, Nuo; Basin, Michael V.; Zhou, Donghua; Chen, Xinkai
Affiliations: Shandong University of Science & Technology; Huaqiao University; Ningbo University of Technology; Universidad Autonoma de Nuevo Leon; Shibaura Institute of Technology
Abstract: This technical note studies both the stability and tracking recovery problems for a class of continuous-time Markov jump piecewise-affine systems subject to sensor faults. A novel reconfigurable control design approach is proposed to recover the mean-square input-to-state stability of the closed-loop system and the tracking property for constant reference inputs. The key idea of this approach is to insert a reconfiguration block including a separate virtual sensor between the faulty system and the...
-
Authors: Elamvazhuthi, Karthik; Berman, Spring
Affiliations: University of California System; University of California Riverside; Arizona State University; Arizona State University-Tempe
Abstract: In this article, we consider the problem of stabilizing a class of degenerate stochastic processes, which are constrained to a bounded Euclidean domain or a compact smooth manifold, to a given target probability density. This stabilization problem arises in the field of swarm robotics, for example, in applications where a swarm of robots is required to cover an area according to a target probability density. Most existing works on modeling and control of robotic swarms that use partial differe...
-
Authors: Ma, Tianfu; Xu, Juanjuan; Wang, Wei; Zhang, Huanshui
Affiliations: Shandong University; Shandong University of Science & Technology
Abstract: In this article, we consider the discrete-time rational expectations model with state delays. In particular, the dynamic equation depends on both the history of states and the conditional expectation of future states. The novelty of this article lies in transforming the solvability of the rational expectations model into that of forward and backward stochastic difference equations (FBSDEs). The main contribution is to obtain the (unique) solvability and the explicit solutions for the FBSD...
-
Authors: Wu, Shizhen; Liang, Xiao; Fang, Yongchun; He, Wei
Affiliations: Nankai University
Abstract: The geometric maneuvering problem for underactuated vertical takeoff and landing vehicles is considered in this article. First, motivated by the needs of this study, input-to-state stability (ISS) theory is extended to the closed-set case with input restrictions taken into account. Next, the inner-outer loop hierarchical paradigm, originally proposed for tracking, is modified for the maneuvering problem, based on which the maneuvering problem is formulated as a closed-set stabilization problem. Then...
-
Authors: Sun, Libei; Huang, Xiucai; Song, Yongduan
Affiliations: Chongqing University
Abstract: This note investigates the decentralized stabilization problem for a class of interconnected systems in the presence of non-triangular structural uncertainties and time-varying parameters, where each subsystem exchanges information only with its neighbors and only intermittent (rather than continuous) states and inputs are to be utilized. To the best of the authors' knowledge, no solution existed prior to this work, despite the problem's high prevalence in practice. Two globally decentralized adapti...
-
Authors: Tao, Tian; Roy, Spandan; De Schutter, Bart; Baldi, Simone
Affiliations: Delft University of Technology; International Institute of Information Technology Hyderabad; Southeast University - China
Abstract: In this article, we propose a new practical synchronization protocol for multiple Euler-Lagrange systems that requires no structural linear-in-the-parameters (LIP) knowledge of the uncertainty and allows the agents to be interconnected before control design by unknown state-dependent interconnection terms. This setting is meant to overcome two standard a priori assumptions in the literature concerning uncertainty with LIP structure and absence of interaction among agents before designing the synchroniz...
-
Authors: Zhang, Pengfei; Zhang, Yiqun
Abstract: In this article, we study the analytical solution of a closed-loop game with two identical weak pursuers and one strong evader, using a one-/two-step Stackelberg approach. To this end, we present the basic idea that the solutions of the closed-loop game and an n-step Stackelberg game might be identical. In particular, we develop the optimal one-/two-step Stackelberg strategy for the evader, as well as the optimal state feedback/closed-loop strategy for the pursuers. On this basis, we proc...
-
Authors: Hossain, Md. Sumon; Trenn, Stephan
Affiliations: University of Groningen
Abstract: We propose a novel model-reduction approach for switched linear systems with a known switching signal. The class of considered systems encompasses switched systems with mode-dependent state dimension as well as impulsive systems. Our method is based on a suitable definition of (time-varying) reachability and observability Gramians, and we show that these Gramians admit precise interpretations in terms of input and output energy. Based on balancing the midpoint Gramians, we propose a piecewis...