-
作者:Chawla, Ronshee; Sankararaman, Abishek; Shakkottai, Sanjay
作者单位:University of Texas System; University of Texas Austin; University of California System; University of California Berkeley; University of Texas System; University of Texas Austin
摘要:We study a multiagent stochastic linear bandit with side information, parameterized by an unknown vector 0(*) ? R-d. The side information consists of a finite collection of low-dimensional subspaces, one of which contains 0(*). In our setting, agents can collaborate to reduce regret by sending recommendations across a communication graph connecting them. We present a novel decentralized algorithm, where agents communicate subspace indices with each other and each agent plays a projected varian...
-
作者:Furieri, Luca; Guo, Baiwei; Martin, Andrea; Ferrari-Trecate, Giancarlo
作者单位:Swiss Federal Institutes of Technology Domain; Ecole Polytechnique Federale de Lausanne
摘要:As we transition toward the deployment of data-driven controllers for black-box cyberphysical systems, complying with hard safety constraints becomes a primary concern. Two key aspects should be addressed when input-output data are corrupted by noise: how much uncertainty can one tolerate without compromising safety, and to what extent is the control performance affected? By focusing on finite-horizon constrained linear- quadratic problems, we provide an answer to these questions in terms of t...
-
作者:Massiani, Pierre-Francois; Heim, Steve; Solowjow, Friedrich; Trimpe, Sebastian
作者单位:RWTH Aachen University; Max Planck Society; Massachusetts Institute of Technology (MIT)
摘要:Safety constraints and optimality are important but sometimes conflicting criteria for controllers. Although these criteria are often solved separately with different tools to maintain formal guarantees, it is also common practice in reinforcement learning (RL) to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine the relationship of both safety and optimality to penalties, and formalize sufficient conditions for safe valu...
-
作者:Cheng, Daizhan; Zhang, Lijun; Bi, Dongyao
作者单位:Liaocheng University; Chinese Academy of Sciences; Northwestern Polytechnical University; Northwestern Polytechnical University
摘要:A logical function can be used to characterize a property of states of a Boolean network (BN), which is considered as an aggregation of states. The dynamics of a set of logical functions are called the dual dynamics of the set. To illustrate the dual dynamics of a given set, which characterizes our concerned properties of a BN, the invariant subspace containing the set of logical functions is proposed, and its properties are investigated. Then, the invariant subspace of Boolean control network...
-
作者:Su, Housheng; Wang, Xiaotian; Gao, Zhiwei
作者单位:Huazhong University of Science & Technology; Northumbria University
摘要:This article studies interval coordination problems for multiagent systems with antagonistic interactions. For strongly connected signed networks, it is shown that when the intersection of intervals imposed by agents is nonempty: the multiagent system achieves bipartite consensus with structurally balanced network; all agents' states must converge to 0, if the signed network is structurally unbalanced. We establish the consensus conditions for bipartite consensus and zero-value consensus by em...
-
作者:Yang, Nachuan; Tang, Jiawei; Wong, Yik Ben; Li, Yuzhe; Shi, Ling
作者单位:Hong Kong University of Science & Technology; Northeastern University - China
摘要:This technical article investigates the linear quadratic regulator (LQR) design for continuous-time positive linear systems. Based on positive systems theory and Lyapunov theory, the solvability and optimality of the positivity-preserving LQR problem are analyzed through the lens of optimization, and two projection theorems are derived for single-input and multi-input positive systems, respectively, which paves the way for developing a projected gradient descent algorithm. The proposed results...
-
作者:Barrau, Axel; Bonnabel, Silvere
作者单位:Safran S.A.; Universite PSL; MINES ParisTech
摘要:While many works exploiting an existing Lie group structure have been proposed for state estimation, in particular the invariant extended Kalman filter (IEKF), few papers address the construction of a group structure that allows casting a given system into the framework of invariant filtering. In this article, we introduce a large class of systems encompassing most problems involving a navigating vehicle encountered in practice. For those systems we introduce a novel methodology that systemati...
-
作者:Jin, Long; Wei, Lin; Li, Shuai
作者单位:Lanzhou University
摘要:In this technical article, to seek the optimal solution to time-dependent nonlinear optimization subject to linear inequality and equality constraints (TDNO-IEC), the gradient-based differential neural-solution, termed as GDN model, is proposed and researched. Notably, TDNO-IEC is first converted into the nonhomogeneous linear equation with the dynamic parameter. Second, differential neural-solution with the aid of gradient is designed. The contrastive theoretical analyses among the GDN model,...
-
作者:Bouvier, Jean-Baptiste; Xu, Kathleen; Ornik, Melkior
作者单位:University of Illinois System; University of Illinois Urbana-Champaign; Massachusetts Institute of Technology (MIT)
摘要:When failure is not an option, systems are designed to be resistant to various malfunctions, such as a loss of control authority over actuators. This malfunction consists in some actuators producing uncontrolled and, thus, possibly undesirable inputs with their full actuation range. After such a malfunction, a system is deemed resilient if its target is still reachable despite these undesirable inputs. However, the malfunctioning system might be significantly slower to reach its target compare...
-
作者:Mathias, Joel; Meyn, Sean; Moye, Robert; Warrington, Joseph
作者单位:Arizona State University; Arizona State University-Tempe; State University System of Florida; University of Florida; AstraZeneca
摘要:It is now well established that many electric loads are inherently flexible, and this flexibility can be harnessed to provide grid services identical to those obtained today through batteries and responsive generators. This article concerns the resource allocation problem associated with control of a large collection of heterogeneous loads. This problem is posed as a finite-horizon optimal control problem, in which the cost function reflects both the needs of the grid and the needs of the user...