-
作者:Ito, Kaito; Kashima, Kenji
作者单位:Institute of Science Tokyo; Tokyo Institute of Technology; Kyoto University
摘要:We consider an entropy-regularized version of optimal density control of deterministic discrete-time linear systems. Entropy regularization, or a maximum entropy (MaxEnt) method for optimal control, has attracted much attention especially in reinforcement learning due to its many advantages, such as a natural exploration strategy. Despite the merits, high-entropy control policies induced by the regularization introduce probabilistic uncertainty into systems, which severely limits the applicabi...
-
作者:Oliva, Gabriele; Franceschelli, Mauro; Gasparri, Andrea; Scala, Antonio
作者单位:University Campus Bio-Medico - Rome Italy; University of Cagliari; Roma Tre University; Consiglio Nazionale delle Ricerche (CNR); Istituto dei Sistemi Complessi (ISC-CNR)
摘要:In this article, we develop a general open multiagent systems (OMAS) framework over undirected graphs where the agents' interaction is, in general, nonlinear, time-varying, and heterogeneous, in that the agents interact with different pairwise interaction rules for each link, possibly nonlinear, which may change over time. In particular, assuming the agents interact by exchanging flows, which modify their states, our framework guarantees that the sum of the states of agents participating in th...
-
作者:Phogat, Karmvir Singh; Tsumura, Koji
作者单位:University of Tokyo
摘要:In automatic negotiation, an automated agent is trained to negotiate on behalf of a human negotiator. We consider that the domain of negotiations is known to both the agent and its opponents. In this setting, a greedy concession algorithm (GCA) is employed to find an optimal policy for the agent when the agent's belief about the opponent is known. However, the GCA is computationally expensive to certain class of policies. In this article, we propose a reverse GCA that is computationally less e...
-
作者:Corless, Martin J.; Coduti, Leonardo
作者单位:Purdue University System; Purdue University
摘要:In this article, we consider a network of agents running a linear or nonlinear consensus algorithm with the following goal: in addition to requiring that the outputs of all agents converge to the same value, we also require that a specified linear function of the agent states remains constant during the evolution of the consensus algorithm. To achieve this goal requires the construction of a matrix of weighting parameters with specific properties. In this article, we present a noniterative cen...
-
作者:Ferrante, Augusto; Hakimi-Moghaddam, Mojtaba
作者单位:University of Padua; Quchan University of Technology
摘要:In this article, we consider linear systems having real rational transfer matrices that may be improper. We investigate the properties of strictly positive real and weakly strictly positive real systems and their connections. Our first contribution is to establish a necessary and sufficient condition for a rational and possibly improper transfer matrix to be strictly positive real. Besides stability, this condition only involves the properties of the transfer matrix in the imaginary axis and a...
-
作者:Kao, Yonggui; Han, Yueqiao; Zhu, Yanzheng; Shu, Zhan
作者单位:Harbin Institute of Technology; Ocean University of China; Shandong University of Science & Technology; University of Alberta
摘要:In this article, the sliding-mode control (SMC) strategy is outlined for discrete-time singular Markovian jump systems with time-varying delays and time-varying transition probabilities (TPs). To simplify the complexities arising from the time-varying TPs in the Markov chain, the TPs in this study are reasonably considered to be finite piecewise-homogeneous. The variations of TPs are stochastic and governed by a higher level transition probability (HTP) matrix. It is acceptable for both the TP...
-
作者:Jimenez-Pastor, Antonio; Toller, Daniele; Tribastone, Mirco; Tschaikowski, Max; Vandin, Andrea
作者单位:Aalborg University; IMT School for Advanced Studies Lucca; Technical University of Denmark
摘要:Positive systems naturally arise in situations where the model tracks physical quantities. Although the linear case is well understood, analysis and controller design for nonlinear positive systems remain challenging. Model reduction methods can help tame this problem. Here, we propose a notion of model reduction for a class of positive bilinear systems with (bounded) matrix and exogenous controls. Our reduction, called proper positive lumping, aggregates the original system such that states o...
-
作者:Wang, Zheming; Berger, Guillaume O.; Jungers, Raphael M.
作者单位:Zhejiang University of Technology; Universite Catholique Louvain
摘要:We tackle uniform state feedback control of switched linear systems under arbitrary switching using scenario optimization. We propose a data-driven control framework, in which scenario programs are formulated to compute stabilizing state feedback control relying on a finite set of observations of trajectories with quadratic and sum of squares (SOS) Lyapunov functions. We do not require the exact dynamical model or the switching signal, and as a consequence, we aim at solving uniform stabilizat...
-
作者:Aforozi, Thomais A.; Rovithakis, George A.
作者单位:Aristotle University of Thessaloniki
摘要:In this article we consider designing tracking controllers for MIMO, pure-feedback systems, having unknown and partially nonconstant control directions. The proposed control scheme is static, continuous, and requires no hard calculations, analytic or numerical, to produce the control signal. Significant attribute is the enforcement of prescribed performance bounds in terms of steady-state accuracy and convergence rate. No prior knowledge regarding system nonlinearities is required and no high-...
-
作者:Arditti, Laura; Como, Giacomo; Fagnani, Fabio; Vanelli, Martina
作者单位:Polytechnic University of Turin; Lund University
摘要:We study dynamics in a network of interacting agents updating their binary states according to a time-varying threshold rule. Specifically, agents revise their state asynchronously by comparing the weighted average of the current states of their neighbors in the interaction network with possibly heterogeneous time-varying threshold values. Such thresholds are determined by an exogenous signal representing an external influence field, modeling the different agents' biases toward one state with ...