-
作者:Rockafellar, R. Tyrrell; Uryasev, Stan; Zabarankin, Michael
作者单位:University of Washington; University of Washington Seattle; State University System of Florida; University of Florida; Stevens Institute of Technology
摘要:A framework is set up in which linear regression, as a way of approximating a random variable by other random variables, can be carried out in a variety of ways, which, moreover, can be tuned to the needs of a particular model in finance, or operations research more broadly. Although the idea of adapting the form of regression to the circumstances at hand has already found advocates in promoting quantile regression as an alternative to classical least-squares approaches, it is carried here muc...
-
作者:Levi, Retsef; Lodi, Andrea; Sviridenko, Maxim
作者单位:Massachusetts Institute of Technology (MIT); University of Bologna; International Business Machines (IBM); IBM USA
摘要:We study the classical capacitated multi-item lot-sizing problem with hard capacities. There are N items, each of which has a specified sequence of demands over a finite planning horizon of T discrete periods; the demands are known in advance but can vary from period to period. All demands must be satisfied on time. Each order incurs a time-dependent fixed ordering cost regardless of the combination of items or the number of units ordered, but the total number of units ordered cannot exceed a ...
-
作者:So, Anthony Man-Cho; Ye, Yinyu; Zhang, Jiawei
作者单位:Chinese University of Hong Kong; Stanford University; New York University
摘要:We consider the problem of finding a low-rank approximate solution to a system of linear equations in symmetric, positive semidefinite matrices, where the approximation quality of a solution is measured by its maximum relative deviation, both above and below, from the prescribed quantities. We show that a simple randomized polynomial-time procedure produces a low-rank solution that has provably good approximation qualities. Our result provides a unified treatment of and generalizes several wel...
-
作者:Vladimirsky, Alexander
作者单位:Cornell University
摘要:Stochastic shortest path (SSP) problems arise in a variety of discrete stochastic control contexts. An optimal solution to such a problem is typically computed using the value function, which can be found by solving the corresponding dynamic programming equations. In the deterministic case, these equations can be often solved by highly efficient label-setting methods (such as Dijkstra's and Dial's algorithms). In this paper we de. ne and study a class of multimode stochastic shortest path (MSS...
-
作者:Lugosi, Gabor; Mannor, Shie; Stoltz, Gilles
作者单位:ICREA; Pompeu Fabra University; Pompeu Fabra University; McGill University; Universite PSL; Ecole Normale Superieure (ENS); Centre National de la Recherche Scientifique (CNRS); Centre National de la Recherche Scientifique (CNRS); Hautes Etudes Commerciales (HEC) Paris
摘要:We propose simple randomized strategies for sequential decision ( or prediction) under imperfect monitoring, that is, when the decision maker ( forecaster) does not have access to the past outcomes but rather to a feedback signal. The proposed strategies are consistent in the sense that they achieve, asymptotically, the best-possible average reward among all fixed actions. It was Rustichini [Rustichini, A. 1999. Minimizing regret: The general case. Games Econom. Behav. 29 224-243] who first pr...
-
作者:Helmes, Kurt; Roehl, Stefan
作者单位:Humboldt University of Berlin
摘要:We present a formula for the corner points of the multidimensional Hausdorff polytopes and show how this result can be used to improve linear programming models for computing, e.g., moments of exit time distributions of diffusion processes. Specifically, we compute the mean exit time of two-dimensional Brownian motion from the unit square, as well as higher moments of the exit time of time-space Brownian motion, i.e., the two-dimensional process composed of a one-dimensional Wiener process and...
-
作者:Jennings, Otis B.
作者单位:Duke University
摘要:We consider two serial single-server stations in heavy traffic. There are two job types: All jobs visit station I and then station 2. Station 1 processes jobs in an exhaustive service or gated service fashion; station 2 uses an arbitrary nonidling service discipline. Neither station incurs switchover delays. We prove two heavy-traffic limit theorems (HTLT) for the diffusion-scaled, two-dimensional total workload process: one for when the first station implements exhaustive service and the othe...
-
作者:Eisenbrand, Friedrich; Shmonin, Gennady
作者单位:Swiss Federal Institutes of Technology Domain; Ecole Polytechnique Federale de Lausanne
摘要:Parametric integer programming deals with a family of integer programs that is defined by the same constraint matrix but where the right-hand sides are points of a given polyhedron. The question is whether all these integer programs are feasible. Kannan showed that this can be checked in polynomial time if the number of variables in the integer programs is fixed and the polyhedron of right-hand sides has fixed affine dimension. In this paper, we extend this result by providing a polynomial alg...
-
作者:Belloni, Alexandre
作者单位:Duke University
摘要:where K is a n-dimensional convex set that contains the origin, parameters t > 0 and p > 0, and vertical bar vertical bar.vertical bar vertical bar is any norm. We also develop connections between these densities and geometric properties of K such as diameter, width of the recession cone, and others. Since f(t) is log-concave only if p >= 1, this framework also covers nonlog-concave densities. Moreover, we establish a new set inclusion characterization for convex sets. This leads to a new conc...
-
作者:Basu, Arnab; Bhattacharyya, Tirthankar; Borkar, Vivek S.
作者单位:Indian Institute of Management (IIM System); Indian Institute of Management Bangalore; Indian Institute of Science (IISC) - Bangalore; Tata Institute of Fundamental Research (TIFR)
摘要:A linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the o.d.e. method for stochastic approximation. The scheme is also extended to continuous state space processes.