Equilibrium Control Policies for Markov Chains

by Andreas Malikopoulos

Publication Type

Conference Paper

Publication Date

December, 2011

Page Numbers

7093 to 7098

Conference Name

50th IEEE Conference on Decision and Control and European Control Conference

Conference Location

Orlando, Florida, United States of America

Conference Sponsor

IEEE, SIAM, Control Systems Society, INFORMS

Conference Date

Dec 12, 2011 - Dec 15, 2011

Abstract

The average cost criterion has held great intuitive appeal and has attracted considerable attention. It is widely employed when controlling dynamic systems that evolve stochastically over time by means of formulating an optimization problem to achieve long-term goals efficiently. The average cost criterion is especially appealing when the decision-making process is long compared to other timescales involved, and there is no compelling motivation to select short-term optimization. This paper addresses the problem of controlling a Markov chain so as to minimize the average cost per unit time. Our approach treats the problem as a dual constrained optimization problem. We derive conditions guaranteeing that a saddle point exists for the new dual problem and we show that this saddle point is an equilibrium control policy for each state of the Markov chain. For practical situations with constraints consistent to those we study here, our results imply that recognition of such saddle points may be of value in deriving in real time an optimal control policy.

Equilibrium Control Policies for Markov Chains

Abstract

Organizations