skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes

Journal Article · · Applied Mathematics and Optimization
 [1];  [2]
  1. Escola Politecnica da Universidade de Sao Paulo, Departamento de Engenharia de Telecomunicacoes e Controle (Brazil)
  2. Universite Bordeaux I, IMB, Institut Mathematiques de Bordeaux, INRIA Bordeaux Sud Ouest, Team: CQFD (France)

The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.

OSTI ID:
21480258
Journal Information:
Applied Mathematics and Optimization, Vol. 62, Issue 2; Other Information: DOI: 10.1007/s00245-010-9099-4; Copyright (c) 2010 Springer Science+Business Media, LLC; ISSN 0095-4616
Country of Publication:
United States
Language:
English