The Policy Iteration Algorithm for Average Continuous Control of Piecewise Deterministic Markov Processes
Journal Article
·
· Applied Mathematics and Optimization
- Escola Politecnica da Universidade de Sao Paulo, Departamento de Engenharia de Telecomunicacoes e Controle (Brazil)
- Universite Bordeaux I, IMB, Institut Mathematiques de Bordeaux, INRIA Bordeaux Sud Ouest, Team: CQFD (France)
The main goal of this paper is to apply the so-called policy iteration algorithm (PIA) for the long run average continuous control problem of piecewise deterministic Markov processes (PDMP's) taking values in a general Borel space and with compact action space depending on the state variable. In order to do that we first derive some important properties for a pseudo-Poisson equation associated to the problem. In the sequence it is shown that the convergence of the PIA to a solution satisfying the optimality equation holds under some classical hypotheses and that this optimal solution yields to an optimal control strategy for the average control problem for the continuous-time PDMP in a feedback form.
- OSTI ID:
- 21480258
- Journal Information:
- Applied Mathematics and Optimization, Vol. 62, Issue 2; Other Information: DOI: 10.1007/s00245-010-9099-4; Copyright (c) 2010 Springer Science+Business Media, LLC; ISSN 0095-4616
- Country of Publication:
- United States
- Language:
- English
Similar Records
Singular Perturbation for the Discounted Continuous Control of Piecewise Deterministic Markov Processes
Linearization Techniques for Controlled Piecewise Deterministic Markov Processes; Application to Zubov's Method
Efficient analysis of stochastic gene dynamics in the non-adiabatic regime using piecewise deterministic Markov processes
Journal Article
·
Wed Jun 15 00:00:00 EDT 2011
· Applied Mathematics and Optimization
·
OSTI ID:21480258
Linearization Techniques for Controlled Piecewise Deterministic Markov Processes; Application to Zubov's Method
Journal Article
·
Mon Oct 15 00:00:00 EDT 2012
· Applied Mathematics and Optimization
·
OSTI ID:21480258
Efficient analysis of stochastic gene dynamics in the non-adiabatic regime using piecewise deterministic Markov processes
Journal Article
·
Wed Jan 31 00:00:00 EST 2018
· Journal of the Royal Society Interface
·
OSTI ID:21480258