Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes
- Sun Yat-Sen University, School of Mathematics and Computational Science (China)
This paper deals with a mean-variance problem for finite horizon semi-Markov decision processes. The state and action spaces are Borel spaces, and the reward function may be unbounded. The goal is to find an optimal policy with minimal finite horizon reward variance over the set of policies attaining a given mean. Using the theory of N-step contraction, we characterize the policies with a given mean and, under suitable conditions, convert the second moment of the finite horizon reward into the mean of an infinite horizon reward/cost generated by a discrete-time Markov decision process (MDP) with a two-dimensional state space and a new one-step reward/cost. We then establish the optimality equation and the existence of mean-variance optimal policies by applying existing results on discrete-time MDPs. We also provide a value iteration algorithm and a policy improvement algorithm for computing the value function and mean-variance optimal policies, respectively. In addition, a linear program and its dual are developed for solving the mean-variance problem.
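The abstract mentions a value iteration algorithm for computing the value function. As a minimal sketch of that idea, the toy example below runs finite-horizon value iteration (backward induction) on a small discrete-time MDP with two states and two actions. The paper's setting (Borel state/action spaces, semi-Markov dynamics, unbounded rewards) is far more general; the transition matrix `P` and reward matrix `r` here are made-up illustrative numbers, not from the paper.

```python
import numpy as np

# Toy MDP: 2 states, 2 actions.
# P[a, s, s'] = transition probability from s to s' under action a.
# r[a, s]     = one-step reward for taking action a in state s.
# These numbers are purely illustrative.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.7, 0.3]]])
r = np.array([[1.0, 0.0],
              [0.5, 2.0]])

def value_iteration(P, r, horizon):
    """Backward induction: V_t(s) = max_a { r[a,s] + sum_s' P[a,s,s'] V_{t+1}(s') }."""
    n_states = P.shape[1]
    V = np.zeros(n_states)           # terminal value V_T = 0
    for _ in range(horizon):
        Q = r + P @ V                # Q[a, s]: action values at this stage
        V = Q.max(axis=0)            # optimal value per state
    policy = (r + P @ V).argmax(axis=0)  # greedy first-stage action per state
    return V, policy
```

For a one-step horizon the optimal value is simply the best immediate reward in each state; longer horizons propagate future values back through the transition matrix.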
- OSTI ID: 22722847
- Journal Information: Applied Mathematics and Optimization, Vol. 72, Issue 2; Other Information: Copyright (c) 2015 Springer Science+Business Media New York; http://www.springer-ny.com; Country of input: International Atomic Energy Agency (IAEA); ISSN 0095-4616
- Country of Publication: United States
- Language: English