| | |
Summary: PolicyGradient Algorithms for
Partially Observable Markov
Decision Processes
Douglas Alexander Aberdeen
A thesis submitted for the degree of
Doctor of Philosophy at
The Australian National University
April 2003
c
# Douglas Alexander Aberdeen
Typeset in Computer Modern by T E X and L A T E X 2 # .
Except where otherwise indicated, this thesis is my own original work.
Douglas Alexander Aberdeen
25 April 2003
Acknowledgements
Academic
|