Scherrer, Bruno - INRIA & Laboratoire Lorrain de Recherche en Informatique et ses Applications (Loria), Université Henri Poincaré -Nancy-Université
What is Policy Iteration? Exact case analysis Approximate case analysis Application to Tetris Conclusion Analysis of Policy Iteration
Policy Iteration / Optimistic Policy Iteration Least-Squares Policy Iteration Experiments Least Squares Policy Iteration
Two Examples Relation and stability issues The unified oblique projection view Empirical comparison Should one compute the Temporal Difference fix
Recursive Least-Squares Off-policy Learning with Eligibility Traces