
- Point-Based Policy Iteration Shihao Ji, Ronald Parr
- Hierarchical Linear/Constant Time SLAM Using Particle Filters for Dense Maps
- Reinforcement Learning Using Approximate Belief States
- Learning Probabilistic Motion Models for Mobile Robots Austin I. Eliazar ELIAZAR@CS.DUKE.EDU
- Hierarchical Control and Learning Markov Decision Processes
- Generalized Value Functions for Large Action Sets Jason Pazis JPAZIS@CS.DUKE.EDU
- Multiagent Planning with Factored MDPs Carlos Guestrin
- Learning in ZeroSum Team Markov Games using Factored Value Functions
- Solving Stackelberg Games with Uncertain Observability Dmytro Korzhyk, Vincent Conitzer, Ronald Parr
- Multi-step Multi-sensor Hider-Seeker Games Erik Halvorson, Vincent Conitzer and Ronald Parr
- An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning
- Analyzing Feature Generation for Value-Function Approximation Ronald Parr PARR@CS.DUKE.EDU
- Planning Aims for a Network of Horizontal and Overhead Sensors Erik Halvorson and Ronald Parr
- Efficient Selection of Disambiguating Actions for Stereo Vision Monika Schaeffer and Ronald Parr
- DP-SLAM 2.0 Austin I. Eliazar and Ronald Parr
- IJCAI01 workshop on Planning under Uncertainty and Incomplete Information (PRO2), pp. 67 75, Seattle, Washington, August 2001. Solving Factored POMDPs with Linear Value Functions
- International Joint Conference on Artificial Intelligence (IJCAI01), Seattle, Washington, August 2001. Maxnorm Projections for Factored MDPs
- Non-parametric Approximate Linear Programming for MDPs Jason Pazis and Ronald Parr
- Reinforcement Learning with Hierarchies of Machines \Lambda
- Linear Value Function Approximation Linear Models
- Planning Aims for a Network of Horizontal and Overhead Sensors
- Coordinated Reinforcement Learning Carlos Guestrin GUESTRIN@CS.STANFORD.EDU
- ModelFree Least Squares Policy Iteration Michail G. Lagoudakis
- Computing factored value functions for policies in structured MDPs Daphne Koller
- Security Games with Multiple Attacker Resources Dmytro Korzhyk, Vincent Conitzer, Ronald Parr
- Textured Occupancy Grids for Monocular Localization Without Features
- Art Cooking Morgan Kaufmann
- Copyright c 2005 by Austin Eliazar
- CS--2001--05 ModelFree LeastSquares Policy Iteration 1
- Approximating Optimal Policies for Partially Observable Stochastic Domains Ronald Parr, Stuart Russell
- Provably bounded optimal agents Stuart J. Russell
- Maxnorm Projections for Factored MDPs Carlos Guestrin
- DPSLAM: Fast, Robust Simultaneous Localization and Mapping Without Predetermined Landmarks
- Bayesian Fault Detection and Diagnosis in Dynamic Systems Computer Science Dept.
- Making Rational Decisions using Adaptive Utility Elicitation Urszula Chajewska
- Inference in Hybrid Networks: Theoretical Limits and Practical Algorithms Computer Science Department
- Complexity of Computing Optimal Stackelberg Strategies in Security Resource Allocation Games
- Kernelized Value Function Approximation for Reinforcement Learning Gavin Taylor GVTAYLOR@CS.DUKE.EDU
- Flexible Decomposition Algorithms for Weakly Coupled Markov Decision Problems
- Policy Iteration for Factored MDPs Daphne Koller
- Technical Report CS-2012-01 L1 Regularized Linear Temporal Difference Learning