Reinforcement Learning via Gaussian Processes with Neural Network Dual Kernels
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
While deep neural networks (DNNs) and Gaussian processes (GPs) are both popular tools for reinforcement learning, each has notable drawbacks on challenging problems. DNNs learn complex nonlinear embeddings but do not naturally quantify uncertainty and are often data-inefficient to train. GPs infer posterior distributions over functions, but popular kernels exhibit limited expressivity on complex, high-dimensional data. Fortunately, recently discovered conjugate and neural tangent kernel (NTK) functions encode the behavior of overparameterized neural networks in the kernel domain. We demonstrate that these kernels can be efficiently applied to regression and reinforcement learning problems by analyzing a baseline case study. We apply GPs with neural network dual kernels to solve reinforcement learning tasks for the first time. Using the well-understood mountain-car problem, we show that GPs equipped with dual kernels perform at least as well as those using the conventional radial basis function kernel. Finally, we conjecture that by inheriting the probabilistic rigor of GPs and the powerful embedding properties of DNNs, GPs with neural network dual kernels will empower future reinforcement learning models on difficult domains.
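The conjugate (NNGP) kernel mentioned in the abstract can be used as a drop-in replacement for a conventional kernel in standard GP regression. The sketch below is illustrative only and is not the paper's implementation: it uses Cho and Saul's arc-cosine kernel of order 1, which is the conjugate kernel of an infinitely wide one-hidden-layer ReLU network, inside a textbook GP posterior-mean computation. All function names (`arccos1_kernel`, `gp_posterior_mean`) and the toy dataset are hypothetical choices for this example.

```python
import numpy as np

def arccos1_kernel(X, Z):
    """Conjugate (NNGP) kernel of an infinitely wide one-hidden-layer
    ReLU network: Cho & Saul's arc-cosine kernel of order 1."""
    nx = np.linalg.norm(X, axis=1)                    # row norms ||x||
    nz = np.linalg.norm(Z, axis=1)                    # row norms ||z||
    cos = np.clip((X @ Z.T) / np.outer(nx, nz), -1.0, 1.0)
    theta = np.arccos(cos)                            # angle between inputs
    return (np.outer(nx, nz) / np.pi) * (
        np.sin(theta) + (np.pi - theta) * np.cos(theta)
    )

def gp_posterior_mean(X_train, y_train, X_test, noise=1e-2):
    """Standard GP regression posterior mean with the kernel above."""
    K = arccos1_kernel(X_train, X_train) + noise * np.eye(len(X_train))
    K_star = arccos1_kernel(X_test, X_train)
    return K_star @ np.linalg.solve(K, y_train)

# Toy 1D regression; a bias feature is appended so no input has zero norm.
X = np.linspace(-1.0, 1.0, 30).reshape(-1, 1)
X = np.hstack([X, np.ones_like(X)])
y = np.sin(3.0 * X[:, 0])
mean = gp_posterior_mean(X, y, X)
```

Swapping `arccos1_kernel` for an RBF kernel recovers the conventional baseline the paper compares against; the GP machinery itself is unchanged, which is what makes dual kernels straightforward to adopt.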
- Research Organization:
- Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC52-07NA27344
- OSTI ID:
- 1780581
- Report Number(s):
- LLNL-JRNL--808440; 1014384
- Journal Information:
- 2020 IEEE Conference on Games (CoG), Vol. 2020; ISSN 2325-4289
- Publisher:
- IEEE
- Country of Publication:
- United States
- Language:
- English