K-Spin Hamiltonian for Quantum-Resolvable Markov Decision Processes

Jones, Eric B.; Graf, Peter; Kapit, Eliot; Jones, Wesley

doi:10.1007/s42484-020-00026-6

K-Spin Hamiltonian for Quantum-Resolvable Markov Decision Processes

Journal Article · Fri Oct 30 04:00:00 EDT 2020 · Quantum Machine Intelligence

DOI:https://doi.org/10.1007/s42484-020-00026-6· OSTI ID:1885979

Jones, Eric B.; Graf, Peter; Kapit, Eliot; Jones, Wesley

The Markov decision process is the mathematical formalization underlying the modern field of reinforcement learning when transition and reward functions are unknown. We derive a pseudo-Boolean cost function that is equivalent to a K-spin Hamiltonian representation of the discrete, finite, discounted Markov decision process with infinite horizon. This K-spin Hamiltonian furnishes a starting point from which to solve for an optimal policy using heuristic quantum algorithms such as adiabatic quantum annealing and the quantum approximate optimization algorithm on near-term quantum hardware. In arguing that the variational minimization of our Hamiltonian is approximately equivalent to the Bellman optimality condition for a prevalent class of environments we establish an interesting analogy with classical field theory. Along with proof-of-concept calculations to corroborate our formulation by simulated and quantum annealing against classical Q-Learning, we analyze the scaling of physical resources required to solve our Hamiltonian on quantum hardware.

Research Organization:: National Renewable Energy Laboratory (NREL), Golden, CO (United States)

Sponsoring Organization:: USDOE National Renewable Energy Laboratory (NREL), Laboratory Directed Research and Development (LDRD) Program

DOE Contract Number:: AC36-08GO28308

OSTI ID:: 1885979

Report Number(s):: NREL/JA-2C00-83527; MainId:84300; UUID:a9fde509-222a-4c32-86e8-e8c664189bdf; MainAdminID:65284

Journal Information:: Quantum Machine Intelligence, Journal Name: Quantum Machine Intelligence Vol. 2

Country of Publication:: United States

Language:: English

References (29)

The Complexity of Markov Decision Processes Papadimitriou, Christos H.; Tsitsiklis, John N. Mathematics of Operations Research, Vol. 12, Issue 3 https://doi.org/10.1287/moor.12.3.441	journal	August 1987
A graph cut algorithm for higher-order Markov Random Fields Fix, Alexander; Gruber, Aritanan; Boros, Endre 2011 International Conference on Computer Vision https://doi.org/10.1109/ICCV.2011.6126347	conference	November 2011
Quantum-Enhanced Machine Learning Dunjko, Vedran; Taylor, Jacob M.; Briegel, Hans J. Physical Review Letters, Vol. 117, Issue 13 https://doi.org/10.1103/PhysRevLett.117.130501	journal	September 2016
Path integrals and symmetry breaking for optimal control theory Kappen, H. J. Journal of Statistical Mechanics: Theory and Experiment, Vol. 2005, Issue 11 https://doi.org/10.1088/1742-5468/2005/11/P11011	journal	November 2005
The quantum adiabatic algorithm applied to random optimization problems: The quantum spin glass perspective Bapst, V.; Foini, L.; Krzakala, F. Physics Reports, Vol. 523, Issue 3 https://doi.org/10.1016/j.physrep.2012.10.002	journal	February 2013
Obstacles to quantum annealing in a planar embedding of XORSAT Patil, Pranay; Kourtis, Stefanos; Chamon, Claudio Physical Review B, Vol. 100, Issue 5 https://doi.org/10.1103/PhysRevB.100.054435	journal	August 2019
Direct implementation of an N-qubit controlled-unitary gate in a single step Kumar, Preethika Quantum Information Processing, Vol. 12, Issue 2 https://doi.org/10.1007/s11128-012-0465-9	journal	August 2012
Quantum-Enhanced Reinforcement Learning for Finite-Episode Games with Discrete State Spaces Neukart, Florian; Von Dollen, David; Seidel, Christian Frontiers in Physics, Vol. 5 https://doi.org/10.3389/fphy.2017.00071	journal	February 2018
Reinforcement learning is direct adaptive optimal control Sutton, R. S.; Barto, A. G.; Williams, R. J. IEEE Control Systems, Vol. 12, Issue 2, p. 19-22 https://doi.org/10.1109/37.126844	journal	April 1992
Glassy Phase of Optimal Quantum Control Day, Alexandre G. R.; Bukov, Marin; Weinberg, Phillip Physical Review Letters, Vol. 122, Issue 2 https://doi.org/10.1103/PhysRevLett.122.020601	journal	January 2019
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play Silver, David; Hubert, Thomas; Schrittwieser, Julian Science, Vol. 362, Issue 6419 https://doi.org/10.1126/science.aar6404	journal	December 2018
Reinforcement learning architecture for Web recommendations Golovin, N.; Rahm, E. International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. https://doi.org/10.1109/ITCC.2004.1286487	conference	January 2004
Random-Energy Model: Limit of a Family of Disordered Models Derrida, B. Physical Review Letters, Vol. 45, Issue 2 https://doi.org/10.1103/PhysRevLett.45.79	journal	July 1980
An Introduction To Quantum Field Theory Peskin, Michael E. https://doi.org/10.1201/9780429503559	book	January 2018
Optimised simulated annealing for Ising spin glasses Isakov, S. V.; Zintchenko, I. N.; Rønnow, T. F. Computer Physics Communications, Vol. 192 https://doi.org/10.1016/j.cpc.2015.02.015	journal	July 2015
Hard combinatorial problems and minor embeddings on lattice graphs Lucas, Andrew Quantum Information Processing, Vol. 18, Issue 7 https://doi.org/10.1007/s11128-019-2323-5	journal	May 2019
Performance of the quantum adiabatic algorithm on random instances of two optimization problems on regular hypergraphs Farhi, Edward; Gosset, David; Hen, Itay Physical Review A, Vol. 86, Issue 5 https://doi.org/10.1103/PhysRevA.86.052334	journal	November 2012
Advances in quantum reinforcement learning Dunjko, Vedran; Taylor, Jacob M.; Briegel, Hans J. 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) https://doi.org/10.1109/SMC.2017.8122616	conference	October 2017
Pseudo-Boolean optimization Boros, Endre; Hammer, Peter L. Discrete Applied Mathematics, Vol. 123, Issue 1-3 https://doi.org/10.1016/S0166-218X(01)00341-9	journal	November 2002
Quantum-enhanced deliberation of learning agents using trapped ions Dunjko, V.; Friis, N.; Briegel, H. J. New Journal of Physics, Vol. 17, Issue 2 https://doi.org/10.1088/1367-2630/17/2/023006	journal	January 2015
Markov processes as a tool in field theory Dynkin, E. B. Journal of Functional Analysis, Vol. 50, Issue 2 https://doi.org/10.1016/0022-1236(83)90066-6	journal	February 1983
Quantum Reinforcement Learning No authors listed IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 38, Issue 5 https://doi.org/10.1109/TSMCB.2008.925743	journal	October 2008
Basic protocols in quantum reinforcement learning with superconducting circuits Lamata, Lucas Scientific Reports, Vol. 7, Issue 1 https://doi.org/10.1038/s41598-017-01711-6	journal	May 2017
Projective simulation for artificial intelligence Briegel, Hans J.; De las Cuevas, Gemma Scientific Reports, Vol. 2, Issue 1 https://doi.org/10.1038/srep00400	journal	May 2012
Quantum annealing in the transverse Ising model Kadowaki, Tadashi; Nishimori, Hidetoshi Physical Review E, Vol. 58, Issue 5 https://doi.org/10.1103/PhysRevE.58.5355	journal	November 1998
On the computational complexity of Ising spin glass models Barahona, F. Journal of Physics A: Mathematical and General, Vol. 15, Issue 10 https://doi.org/10.1088/0305-4470/15/10/028	journal	October 1982
Elementary gates for quantum computation Barenco, Adriano; Bennett, Charles H.; Cleve, Richard Physical Review A, Vol. 52, Issue 5 https://doi.org/10.1103/PhysRevA.52.3457	journal	November 1995
Native three-body interaction in superconducting circuits Pedersen, Simon Panyella; Christensen, K. S.; Zinner, N. T. Physical Review Research, Vol. 1, Issue 3 https://doi.org/10.1103/PhysRevResearch.1.033123	journal	November 2019
From local to global ground states in Ising spin glasses Zintchenko, Ilia; Hastings, Matthew B.; Troyer, Matthias Physical Review B, Vol. 91, Issue 2 https://doi.org/10.1103/PhysRevB.91.024201	journal	January 2015

Similar Records

Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes

Journal Article · Thu Oct 15 00:00:00 EDT 2015 · Applied Mathematics and Optimization · OSTI ID:22722847

Quantum logic gate synthesis as a Markov decision process

Journal Article · Tue Oct 24 20:00:00 EDT 2023 · npj Quantum Information · OSTI ID:2216915

Related Subjects

Hamiltonian
K-spin
MATHEMATICS AND COMPUTING
Markov
quantum algorithms
quantum hardware
reinforcement learning

K-Spin Hamiltonian for Quantum-Resolvable Markov Decision Processes

Citation Formats

References (29)

Similar Records

Related Subjects