Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learning
Journal Article
·
· Physical Review. E
- Univ. of Ottawa, ON (Canada); National Research Council of Canada, Ottawa (Canada)
- Univ. of Ontario Institute of Technology, Oshawa (Canada)
- Univ. of Victoria, BC (Canada)
- Univ. of Ontario Institute of Technology, Oshawa (Canada); Vector Institute for Artificial Intelligence, Toronto (Canada)
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Univ. of Ontario Institute of Technology, Oshawa (Canada); Vector Institute for Artificial Intelligence, Toronto (Canada); Univ. of Ottawa, ON (Canada)
Here using a model heat engine, we show that neural-network-based reinforcement learning can identify thermodynamic trajectories of maximal efficiency. We consider both gradient and gradient-free reinforcement learning. We use an evolutionary learning algorithm to evolve a population of neural networks, subject to a directive to maximize the efficiency of a trajectory composed of a set of elementary thermodynamic processes; the resulting networks learn to carry out the maximally efficient Carnot, Stirling, or Otto cycles. When given an additional irreversible process, this evolutionary scheme learns a previously unknown thermodynamic cycle. Gradient-based reinforcement learning is able to learn the Stirling cycle, whereas an evolutionary approach achieves the optimal Carnot cycle. Our results show how the reinforcement learning strategies developed for game playing can be applied to solve physical problems conditioned upon path-extensive order parameters.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Basic Energy Sciences (BES)
- Grant/Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1860343
- Journal Information:
- Physical Review. E, Journal Name: Physical Review. E Journal Issue: 6 Vol. 104; ISSN 2470-0045
- Publisher:
- American Physical Society (APS)Copyright Statement
- Country of Publication:
- United States
- Language:
- English
| A reinforcement learning approach to rare trajectory sampling | text | January 2020 |
| Accelerating GMRES with Deep Learning in Real-Time | preprint | January 2021 |
Similar Records
Optimizing thermodynamic trajectories using evolutionary reinforcement learning
Evolutionary reinforcement learning of dynamical large deviations
Journal Article
·
Wed Mar 20 00:00:00 EDT 2019
· arXiv.org Repository
·
OSTI ID:1601197
Evolutionary reinforcement learning of dynamical large deviations
Journal Article
·
Mon Jul 27 20:00:00 EDT 2020
· Journal of Chemical Physics
·
OSTI ID:1783100