Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Parameterized MDPs and Reinforcement Learning Problems—A Maximum Entropy Principle-Based Framework

Journal Article · · IEEE Transactions on Cybernetics
 [1];  [1]
  1. Mechanical Science and Engineering Department and Coordinated Science Laboratory, University of Illinois at Urbana–,Champaign, Urbana, IL, USA

Not provided.

Research Organization:
Worcester Polytechnic Institute, MA (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE)
DOE Contract Number:
EE0009125
OSTI ID:
1980477
Journal Information:
IEEE Transactions on Cybernetics, Vol. 52, Issue 9; ISSN 2168-2267
Publisher:
IEEE
Country of Publication:
United States
Language:
English

References (27)

Probability Theory book January 2003
Toward Generalization of Automated Temporal Abstraction to Partially Observable Reinforcement Learning journal August 2015
Parameterized Markov decision process and its application to service rate control journal April 2015
Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control journal February 2021
Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning conference August 2019
Event-Triggered Distributed Control of Nonlinear Interconnected Systems Using Online Reinforcement Learning With Exploration journal September 2018
Reinforcement Learning with Parameterized Actions journal February 2016
Querying Beneficial Constraints Before Clustering Using Facility Location Analysis journal January 2018
Locally Weighted Ensemble Clustering journal May 2018
Wireless backhauling of 5G small cells: challenges and solution approaches journal October 2015
Location-aware self-organizing methods in femtocell networks journal December 2015
Simultaneous Facility Location and Path Optimization in Static and Dynamic Networks journal December 2020
Policy Search for the Optimal Control of Markov Decision Processes: A Novel Particle-Based Iterative Scheme journal November 2016
Entropy-Based Framework for Dynamic Coverage and Clustering Problems journal January 2012
Aggregation of Graph Models and Markov Chains by Deterministic Annealing journal October 2014
Protein structure alignment by deterministic annealing journal August 2004
Maximizing entropy over Markov processes journal September 2014
Linear Programming and Markov Decision Chains journal April 1979
Q-learning journal May 1992
A Scalable Approach to Combinatorial Library Design for Drug Discovery journal December 2007
Reinforcement learning from human reward: Discounting in episodic tasks conference September 2012
Maximal Entropy Random Walk for Region-Based Visual Saliency journal September 2014
Drn conference January 2018
Increasing the Action Gap: New Operators for Reinforcement Learning journal February 2016
Entropy Maximization for Constrained Markov Decision Processes conference October 2018
Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning journal September 2018
Robust control and model misspecification journal May 2006

Similar Records

A general maximum entropy framework for thermodynamic variational principles
Journal Article · 2014 · AIP Conference Proceedings · OSTI ID:22390756

Maximum-entropy principle and its application to the solution of inverse problems
Journal Article · 1981 · Sov. J. Particles Nucl. (Engl. Transl.); (United States) · OSTI ID:6132833

A Modified Maximum Entropy Inverse Reinforcement Learning Approach for Microgrid Energy Scheduling
Conference · 2023 · OSTI ID:2228804