Towards a Deep Unified Framework for Nuclear Reactor Perturbation Analysis
conference
November 2018
Multi-Temporal Abstraction with Time-Aware Deep Q-Learning for Septic Shock Prevention
conference
December 2021
Mastering the game of Go with deep neural networks and tree search
journal
January 2016
Epidemiology and Costs of Sepsis in the United States—An Analysis Based on Timing of Diagnosis and Severity Level*
journal
December 2018
Deep Reinforcement Learning with Double Q-Learning
journal
March 2016
Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision
journal
June 2021
Human-level control through deep reinforcement learning
journal
February 2015
DeepCare: A Deep Dynamic Memory Model for Predictive Medicine
book
January 2016
Prevalence, Underlying Causes, and Preventability of Sepsis-Associated Mortality in US Acute Care Hospitals
journal
February 2019
Individualized sepsis treatment using reinforcement learning
journal
November 2018
Bias and Variance Approximation in Value Function Estimates
journal
February 2007
The Arcade Learning Environment: An Evaluation Platform for General Agents
journal
May 2013
Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score
journal
July 2003
Temporal Belief Memory: Imputing Missing Data during RNN Training
conference
July 2018
An accident diagnosis algorithm using long short-term memory
journal
May 2018
Hyperbolically Discounted Temporal Difference Learning
journal
June 2010
The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care
journal
October 2018
Grandmaster level in StarCraft II using multi-agent reinforcement learning
journal
October 2019
Learning dexterous in-hand manipulation
journal
November 2019
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
journal
November 2007
DATA-GRU: Dual-Attention Time-Aware Gated Recurrent Unit for Irregular Multivariate Time Series
journal
April 2020
Reinforcement learning of motor skills with policy gradients
journal
May 2008
Reinforcement Learning in Continuous Time and Space
journal
January 2000
Management of severe sepsis: advances, challenges, and current status
journal
April 2015
Providing support to operators for monitoring safety functions using reinforcement learning
journal
January 2020
Temporal-Difference Reinforcement Learning with Distributed Representations
journal
October 2009
Condition-based probabilistic safety assessment for maintenance decision making regarding a nuclear power plant steam generator undergoing multiple degradation mechanisms
journal
November 2019
Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study
journal
January 2020
Doubly Robust Policy Evaluation and Optimization
journal
November 2014
Recent Temporal Pattern Mining for Septic Shock Early Prediction
conference
June 2018
Recurrent Neural Networks for Multivariate Time Series with Missing Values
journal
April 2018
Temporal Dropout of Changes Approach to Convolutional Learning of Spatio-Temporal Features
conference
November 2014
A Note on Measurement of Utility
journal
February 1937
Deep recurrent Q-learning of behavioral intervention delivery by a robot from demonstration data
conference
August 2017
Algorithm for Autonomous Power-Increase Operation Using Deep Reinforcement Learning and a Rule-Based System
journal
January 2020
ATTAIN: Attention-based Time-Aware LSTM Networks for Disease Progression Modeling
conference
August 2019
Unobserved Is Not Equal to Non-existent: Using Gaussian Processes to Infer Immediate Rewards Across Contexts
conference
August 2019
Scaling data-driven robotics with reward sketching and batch reinforcement learning
conference
July 2020
The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3)
journal
February 2016
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
journal
August 1999
Time-aware Subgroup Matrix Decomposition: Imputing Missing Data Using Forecasting Events
conference
December 2018
Early Diagnosis and Prediction of Sepsis Shock by Combining Static and Dynamic Information Using Convolutional-LSTM
conference
June 2018
TE-ESN: Time Encoding Echo State Network for Prediction Based on Irregularly Sampled Time Series Data
conference
August 2021
Discounting of Delayed Rewards: Models of Individual Choice
journal
November 1995
Patient Subtyping via Time-Aware LSTM Networks
Baytas, Inci M.; Xiao, Cao; Zhang, Xi
KDD '17: The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
https://doi.org/10.1145/3097983.3097997
conference
August 2017
First return, then explore
journal
February 2021
Adaptive Power Transformer Lifetime Predictions Through Machine Learning and Uncertainty Modeling in Nuclear Power Plants
journal
June 2019
LSTM for septic shock: Adding unreliable labels to reliable predictions
conference
December 2017
Reinforcement learning in continuous time: advantage updating
conference
January 1994
Predictors of Patients Who Present to the Emergency Department With Sepsis and Progress to Septic Shock Between 4 and 48 Hours of Emergency Department Arrival*
journal
May 2015
Prediction of peak values in time series data for prognostics of critical components in nuclear power plants
journal
January 2016