DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Bayesian Approach for Quantifying Data Scarcity when Modeling Human Behavior via Inverse Reinforcement Learning

Journal Article · · ACM Transactions on Computer-Human Interaction
DOI: https://doi.org/10.1145/3551388 · OSTI ID:1907166

Computational models that formalize complex human behaviors enable study and understanding of such behaviors. However, collecting behavior data required to estimate the parameters of such models is often tedious and resource intensive. Thus, estimating dataset size as part of data collection planning (also known as Sample Size Determination) is important to reduce the time and effort of behavior data collection while maintaining an accurate estimate of model parameters. In this paper, we present a sample size determination method based on Uncertainty Quantification (UQ) for a specific Inverse Reinforcement Learning (IRL) model of human behavior, in two cases: 1) pre-hoc experiment design—conducted in the planning stage before any data is collected, to guide the estimation of how many samples to collect; and 2) post-hoc dataset analysis—performed after data is collected, to decide if the existing dataset has sufficient samples and whether more data is needed. Here, we validate our approach in experiments with a realistic model of behaviors of people with Multiple Sclerosis (MS) and illustrate how to pick a reasonable sample size target. Our work enables model designers to perform a deeper, principled investigation of effects of dataset size on IRL.

Research Organization:
Univ. of Michigan, Ann Arbor, MI (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
SC0021398
OSTI ID:
1907166
Journal Information:
ACM Transactions on Computer-Human Interaction, Journal Name: ACM Transactions on Computer-Human Interaction Journal Issue: 1 Vol. 30; ISSN 1073-0516
Publisher:
Association for Computing Machinery (ACM)Copyright Statement
Country of Publication:
United States
Language:
English

References (69)

Multivariate Density Estimation: Theory, Practice, and Visualization book March 2015
Monte Carlo Statistical Methods book January 2004
Statistical Decision Theory and Bayesian Analysis book January 1985
Handbook of Uncertainty Quantification reference-book January 2017
Predicting the Relationship Between the Size of Training Sample and the Predictive Power of Classifiers book January 2004
Kullback-Leibler Divergence book January 2011
Bayesian Inverse Reinforcement Learning for Modeling Conversational Agents in a Virtual Environment book January 2014
Eigenbehaviors: identifying structure in routine journal April 2009
40 years of cognitive architectures: core cognitive abilities and practical applications journal July 2018
Inverse reinforcement learning from summary data journal June 2018
An Efficient Over-sampling Approach Based on Mean Square Error Back-propagation for Dealing with the Multi-class Imbalance Problem journal August 2014
Development and validation of the positive affect and well-being scale for the neurology quality of life (Neuro-QOL) measurement system journal March 2013
Simulation Based Optimal Design book January 2005
Ecological Momentary Assessment of Pain, Fatigue, Depressive, and Cognitive Symptoms Reveals Significant Daily Variability in Multiple Sclerosis journal November 2017
Pain, Fatigue, and Cognitive Symptoms Are Temporally Associated Within but Not Across Days in Multiple Sclerosis journal November 2017
How Do Pain, Fatigue, Depressive, and Cognitive Symptoms Relate to Well-Being and Social and Physical Functioning in the Daily Lives of Individuals With Multiple Sclerosis? journal November 2017
Towards a standard for pointing device evaluation, perspectives on 27 years of Fitts’ law research in HCI journal December 2004
Simulation-based optimal Bayesian experimental design for nonlinear systems journal January 2013
The Neurobiology of Multiple Sclerosis: Genes, Inflammation, and Neurodegeneration journal October 2006
Probability Theory book January 2003
The need for uncertainty quantification in machine-assisted medical decision making journal January 2019
Variational Inference: A Review for Statisticians journal July 2016
emcee : The MCMC Hammer
  • Foreman-Mackey, Daniel; Hogg, David W.; Lang, Dustin
  • Publications of the Astronomical Society of the Pacific, Vol. 125, Issue 925 https://doi.org/10.1086/670067
journal March 2013
Bayesian inference in physics journal September 2011
Effects of sample size in classifier design journal January 1989
Ability-Based Optimization of Touchscreen Interactions journal January 2018
Control of Gene Regulatory Networks Using Bayesian Inverse Reinforcement Learning journal July 2019
Bayesian calibration of computer models journal August 2001
The choice of sample size journal July 1997
Sample size determination: a review journal July 1997
Parameter Inference for Computational Cognitive Models with Approximate Bayesian Computation journal May 2019
Computational Rationality: Linking Mechanism and Behavior Through Bounded Utility Maximization journal March 2014
Verification, validation, and predictive capability in computational engineering and physics journal September 2004
Computational rationality: A converging paradigm for intelligence in brains, minds, and machines journal July 2015
Navigate like a cabbie conference September 2008
Human model evaluation in interactive supervised learning conference May 2011
Probabilistic pointing target prediction via inverse optimal control conference February 2012
Directing exploratory search conference January 2013
The effect of time-based cost of error in target-directed pointing tasks conference April 2013
Modeling and Understanding Human Routine Behavior conference May 2016
Supervised and Unsupervised Transfer Learning for Activity Recognition from Simple In-home Sensors conference November 2016
Leveraging Human Routine Models to Detect and Generate Human Behaviors conference May 2017
A Cognitive Model of How People Make Decisions Through Interaction with Visual Displays conference May 2017
RDeepSense
  • Yao, Shuochao; Zhao, Yiran; Shao, Huajie
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 1, Issue 4 https://doi.org/10.1145/3161181
journal January 2018
SenseGAN
  • Yao, Shuochao; Zhao, Yiran; Shao, Huajie
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 2, Issue 3 https://doi.org/10.1145/3264954
journal September 2018
Computational Modeling in Human-Computer Interaction conference May 2019
Rl-Klm conference March 2019
Computer-supported form design using keystroke-level modeling with reinforcement learning conference March 2019
Learning Cooperative Personalized Policies from Gaze Data conference October 2019
Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition
  • Adaimi, Rebecca; Thomaz, Edison
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 3, Issue 3 https://doi.org/10.1145/3351228
journal September 2019
Integrating Activity Recognition and Nursing Care Records
  • Inoue, Sozo; Lago, Paula; Hossain, Tahera
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 3, Issue 3 https://doi.org/10.1145/3351244
journal September 2019
Leveraging Routine Behavior and Contextually-Filtered Features for Depression Detection among College Students
  • Xu, Xuhai; Chikersal, Prerna; Doryab, Afsaneh
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 3, Issue 3 https://doi.org/10.1145/3351274
journal September 2019
A Systematic Study of Unsupervised Domain Adaptation for Robust Human-Activity Recognition
  • Chang, Youngjae; Mathur, Akhil; Isopoussu, Anton
  • Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Vol. 4, Issue 1 https://doi.org/10.1145/3380985
journal March 2020
Computational Rationality as a Theory of Interaction conference April 2022
Sample Size Planning for Statistical Power and Accuracy in Parameter Estimation journal January 2008
Predicting sample size required for classification performance journal February 2012
Multiple sclerosis: clinical profiling and data collection as prerequisite for personalized medicine approach journal August 2016
Some Practical Guidelines for Effective Sample Size Determination journal August 2001
Handbook of Markov Chain Monte Carlo book May 2011
On Information and Sufficiency journal March 1951
Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) journal August 2001
Bayesian Experimental Design: A Review journal August 1995
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning journal April 2018
Reinforcement Learning: A Survey journal January 1996
Ensemble samplers with affine invariance journal January 2010
The NASA Langley Multidisciplinary Uncertainty Quantification Challenge conference January 2014
The class imbalance problem: A systematic study1 journal November 2002
Discovering hidden time patterns in behavior: T-patterns and their detection journal March 2000
Ten simple rules for the computational modeling of behavioral data journal November 2019