Using a Simple Binomial Model to Assess Improvement in Predictive Capability: Sequential Bayesian Inference, Hypothesis Testing, and Power Analysis

Sigeti, David E.; Pelak, Robert A.

doi:10.2172/1050516

Technical Report · Tue Sep 11 00:00:00 EDT 2012

DOI: https://doi.org/10.2172/1050516 · OSTI ID: 1050516

Sigeti, David E. [1]; Pelak, Robert A. [1]

[1] Los Alamos National Laboratory

We present a Bayesian statistical methodology for identifying improvement in predictive simulations, including an analysis of the number of (presumably expensive) simulations that will need to be made in order to establish with a given level of confidence that an improvement has been observed. Our analysis assumes the ability to predict (or postdict) the same experiments with legacy and new simulation codes and uses a simple binomial model for the probability, θ, that, in an experiment chosen at random, the new code will provide a better prediction than the old. This model makes it possible to do statistical analysis with an absolute minimum of assumptions about the statistics of the quantities involved, at the price of discarding some potentially important information in the data. In particular, the analysis depends only on whether or not the new code predicts better than the old in any given experiment, and not on the magnitude of the improvement. We show how the posterior distribution for θ may be used, in a kind of Bayesian hypothesis testing, both to decide if an improvement has been observed and to quantify our confidence in that decision. We quantify the predictive probability that should be assigned, prior to taking any data, to the possibility of achieving a given level of confidence, as a function of sample size. We show how this predictive probability depends on the true value of θ and, in particular, how there will always be a region around θ = 1/2 where it is highly improbable that we will be able to identify an improvement in predictive capability, although the width of this region will shrink to zero as the sample size goes to infinity. We show how the posterior standard deviation may be used as a kind of 'plan B metric' in the case that the analysis shows that θ is close to 1/2, and argue that such a plan B should generally be part of hypothesis testing.
All the analysis presented in the paper is done with a general beta-function prior for θ, enabling sequential analysis in which a small number of new simulations may be done and the resulting posterior for θ used as a prior to inform the next stage of power analysis.
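The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the report: the function names, the 0.95 confidence threshold, and the restriction to integer Beta parameters (which lets the posterior tail probability be written as a finite binomial sum) are assumptions made here for illustration. The report itself works with a general beta prior.

```python
from math import comb, sqrt

def prob_theta_above_half(k, n, a=1, b=1):
    """Posterior P(theta > 1/2) after the new code wins k of n comparisons.

    Assumption for this sketch: a and b are positive integers, so the
    Beta(a + k, b + n - k) tail equals a binomial cumulative sum.
    """
    A, B = a + k, b + n - k      # conjugate update: posterior is Beta(A, B)
    m = A + B - 1
    # P(theta > x | Beta(A, B)) = P(Binomial(m, x) <= A - 1), with x = 1/2
    return sum(comb(m, j) for j in range(A)) / 2**m

def posterior_sd(k, n, a=1, b=1):
    """Standard deviation of the Beta(a + k, b + n - k) posterior."""
    A, B = a + k, b + n - k
    return sqrt(A * B / ((A + B) ** 2 * (A + B + 1)))

def prob_reaching_confidence(n, theta, conf=0.95, a=1, b=1):
    """Probability, for a hypothetical true theta, that n comparisons give
    a posterior P(theta > 1/2) of at least conf."""
    return sum(
        comb(n, k) * theta**k * (1 - theta) ** (n - k)
        for k in range(n + 1)
        if prob_theta_above_half(k, n, a, b) >= conf
    )

if __name__ == "__main__":
    # Near theta = 1/2 the chance of a confident decision stays low.
    for theta in (0.5, 0.6, 0.75, 0.9):
        print(theta, round(prob_reaching_confidence(20, theta), 4))
    # Sequential use: a first-stage posterior becomes the next prior.
    print(prob_theta_above_half(5, 6, a=1 + 4, b=1 + 1))
```

Passing a first-stage posterior's parameters as `a` and `b` mirrors the sequential use described above, and `posterior_sd` corresponds to the 'plan B' quantity for cases where θ turns out to be close to 1/2.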

Research Organization: Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)

Sponsoring Organization: DOE/LANL

DOE Contract Number: AC52-06NA25396

OSTI ID: 1050516

Report Number(s): LA-UR-12-24643; TRN: US201218%%1038

Country of Publication: United States

Language: English

Similar Records

Development and Validation of a Lifecycle-based Prognostics Architecture with Test Bed Validation

Technical Report · Thu Nov 06 00:00:00 EST 2014 · OSTI ID:1050516

Hines, J. Wesley; Upadhyaya, Belle; Sharp, Michael; +10 more

Risk Analysis of the Space Shuttle: Pre-Challenger Bayesian Prediction of Failure

Conference · Fri Feb 01 00:00:00 EST 2008 · OSTI ID:1050516

Kelly, Dana L

Statistical Evaluation of Experimental Determinations of Neutrino Mass Hierarchy

Journal Article · Sat Dec 01 00:00:00 EST 2012 · Phys.Rev.D · OSTI ID:1050516

Related Subjects

42 ENGINEERING
97 MATHEMATICAL METHODS AND COMPUTING
COMPUTER CODES
DISTRIBUTION
FORECASTING
HYPOTHESIS
PRICES
PROBABILITY
SIMULATION
STATISTICS
TESTING
