skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Bayesian estimation applied to multiple species

Abstract

Observed data are often contaminated by undiscovered interlopers, leading to biased parameter estimation. Here we present BEAMS (Bayesian estimation applied to multiple species) which significantly improves on the standard maximum likelihood approach in the case where the probability for each data point being ''pure'' is known. We discuss the application of BEAMS to future type-Ia supernovae (SNIa) surveys, such as LSST, which are projected to deliver over a million supernovae light curves without spectra. The multiband light curves for each candidate will provide a probability of being Ia (pure) but the full sample will be significantly contaminated with other types of supernovae and transients. Given a sample of N supernovae with mean probability, <P>, of being Ia, BEAMS delivers parameter constraints equal to N<P> spectroscopically confirmed SNIa. In addition BEAMS can be simultaneously used to tease apart different families of data and to recover properties of the underlying distributions of those families (e.g. the type-Ibc and II distributions). Hence BEAMS provides a unified classification and parameter estimation methodology which may be useful in a diverse range of problems such as photometric redshift estimation or, indeed, any parameter estimation problem where contamination is an issue.

Authors:
; ;  [1];  [2];  [2]
  1. Departement de Physique Theorique, Universite de Geneve, 24 quai Ernest Ansermet, CH-1211 Geneva 4 (Switzerland)
  2. (South Africa)
Publication Date:
OSTI Identifier:
20935250
Resource Type:
Journal Article
Resource Relation:
Journal Name: Physical Review. D, Particles Fields; Journal Volume: 75; Journal Issue: 10; Other Information: DOI: 10.1103/PhysRevD.75.103508; (c) 2007 The American Physical Society; Country of input: International Atomic Energy Agency (IAEA)
Country of Publication:
United States
Language:
English
Subject:
71 CLASSICAL AND QUANTUM MECHANICS, GENERAL PHYSICS; CLASSIFICATION; DISTRIBUTION; MATHEMATICAL MODELS; MAXIMUM-LIKELIHOOD FIT; PROBABILITY; SPECTRA; VISIBLE RADIATION

Citation Formats

Kunz, Martin, Bassett, Bruce A., Hlozek, Renee A., South African Astronomical Observatory, Observatory, Cape Town, South Africa and Department of Mathematics and Applied Mathematics, University of Cape Town, Rondebosch, 7700, Cape Town, and National Astrophysics and Space Science Programme, University of Cape Town, Rondebosch, 7700, Cape Town. Bayesian estimation applied to multiple species. United States: N. p., 2007. Web. doi:10.1103/PHYSREVD.75.103508.
Kunz, Martin, Bassett, Bruce A., Hlozek, Renee A., South African Astronomical Observatory, Observatory, Cape Town, South Africa and Department of Mathematics and Applied Mathematics, University of Cape Town, Rondebosch, 7700, Cape Town, & National Astrophysics and Space Science Programme, University of Cape Town, Rondebosch, 7700, Cape Town. Bayesian estimation applied to multiple species. United States. doi:10.1103/PHYSREVD.75.103508.
Kunz, Martin, Bassett, Bruce A., Hlozek, Renee A., South African Astronomical Observatory, Observatory, Cape Town, South Africa and Department of Mathematics and Applied Mathematics, University of Cape Town, Rondebosch, 7700, Cape Town, and National Astrophysics and Space Science Programme, University of Cape Town, Rondebosch, 7700, Cape Town. Tue . "Bayesian estimation applied to multiple species". United States. doi:10.1103/PHYSREVD.75.103508.
@article{osti_20935250,
title = {Bayesian estimation applied to multiple species},
author = {Kunz, Martin and Bassett, Bruce A. and Hlozek, Renee A. and South African Astronomical Observatory, Observatory, Cape Town, South Africa and Department of Mathematics and Applied Mathematics, University of Cape Town, Rondebosch, 7700, Cape Town and National Astrophysics and Space Science Programme, University of Cape Town, Rondebosch, 7700, Cape Town},
abstractNote = {Observed data are often contaminated by undiscovered interlopers, leading to biased parameter estimation. Here we present BEAMS (Bayesian estimation applied to multiple species) which significantly improves on the standard maximum likelihood approach in the case where the probability for each data point being ''pure'' is known. We discuss the application of BEAMS to future type-Ia supernovae (SNIa) surveys, such as LSST, which are projected to deliver over a million supernovae light curves without spectra. The multiband light curves for each candidate will provide a probability of being Ia (pure) but the full sample will be significantly contaminated with other types of supernovae and transients. Given a sample of N supernovae with mean probability, <P>, of being Ia, BEAMS delivers parameter constraints equal to N<P> spectroscopically confirmed SNIa. In addition BEAMS can be simultaneously used to tease apart different families of data and to recover properties of the underlying distributions of those families (e.g. the type-Ibc and II distributions). Hence BEAMS provides a unified classification and parameter estimation methodology which may be useful in a diverse range of problems such as photometric redshift estimation or, indeed, any parameter estimation problem where contamination is an issue.},
doi = {10.1103/PHYSREVD.75.103508},
journal = {Physical Review. D, Particles Fields},
number = 10,
volume = 75,
place = {United States},
year = {Tue May 15 00:00:00 EDT 2007},
month = {Tue May 15 00:00:00 EDT 2007}
}
  • The mixing of groundwaters with different ages in aquifers, groundwater age is more appropriately represented by a distribution rather than a scalar number. To infer a groundwater age distribution from environmental tracers, a mathematical form is often assumed for the shape of the distribution and the parameters of the mathematical distribution are estimated using deterministic or stochastic inverse methods. We found that the prescription of the mathematical form limits the exploration of the age distribution to the shapes that can be described by the selected distribution. In this paper, the use of freeform histograms as groundwater age distributions is evaluated.more » A Bayesian Markov Chain Monte Carlo approach is used to estimate the fraction of groundwater in each histogram bin. This method was able to capture the shape of a hypothetical gamma distribution from the concentrations of four age tracers. The number of bins that can be considered in this approach is limited based on the number of tracers available. The histogram method was also tested on tracer data sets from Holten (The Netherlands; 3H, 3He, 85Kr, 39Ar) and the La Selva Biological Station (Costa-Rica; SF 6, CFCs, 3H, 4He and 14C), and compared to a number of mathematical forms. According to standard Bayesian measures of model goodness, the best mathematical distribution performs better than the histogram distributions in terms of the ability to capture the observed tracer data relative to their complexity. Among the histogram distributions, the four bin histogram performs better in most of the cases. The Monte Carlo simulations showed strong correlations in the posterior estimates of bin contributions, indicating that these bins cannot be well constrained using the available age tracers. The fact that mathematical forms overall perform better than the freeform histogram does not undermine the benefit of the freeform approach, especially for the cases where a larger amount of observed data is available and when the real groundwater distribution is more complex than can be represented by simple mathematical forms.« less
  • We have developed a device which employs parallel dc electric and magnetic fields in combination with an array of collectors and accompanying electronic circuits to measure simultaneously the momentum distributions of different ion species from a laser produced plasma. This diagnostic system automatically separates the ion species according to mass to charge ratio and utilizes the cyclotron motion to provide the momentum separation. The principle of operation, design configuration, and circuitry for the system are presented along with representative data obtained from a series of laser-induced plasma experiments.
  • Cited by 9
  • High temperature gas sensors have been highly demanded for combustion process optimization and toxic emissions control, which usually suffer from poor selectivity. In order to solve this selectivity issue and identify unknown reducing gas species (CO, CH 4 , and CH 8 ) and concentrations, a high temperature resistive sensor array data set was built in this study based on 5 reported sensors. As each sensor showed specific responses towards different types of reducing gas with certain concentrations, based on which calibration curves were fitted, providing benchmark sensor array response database, then Bayesian inference framework was utilized to process themore » sensor array data and build a sample selection program to simultaneously identify gas species and concentration, by formulating proper likelihood between input measured sensor array response pattern of an unknown gas and each sampled sensor array response pattern in benchmark database. This algorithm shows good robustness which can accurately identify gas species and predict gas concentration with a small error of less than 10% based on limited amount of experiment data. These features indicate that Bayesian probabilistic approach is a simple and efficient way to process sensor array data, which can significantly reduce the required computational overhead and training data.« less