Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Converting sWeights to probabilities with density ratios

Journal Article · · Computer Physics Communications

The use of machine learning approaches continues to have many benefits in experimental nuclear and particle physics. One common issue is generating training data which is sufficiently realistic to give reliable results. Here we advocate using real experimental data as the source of training data and demonstrate how one might subtract background contributions through the use of probabilistic weights which can be readily applied to training data. The sPlot formalism is a common tool used to isolate distributions from different sources. However, the negative sWeights produced by the sPlot technique can cause training problems and poor predictive power. This article demonstrates how density ratio estimation can be applied to convert sWeights to event probabilities, which we call drWeights. The drWeights can then be applied to produce the distributions of interest and are consistent with direct use of the sWeights. This article will also show how decision trees are particularly well suited to convert sWeights, with the benefit of fast prediction rates and adaptability to aspects of experimental data such as the data sample size and proportions of different event sources. We also show that a density ratio product approach in which the initial drWeights are reweighted by an additional converter gives substantially better results.

Research Organization:
Thomas Jefferson National Accelerator Facility (TJNAF)
Sponsoring Organization:
US Department of Energy Office of Nuclear Energy; Science and Technology Facilities Council; U.S. Department of Energy; Office of Science; Nuclear Physics
Grant/Contract Number:
AC05-06OR23177
OSTI ID:
2999561
Report Number(s):
JLAB-PHY-24-4183; arXiv:2409.08183; DOE/OR/23177-7669
Journal Information:
Computer Physics Communications, Journal Name: Computer Physics Communications Vol. 318; ISSN 0010-4655
Publisher:
Elsevier BVCopyright Statement
Country of Publication:
United States
Language:
English

References (14)

Energy flow networks: deep sets for particle jets journal January 2019
: A statistical tool to unfold data distributions
  • Pivk, M.; Le Diberder, F. R.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 555, Issue 1-2 https://doi.org/10.1016/j.nima.2005.08.106
journal December 2005
The CLAS12 Spectrometer at Jefferson Laboratory
  • Burkert, V. D.; Elouadrhiri, L.; Adhikari, K. P.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 959 https://doi.org/10.1016/j.nima.2020.163419
journal April 2020
The CLAS12 Geant4 simulation
  • Ungaro, M.; Angelini, G.; Battaglieri, M.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 959 https://doi.org/10.1016/j.nima.2020.163422
journal April 2020
The CLAS12 forward electromagnetic calorimeter
  • Asryan, G.; Chandavar, Sh.; Chetry, T.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 959 https://doi.org/10.1016/j.nima.2020.163425
journal April 2020
Custom Orthogonal Weight functions (COWs) for event classification
  • Dembinski, Hans; Kenzie, Matthew; Langenbruch, Christoph
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 1040 https://doi.org/10.1016/j.nima.2022.167270
journal October 2022
Advanced event reweighting using multivariate analysis journal June 2012
Neural networks for full phase-space reweighting and parameter tuning journal May 2020
Neural resampler for Monte Carlo reweighting with preserved uncertainties journal October 2020
OmniFold: A Method to Simultaneously Unfold All Observables journal May 2020
Physics opportunities with the 12 GeV upgrade at Jefferson Lab journal December 2012
Parameter uncertainties in weighted unbinned maximum likelihood fits journal May 2022
Unbiased elimination of negative weights in Monte Carlo samples journal May 2022
T HE C ONTINUOUS E LECTRON B EAM A CCELERATOR F ACILITY : CEBAF at the Jefferson Laboratory journal December 2001

Similar Records

Extracting and Converting Quantitative Data into Human Error Probabilities
Conference · Wed Aug 01 00:00:00 EDT 2007 · OSTI ID:919554

Pairwise Association of Seismic Arrivals with Convolutional Neural Networks
Journal Article · Tue Jan 08 23:00:00 EST 2019 · Seismological Research Letters · OSTI ID:1492542

Semisupervised Learning for Seismic Monitoring Applications
Journal Article · Wed Oct 21 00:00:00 EDT 2020 · Seismological Research Letters · OSTI ID:1830513

Related Subjects