Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment

Perdue, G. N.; Ghosh, A.; Wospakrik, M.; Akbar, F.; Andrade, D. A.; Ascencio, M.; Bellantoni, L.; Bercellie, A.; Betancourt, M.; Vera, G. F.  R.  Caceres; Cai, T.; Carneiro, M. F.; Chaves, J.; Coplowe, D.; Motta, H. da; Díaz, G. A.; Felix, J.; Fields, L.; Fine, R.; Gago, A. M.; Galindo, R.; Golan, T.; Gran, R.; Han, J. Y.; Harris, D. A.; Jena, D.; Kleykamp, J.; Kordosky, M.; Lu, X. -G.; Maher, E.; Mann, W. A.; Marshall, C. M.; McFarland, K. S.; McGowan, A. M.; Messerly, B.; Miller, J.; Nelson, J. K.; Nguyen, C.; Norrick, A.; Nuruzzaman, Nuruzzaman; Olivier, A.; Patton, R.; Ramírez, M. A.; Ransome, R. D.; Ray, H.; Ren, L.; Rimal, D.; Ruterbories, D.; Schellman, H.; Salinas, C. J.  Solano; Su, H.; Upadhyay, S.; Valencia, E.; Wolcott, J.; Yaeggy, B.; Young, S.

doi:10.1088/1748-0221/13/11/P11020

Abstract

We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.
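The domain-adversarial idea in the abstract can be illustrated with a toy gradient computation. The sketch below is not the network used in the paper; it assumes a hypothetical model with a single scalar feature `f = w . x`, a logistic label head with weight `a`, a logistic domain head with weight `b`, and a gradient reversal strength `lam`. All of these names are illustrative. It shows only the key mechanism: the domain head is trained to tell the domains apart, while the feature extractor receives the domain gradient with its sign flipped, so features that separate the domains are penalized.

```python
import numpy as np


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-z))


def dann_gradients(x, y, d, w, a, b, lam):
    """Gradients for one event in a toy DANN with scalar feature f = w . x.

    x   : input vector
    y   : class label (0 or 1), or None for an unlabeled target-domain event
    d   : domain label (0 = source/simulation, 1 = target)
    w   : feature-extractor weights (hypothetical toy model)
    a, b: label-head and domain-head weights
    lam : gradient reversal strength
    Both heads use binary cross-entropy, so dLoss/dlogit = prediction - target.
    """
    f = float(np.dot(w, x))

    # Domain head: ordinary gradient for its own weight b.
    p_dom = sigmoid(b * f)
    dLd_df = (p_dom - d) * b
    grad_b = (p_dom - d) * f

    # Label head: only labeled (source) events contribute.
    if y is None:
        dLy_df, grad_a = 0.0, 0.0
    else:
        p_lab = sigmoid(a * f)
        dLy_df = (p_lab - y) * a
        grad_a = (p_lab - y) * f

    # Gradient reversal: the feature extractor sees the domain gradient
    # multiplied by -lam, while the label gradient passes through unchanged.
    grad_w = (dLy_df - lam * dLd_df) * x
    return grad_w, grad_a, grad_b


# One labeled source event and one unlabeled target event.
x = np.array([1.0, -2.0])
w = np.array([0.5, -0.25])
gw_src, ga_src, gb_src = dann_gradients(x, 1, 0, w, a=1.0, b=1.0, lam=0.5)
gw_tgt, ga_tgt, gb_tgt = dann_gradients(x, None, 1, w, a=1.0, b=1.0, lam=0.5)
print(gw_src, gw_tgt)
```

In a real training loop each parameter would be updated by subtracting a learning rate times its gradient; with `lam = 0` the feature extractor reduces to an ordinary classifier trained on the labeled domain only.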

Publication Date: 2018-11-26

Research Org.: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)

Sponsoring Org.: National Science Foundation (NSF); CNPq (Brazil); CoNaCyT (Mexico); IDI/IGI-UNI (Peru); Latin American Center for Physics (CLAF); Russian Ministry of Education and Science (Russia); National Science Centre of Poland; USDOE Office of Science (SC), High Energy Physics (HEP)

Contributing Org.: MINERvA Collaboration

OSTI Identifier: 1484978

Alternate Identifier(s): OSTI ID: 1487058

Report Number(s): arXiv:1808.08332; FERMILAB-PUB-18-432-CD-ND
Journal ID: ISSN 1748-0221

Grant/Contract Number: AC05-00OR22725; AC02-07CH11359

Resource Type: Accepted Manuscript

Journal Name: Journal of Instrumentation

Additional Journal Information: Journal Volume: 13; Journal Issue: 11; Journal ID: ISSN 1748-0221

Publisher: Institute of Physics (IOP)

Country of Publication: United States

Language: English

Subject: 72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS; 97 MATHEMATICS AND COMPUTING; Analysis and statistical methods; Pattern recognition, cluster finding, calibration and fitting methods; Neutrino detectors

Citation Formats


Perdue, G. N., Ghosh, A., Wospakrik, M., Akbar, F., Andrade, D. A., Ascencio, M., Bellantoni, L., Bercellie, A., Betancourt, M., Vera, G. F. R. Caceres, Cai, T., Carneiro, M. F., Chaves, J., Coplowe, D., Motta, H. da, Díaz, G. A., Felix, J., Fields, L., Fine, R., Gago, A. M., Galindo, R., Golan, T., Gran, R., Han, J. Y., Harris, D. A., Jena, D., Kleykamp, J., Kordosky, M., Lu, X. -G., Maher, E., Mann, W. A., Marshall, C. M., McFarland, K. S., McGowan, A. M., Messerly, B., Miller, J., Nelson, J. K., Nguyen, C., Norrick, A., Nuruzzaman, Nuruzzaman, Olivier, A., Patton, R., Ramírez, M. A., Ransome, R. D., Ray, H., Ren, L., Rimal, D., Ruterbories, D., Schellman, H., Salinas, C. J. Solano, Su, H., Upadhyay, S., Valencia, E., Wolcott, J., Yaeggy, B., and Young, S. Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment. United States: N. p., 2018. Web. doi:10.1088/1748-0221/13/11/P11020.

Perdue, G. N., Ghosh, A., Wospakrik, M., Akbar, F., Andrade, D. A., Ascencio, M., Bellantoni, L., Bercellie, A., Betancourt, M., Vera, G. F. R. Caceres, Cai, T., Carneiro, M. F., Chaves, J., Coplowe, D., Motta, H. da, Díaz, G. A., Felix, J., Fields, L., Fine, R., Gago, A. M., Galindo, R., Golan, T., Gran, R., Han, J. Y., Harris, D. A., Jena, D., Kleykamp, J., Kordosky, M., Lu, X. -G., Maher, E., Mann, W. A., Marshall, C. M., McFarland, K. S., McGowan, A. M., Messerly, B., Miller, J., Nelson, J. K., Nguyen, C., Norrick, A., Nuruzzaman, Nuruzzaman, Olivier, A., Patton, R., Ramírez, M. A., Ransome, R. D., Ray, H., Ren, L., Rimal, D., Ruterbories, D., Schellman, H., Salinas, C. J. Solano, Su, H., Upadhyay, S., Valencia, E., Wolcott, J., Yaeggy, B., & Young, S. (2018). Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment. United States. https://doi.org/10.1088/1748-0221/13/11/P11020

Perdue, G. N., Ghosh, A., Wospakrik, M., Akbar, F., Andrade, D. A., Ascencio, M., Bellantoni, L., Bercellie, A., Betancourt, M., Vera, G. F. R. Caceres, Cai, T., Carneiro, M. F., Chaves, J., Coplowe, D., Motta, H. da, Díaz, G. A., Felix, J., Fields, L., Fine, R., Gago, A. M., Galindo, R., Golan, T., Gran, R., Han, J. Y., Harris, D. A., Jena, D., Kleykamp, J., Kordosky, M., Lu, X. -G., Maher, E., Mann, W. A., Marshall, C. M., McFarland, K. S., McGowan, A. M., Messerly, B., Miller, J., Nelson, J. K., Nguyen, C., Norrick, A., Nuruzzaman, Nuruzzaman, Olivier, A., Patton, R., Ramírez, M. A., Ransome, R. D., Ray, H., Ren, L., Rimal, D., Ruterbories, D., Schellman, H., Salinas, C. J. Solano, Su, H., Upadhyay, S., Valencia, E., Wolcott, J., Yaeggy, B., and Young, S. 2018. "Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment". United States. https://doi.org/10.1088/1748-0221/13/11/P11020. https://www.osti.gov/servlets/purl/1484978.

@article{osti_1484978,

  title        = {Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment},

  author       = {Perdue, G. N. and Ghosh, A. and Wospakrik, M. and Akbar, F. and Andrade, D. A. and Ascencio, M. and Bellantoni, L. and Bercellie, A. and Betancourt, M. and Vera, G. F.  R.  Caceres and Cai, T. and Carneiro, M. F. and Chaves, J. and Coplowe, D. and Motta, H. da and Díaz, G. A. and Felix, J. and Fields, L. and Fine, R. and Gago, A. M. and Galindo, R. and Golan, T. and Gran, R. and Han, J. Y. and Harris, D. A. and Jena, D. and Kleykamp, J. and Kordosky, M. and Lu, X. -G. and Maher, E. and Mann, W. A. and Marshall, C. M. and McFarland, K. S. and McGowan, A. M. and Messerly, B. and Miller, J. and Nelson, J. K. and Nguyen, C. and Norrick, A. and Nuruzzaman, Nuruzzaman and Olivier, A. and Patton, R. and Ramírez, M. A. and Ransome, R. D. and Ray, H. and Ren, L. and Rimal, D. and Ruterbories, D. and Schellman, H. and Salinas, C. J.  Solano and Su, H. and Upadhyay, S. and Valencia, E. and Wolcott, J. and Yaeggy, B. and Young, S.},

  abstractNote = {We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.},

  doi          = {10.1088/1748-0221/13/11/P11020},

  journal      = {Journal of Instrumentation},

  number       = 11,

  volume       = 13,

  place        = {United States},

  year         = {2018},

  month        = {11}

}

Journal Article:

Free Publicly Available Full Text

Accepted Manuscript (DOE)

Publisher's Version of Record

https://doi.org/10.1088/1748-0221/13/11/P11020

Citation Metrics:

Cited by: 8 works

Citation information provided by Web of Science

Works referenced in this record:

The NuMI neutrino beam
journal, January 2016

Adamson, P.; Anderson, K.; Andrews, M.
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 806
DOI: 10.1016/j.nima.2015.08.063

Gradient-based learning applied to document recognition
journal, January 1998

Lecun, Y.; Bottou, L.; Bengio, Y.
Proceedings of the IEEE, Vol. 86, Issue 11
DOI: 10.1109/5.726791

ROOT — An object oriented data analysis framework
journal, April 1997

Brun, Rene; Rademakers, Fons
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 389, Issue 1-2
DOI: 10.1016/S0168-9002(97)00048-X

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
conference, June 2014

Razavian, Ali Sharif; Azizpour, Hossein; Sullivan, Josephine
2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
DOI: 10.1109/CVPRW.2014.131

Deep Learning and Its Application to LHC Physics
journal, October 2018

Guest, Dan; Cranmer, Kyle; Whiteson, Daniel
Annual Review of Nuclear and Particle Science, Vol. 68, Issue 1
DOI: 10.1146/annurev-nucl-101917-021019

A convolutional neural network neutrino event classifier
journal, September 2016

Aurisano, A.; Radovic, A.; Rocco, D.
Journal of Instrumentation, Vol. 11, Issue 09
DOI: 10.1088/1748-0221/11/09/P09001

Learning representations by back-propagating errors
journal, October 1986

Rumelhart, David E.; Hinton, Geoffrey E.; Williams, Ronald J.
Nature, Vol. 323, Issue 6088
DOI: 10.1038/323533a0

Vertex reconstruction of neutrino interactions using deep learning
conference, May 2017

Terwilliger, Adam M.; Perdue, Gabriel N.; Isele, David
2017 International Joint Conference on Neural Networks (IJCNN)
DOI: 10.1109/IJCNN.2017.7966131

Caffe: Convolutional Architecture for Fast Feature Embedding
conference, January 2014

Jia, Yangqing; Shelhamer, Evan; Donahue, Jeff
Proceedings of the ACM International Conference on Multimedia - MM '14
DOI: 10.1145/2647868.2654889

Neutrino-Nucleus Interactions
journal, November 2011

Gallagher, H.; Garvey, G.; Zeller, G. P.
Annual Review of Nuclear and Particle Science, Vol. 61, Issue 1
DOI: 10.1146/annurev-nucl-102010-130255

Design, calibration, and performance of the MINERvA detector
journal, April 2014

Aliaga, L.; Bagby, L.; Baldin, B.
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 743
DOI: 10.1016/j.nima.2013.12.053

The GENIE neutrino Monte Carlo generator
journal, February 2010

Andreopoulos, C.; Bell, A.; Bhattacharya, D.
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 614, Issue 1
DOI: 10.1016/j.nima.2009.12.009

Measurement of Ratios of $\nu_{\mu}$ Charged-Current Cross Sections on C, Fe, and Pb to CH at Neutrino Energies 2–20 GeV
journal, June 2014

Tice, B. G.; Datta, M.; Mousseau, J.
Physical Review Letters, Vol. 112, Issue 23
DOI: 10.1103/PhysRevLett.112.231801

Measurement of partonic nuclear effects in deep-inelastic neutrino scattering using MINERvA
journal, April 2016

Mousseau, J.; Wospakrik, M.; Aliaga, L.
Physical Review D, Vol. 93, Issue 7
DOI: 10.1103/PhysRevD.93.071101

Geant4—a simulation toolkit
journal, July 2003

Agostinelli, S.; Allison, J.; Amako, K.
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 506, Issue 3
DOI: 10.1016/S0168-9002(03)01368-8

MINERvA neutrino detector response measured with test beam data
journal, July 2015

Aliaga, L.; Altinok, O.; Araujo Del Castillo, C.
Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 789
DOI: 10.1016/j.nima.2015.04.003

A New Approach to Linear Filtering and Prediction Problems
journal, March 1960

Kalman, R. E.
Journal of Basic Engineering, Vol. 82, Issue 1
DOI: 10.1115/1.3662552

Neutrino flux predictions for the NuMI beam
journal, November 2016

Aliaga, L.; Kordosky, M.; Golan, T.
Physical Review D, Vol. 94, Issue 9
DOI: 10.1103/PhysRevD.94.092005

Evolving Deep Networks Using HPC
conference, January 2017

Young, Steven R.; Rose, Derek C.; Johnston, Travis
Proceedings of the Machine Learning on HPC Environments - MLHPC'17
DOI: 10.1145/3146347.3146355

Similar Records in DOE PAGES and OSTI.GOV collections:

Adversarial methods to reduce simulation bias in neutrino interaction event filtering at liquid argon time projection chambers

Journal Article Babicz, Marta ; Alonso-Monsalve, Saúl ; Dolan, S. ; ... - Physical Review. D.

For current and future neutrino oscillation experiments using large liquid argon time projection chambers (LAr-TPCs), a key challenge is identifying neutrino interactions from the pervading cosmic-ray background. Rejection of such background is often possible using traditional cut-based selections, but this typically requires the prior use of computationally expensive reconstruction algorithms. This work demonstrates an alternative approach of using a 3D submanifold sparse convolutional network trained on low-level information from the scintillation light signal of interactions inside LAr-TPCs. This technique is applied to example simulations from ICARUS, the far detector of the short baseline neutrino program at Fermilab. The results of ...
https://doi.org/10.1103/physrevd.105.112009

Domain adaptation techniques for improved cross-domain study of galaxy mergers

Conference Ćiprijanović, A. ; Kafkes, D. ; Jenkins, S. ; ...

In astronomy, neural networks are often trained on simulated data with the prospect of being applied to real observations. Unfortunately, simply training a deep neural network on images from one domain does not guarantee satisfactory performance on new images from a different domain. The ability to share cross-domain knowledge is the main advantage of modern deep domain adaptation techniques. Here we demonstrate the use of two techniques - Maximum Mean Discrepancy (MMD) and adversarial training with Domain Adversarial Neural Networks (DANN) - for the classification of distant galaxy mergers from the Illustris-1 simulation, where the two domains presented differ only ...

Domain Adaptation for Measurements of Strong Gravitational Lenses

Conference Swierc, Paxson ; Zhao, Yifan ; Ciprijanovic, Aleksandra ; ...

Upcoming surveys are predicted to discover galaxy-scale strong lenses on the order of $10^{5}$, making deep learning methods necessary in lensing data analysis. Currently, there is insufficient real lensing data to train deep learning algorithms, but the alternative of training only on simulated data results in poor performance on real data. Domain Adaptation may be able to bridge the gap between simulated and real datasets. We utilize domain adaptation for the estimation of Einstein radius (...
https://doi.org/10.2172/2246772

Domain Adaptation for Measurements of Strong Gravitational Lenses

Conference Swierc, Paxson ; Zhao, Yifan M. ; Ćiprijanović, Aleksandra ; ...

Upcoming surveys are predicted to discover galaxy-scale strong lenses on the order of ...

Domain Adaptation for Measurements of Strong Gravitational Lenses

Conference Swierc, Paxson ; Zhao, Yifan ; Ciprijanovic, Aleksandra ; ...

Upcoming surveys are predicted to discover galaxy-scale strong lenses on the magnitude of $10^{5}$, making deep learning methods necessary in lensing data analysis. Currently, there is insufficient real lensing data to train deep learning algorithms, but training only on simulated data results in poor performance on real data. Domain adaptation can bridge the gap between simulated and real datasets. We adopt domain adaptation on the estimation of Einstein radius in simulated galaxy-scale gravitational lensing images. We evaluate two domain adaptation techniques - domain adversarial neural networks (DANN) and maximum mean discrepancy (MMD). We train on a source domain of simulated ...
