skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Data handling with SAM and art at the NOvA experiment

Journal Article · · Journal of Physics. Conference Series
 [1];  [2];  [3];  [4];  [5];  [4];  [4];  [6];  [6]
  1. Univ. of Cincinnati, Cincinnati, OH (United States)
  2. California Institute of Technology, Pasadena, CA (United States)
  3. Indiana Univ., Bloomington, IN (United States)
  4. Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
  5. Tufts Univ., Medford, MA (United States)
  6. Univ. of Minnesota, Minneapolis, MN (United States)

During operations, NOvA produces between 5,000 and 7,000 raw files per day with peaks in excess of 12,000. These files must be processed in several stages to produce fully calibrated and reconstructed analysis files. In addition, many simulated neutrino interactions must be produced and processed through the same stages as data. To accommodate the large volume of data and Monte Carlo, production must be possible both on the Fermilab grid and on off-site farms, such as the ones accessible through the Open Science Grid. To handle the challenge of cataloging these files and to facilitate their off-line processing, we have adopted the SAM system developed at Fermilab. SAM indexes files according to metadata, keeps track of each file's physical locations, provides dataset management facilities, and facilitates data transfer to off-site grids. To integrate SAM with Fermilab's art software framework and the NOvA production workflow, we have developed methods to embed metadata into our configuration files, art files, and standalone ROOT files. A module in the art framework propagates the embedded information from configuration files into art files, and from input art files to output art files, allowing us to maintain a complete processing history within our files. Embedding metadata in configuration files also allows configuration files indexed in SAM to be used as inputs to Monte Carlo production jobs. Further, SAM keeps track of the input files used to create each output file. Parentage information enables the construction of self-draining datasets which have become the primary production paradigm used at NOvA. In this study we will present an overview of SAM at NOvA and how it has transformed the file production framework used by the experiment.

Research Organization:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
AC02-07CH11359
OSTI ID:
1250776
Report Number(s):
FERMILAB-CONF-15-202-ND; 1413833
Journal Information:
Journal of Physics. Conference Series, Vol. 664, Issue 4; Conference: 21st International Conference on Computing in High Energy and Nuclear Physics, Okinawa (Japan), 13-17 Apr 2015; ISSN 1742-6588
Publisher:
IOP PublishingCopyright Statement
Country of Publication:
United States
Language:
English

Similar Records

Production Operations Management System
Technical Report · Sat Oct 19 00:00:00 EDT 2019 · OSTI ID:1250776

SAM Plug-in Development (Phase I Final Report)
Technical Report · Sun Sep 27 00:00:00 EDT 2020 · OSTI ID:1250776

Experience producing simulated events for the DZero experiment on the SAM-Grid
Conference · Wed Dec 01 00:00:00 EST 2004 · OSTI ID:1250776

Related Subjects