skip to main content

DOE PAGESDOE PAGES

Title: Data preservation at the Fermilab Tevatron

The Fermilab Tevatron collider's data-taking run ended in September 2011, yielding a dataset with rich scientific potential. The CDF and D0 experiments each have approximately 9 PB of collider and simulated data stored on tape. A large computing infrastructure consisting of tape storage, disk cache, and distributed grid computing for physics analysis with the Tevatron data is present at Fermilab. The Fermilab Run II data preservation project intends to keep this analysis capability sustained through the year 2020 and beyond. To achieve this goal, we have implemented a system that utilizes virtualization, automated validation, and migration to new standards in both software and data storage technology and leverages resources available from currently-running experiments at Fermilab. Lastly, these efforts have also provided useful lessons in ensuring long-term data access for numerous experiments, and enable high-quality scientific output for years to come.
Authors:
 [1] ;  [2] ;  [2] ;  [3] ;  [2] ;  [2] ;  [2] ;  [2] ;  [2] ;  [2] ;  [2] ; ORCiD logo [2] ;  [2] ;  [2] ;  [2] ;  [2] ;  [4] ;  [5] ;  [2] ;  [3] more »;  [2] « less
  1. Sezione di Padova-Trento and Univ. of Padova, Padova (Italy)
  2. Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
  3. Univ. of Washington, Seattle, WA (United States)
  4. Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States); Univ. of Rochester, Rochester, NY (United States)
  5. Univ. of Arizona, Tucson, AZ (United States)
Publication Date:
Report Number(s):
FERMILAB-PUB-16-552-CD; arXiv:1701.07773
Journal ID: ISSN 0168-9002; 1510877; TRN: US1700951
Grant/Contract Number:
AC02-07CH11359; NSF-PHY-1247316
Type:
Accepted Manuscript
Journal Name:
Nuclear Instruments and Methods in Physics Research. Section A, Accelerators, Spectrometers, Detectors and Associated Equipment
Additional Journal Information:
Journal Volume: 851; Journal Issue: C; Journal ID: ISSN 0168-9002
Publisher:
Elsevier
Research Org:
Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
Sponsoring Org:
USDOE Office of Science (SC), High Energy Physics (HEP) (SC-25)
Country of Publication:
United States
Language:
English
Subject:
46 INSTRUMENTATION RELATED TO NUCLEAR SCIENCE AND TECHNOLOGY; 72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS; collider physics; data analysis; computational frameworks; data storage; detector simulation; data preservation
OSTI Identifier:
1345607
Alternate Identifier(s):
OSTI ID: 1416601

Amerio, S., Behari, S., Boyd, J., Brochmann, M., Culbertson, R., Diesburg, M., Freeman, J., Garren, L., Greenlee, H., Herner, K., Illingworth, R., Jayatilaka, B., Jonckheere, A., Li, Q., Naymola, S., Oleynik, G., Sakumoto, W., Varnes, E., Vellidis, C., Watts, G., and White, S.. Data preservation at the Fermilab Tevatron. United States: N. p., Web. doi:10.1016/j.nima.2017.01.043.
Amerio, S., Behari, S., Boyd, J., Brochmann, M., Culbertson, R., Diesburg, M., Freeman, J., Garren, L., Greenlee, H., Herner, K., Illingworth, R., Jayatilaka, B., Jonckheere, A., Li, Q., Naymola, S., Oleynik, G., Sakumoto, W., Varnes, E., Vellidis, C., Watts, G., & White, S.. Data preservation at the Fermilab Tevatron. United States. doi:10.1016/j.nima.2017.01.043.
Amerio, S., Behari, S., Boyd, J., Brochmann, M., Culbertson, R., Diesburg, M., Freeman, J., Garren, L., Greenlee, H., Herner, K., Illingworth, R., Jayatilaka, B., Jonckheere, A., Li, Q., Naymola, S., Oleynik, G., Sakumoto, W., Varnes, E., Vellidis, C., Watts, G., and White, S.. 2017. "Data preservation at the Fermilab Tevatron". United States. doi:10.1016/j.nima.2017.01.043. https://www.osti.gov/servlets/purl/1345607.
@article{osti_1345607,
title = {Data preservation at the Fermilab Tevatron},
author = {Amerio, S. and Behari, S. and Boyd, J. and Brochmann, M. and Culbertson, R. and Diesburg, M. and Freeman, J. and Garren, L. and Greenlee, H. and Herner, K. and Illingworth, R. and Jayatilaka, B. and Jonckheere, A. and Li, Q. and Naymola, S. and Oleynik, G. and Sakumoto, W. and Varnes, E. and Vellidis, C. and Watts, G. and White, S.},
abstractNote = {The Fermilab Tevatron collider's data-taking run ended in September 2011, yielding a dataset with rich scientific potential. The CDF and D0 experiments each have approximately 9 PB of collider and simulated data stored on tape. A large computing infrastructure consisting of tape storage, disk cache, and distributed grid computing for physics analysis with the Tevatron data is present at Fermilab. The Fermilab Run II data preservation project intends to keep this analysis capability sustained through the year 2020 and beyond. To achieve this goal, we have implemented a system that utilizes virtualization, automated validation, and migration to new standards in both software and data storage technology and leverages resources available from currently-running experiments at Fermilab. Lastly, these efforts have also provided useful lessons in ensuring long-term data access for numerous experiments, and enable high-quality scientific output for years to come.},
doi = {10.1016/j.nima.2017.01.043},
journal = {Nuclear Instruments and Methods in Physics Research. Section A, Accelerators, Spectrometers, Detectors and Associated Equipment},
number = C,
volume = 851,
place = {United States},
year = {2017},
month = {1}
}