Data preservation at the Fermilab Tevatron
Abstract
The Fermilab Tevatron collider's data-taking run ended in September 2011, yielding a dataset with rich scientific potential. The CDF and D0 experiments each have nearly 9 PB of collider and simulated data stored on tape. A large computing infrastructure consisting of tape storage, disk cache, and distributed grid computing for physics analysis with the Tevatron data is present at Fermilab. The Fermilab Run II data preservation project intends to keep this analysis capability sustained through the year 2020 or beyond. To achieve this, we are implementing a system that utilizes virtualization, automated validation, and migration to new standards in both software and data storage technology as well as leveraging resources available from currently-running experiments at Fermilab. Furthermore, these efforts will provide useful lessons in ensuring long-term data access for numerous experiments throughout high-energy physics, and provide a roadmap for high-quality scientific output for years to come.
- Authors:
- Boyd, J.; Herner, K.; Jayatilaka, B.; Roser, R.; Sakumoto, W.
- Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
- Publication Date:
- Research Org.:
- Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- OSTI Identifier:
- 1250774
- Report Number(s):
- FERMILAB-CONF-15-219-CD
- Journal ID: ISSN 1742-6588; 1413816
- Grant/Contract Number:
- AC02-07CH11359
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Journal of Physics. Conference Series
- Additional Journal Information:
- Journal Volume: 664; Journal Issue: 3; Conference: 21st International Conference on Computing in High Energy and Nuclear Physics, Okinawa (Japan), 13-17 Apr 2015; Journal ID: ISSN 1742-6588
- Publisher:
- IOP Publishing
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 96 KNOWLEDGE MANAGEMENT AND PRESERVATION
Citation Formats
Boyd, J., Herner, K., Jayatilaka, B., Roser, R., and Sakumoto, W. Data preservation at the Fermilab Tevatron. United States: N. p., 2015. Web. doi:10.1088/1742-6596/664/3/032015.
Boyd, J., Herner, K., Jayatilaka, B., Roser, R., & Sakumoto, W. Data preservation at the Fermilab Tevatron. United States. https://doi.org/10.1088/1742-6596/664/3/032015
Boyd, J., Herner, K., Jayatilaka, B., Roser, R., and Sakumoto, W. "Data preservation at the Fermilab Tevatron". United States. https://doi.org/10.1088/1742-6596/664/3/032015. https://www.osti.gov/servlets/purl/1250774.
@article{osti_1250774,
title = {Data preservation at the Fermilab Tevatron},
author = {Boyd, J. and Herner, K. and Jayatilaka, B. and Roser, R. and Sakumoto, W.},
abstractNote = {The Fermilab Tevatron collider's data-taking run ended in September 2011, yielding a dataset with rich scientific potential. The CDF and D0 experiments each have nearly 9 PB of collider and simulated data stored on tape. A large computing infrastructure consisting of tape storage, disk cache, and distributed grid computing for physics analysis with the Tevatron data is present at Fermilab. The Fermilab Run II data preservation project intends to keep this analysis capability sustained through the year 2020 or beyond. To achieve this, we are implementing a system that utilizes virtualization, automated validation, and migration to new standards in both software and data storage technology as well as leveraging resources available from currently-running experiments at Fermilab. Furthermore, these efforts will provide useful lessons in ensuring long-term data access for numerous experiments throughout high-energy physics, and provide a roadmap for high-quality scientific output for years to come.},
doi = {10.1088/1742-6596/664/3/032015},
journal = {Journal of Physics. Conference Series},
number = 3,
volume = 664,
place = {United States},
year = {2015},
month = {12}
}