Rucio: Scientific Data Management
Abstract
Rucio is an open-source software framework that provides scientific collaborations with the functionality to organize, manage, and access their data at scale. The data can be distributed across heterogeneous data centers at widely distributed locations. Rucio was originally developed to meet the requirements of the high-energy physics experiment ATLAS, and now is continuously extended to support the LHC experiments and other diverse scientific communities. In this article, we detail the fundamental concepts of Rucio, describe the architecture along with implementation details, and report operational experience from production usage.
- Authors:
- more »
- Publication Date:
- Research Org.:
- Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), High Energy Physics (HEP)
- Contributing Org.:
- CERN Collaboration
- OSTI Identifier:
- 1542972
- Report Number(s):
- arXiv:1902.09857; ATL-COM-SOFT-2018-100; FERMILAB-PUB-19-095-CD; 2510-2044
Journal ID: ISSN 2510-2036; oai:inspirehep.net:1722117
- Grant/Contract Number:
- AC02-07CH11359
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Computing and Software for Big Science
- Additional Journal Information:
- Journal Volume: 3; Journal Issue: 1; Journal ID: ISSN 2510-2036
- Publisher:
- Springer
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING; 96 KNOWLEDGE MANAGEMENT AND PRESERVATION
Citation Formats
Barisits, Martin, Beermann, Thomas, Berghaus, Frank, Bockelman, Brian, Bogado, Joaquin, Cameron, David, Christidis, Dimitrios, Ciangottini, Diego, Dimitrov, Gancho, Elsing, Markus, Garonne, Vincent, di Girolamo, Alessandro, Goossens, Luc, Guan, Wen, Guenther, Jaroslav, Javurek, Tomas, Kuhn, Dietmar, Lassnig, Mario, Lopez, Fernando, Magini, Nicolo, Molfetas, Angelos, Nairz, Armin, Ould-Saada, Farid, Prenner, Stefan, Serfon, Cedric, Stewart, Graeme, Vaandering, Eric, Vasileva, Petya, Vigne, Ralph, and Wegner, Tobias. Rucio: Scientific Data Management. United States: N. p., 2019.
Web. doi:10.1007/s41781-019-0026-3.
Barisits, Martin, Beermann, Thomas, Berghaus, Frank, Bockelman, Brian, Bogado, Joaquin, Cameron, David, Christidis, Dimitrios, Ciangottini, Diego, Dimitrov, Gancho, Elsing, Markus, Garonne, Vincent, di Girolamo, Alessandro, Goossens, Luc, Guan, Wen, Guenther, Jaroslav, Javurek, Tomas, Kuhn, Dietmar, Lassnig, Mario, Lopez, Fernando, Magini, Nicolo, Molfetas, Angelos, Nairz, Armin, Ould-Saada, Farid, Prenner, Stefan, Serfon, Cedric, Stewart, Graeme, Vaandering, Eric, Vasileva, Petya, Vigne, Ralph, & Wegner, Tobias. Rucio: Scientific Data Management. United States. https://doi.org/10.1007/s41781-019-0026-3
Barisits, Martin, Beermann, Thomas, Berghaus, Frank, Bockelman, Brian, Bogado, Joaquin, Cameron, David, Christidis, Dimitrios, Ciangottini, Diego, Dimitrov, Gancho, Elsing, Markus, Garonne, Vincent, di Girolamo, Alessandro, Goossens, Luc, Guan, Wen, Guenther, Jaroslav, Javurek, Tomas, Kuhn, Dietmar, Lassnig, Mario, Lopez, Fernando, Magini, Nicolo, Molfetas, Angelos, Nairz, Armin, Ould-Saada, Farid, Prenner, Stefan, Serfon, Cedric, Stewart, Graeme, Vaandering, Eric, Vasileva, Petya, Vigne, Ralph, and Wegner, Tobias. Fri .
"Rucio: Scientific Data Management". United States. https://doi.org/10.1007/s41781-019-0026-3. https://www.osti.gov/servlets/purl/1542972.
@article{osti_1542972,
title = {Rucio: Scientific Data Management},
author = {Barisits, Martin and Beermann, Thomas and Berghaus, Frank and Bockelman, Brian and Bogado, Joaquin and Cameron, David and Christidis, Dimitrios and Ciangottini, Diego and Dimitrov, Gancho and Elsing, Markus and Garonne, Vincent and di Girolamo, Alessandro and Goossens, Luc and Guan, Wen and Guenther, Jaroslav and Javurek, Tomas and Kuhn, Dietmar and Lassnig, Mario and Lopez, Fernando and Magini, Nicolo and Molfetas, Angelos and Nairz, Armin and Ould-Saada, Farid and Prenner, Stefan and Serfon, Cedric and Stewart, Graeme and Vaandering, Eric and Vasileva, Petya and Vigne, Ralph and Wegner, Tobias},
abstractNote = {Rucio is an open-source software framework that provides scientific collaborations with the functionality to organize, manage, and access their data at scale. The data can be distributed across heterogeneous data centers at widely distributed locations. Rucio was originally developed to meet the requirements of the high-energy physics experiment ATLAS, and now is continuously extended to support the LHC experiments and other diverse scientific communities. In this article, we detail the fundamental concepts of Rucio, describe the architecture along with implementation details, and report operational experience from production usage.},
doi = {10.1007/s41781-019-0026-3},
journal = {Computing and Software for Big Science},
number = 1,
volume = 3,
place = {United States},
year = {Fri Aug 09 00:00:00 EDT 2019},
month = {Fri Aug 09 00:00:00 EDT 2019}
}
Free Publicly Available Full Text
Publisher's Version of Record
Other availability
Figures / Tables:
Fig. 1: The namespace is organized with collections and files. Collections can either be containers or datasets. Containers consist of containers or datasets. Datasets consist of files only. Files can be in multiple datasets
All figures and tables
(11 total)
Save to My Library
You must Sign In or Create an Account in order to save documents to your library.
Works referenced in this record:
Machine learning of network metrics in ATLAS Distributed Data Management
journal, October 2017
- Lassnig, Mario; Toler, Wesley; Vamosi, Ralf
- Journal of Physics: Conference Series, Vol. 898
Testing as a Service with HammerCloud
journal, June 2014
- Llamas, Ramón Medrano; Barrand, Quentin; Elmsheuser, Johannes
- Journal of Physics: Conference Series, Vol. 513, Issue 6
ATLAS Replica Management in Rucio: Replication Rules and Subscriptions
journal, June 2014
- Barisits, M.; Serfon, C.; Garonne, V.
- Journal of Physics: Conference Series, Vol. 513, Issue 4
The GridSite Web/Grid security system
journal, April 2010
- McNab, Andrew; Li, Yibiao
- Journal of Physics: Conference Series, Vol. 219, Issue 6
FTS3: New Data Movement Service For WLCG
journal, June 2014
- Ayllon, A. A.; Salichos, M.; Simon, M. K.
- Journal of Physics: Conference Series, Vol. 513, Issue 3
LHCOPN and LHCONE: Status and Future Evolution
journal, December 2015
- Martelli, E.; Stancu, S.
- Journal of Physics: Conference Series, Vol. 664, Issue 5
Overview of ATLAS PanDA Workload Management
journal, December 2011
- Maeno, T.; De, K.; Wenaus, T.
- Journal of Physics: Conference Series, Vol. 331, Issue 7
Multilevel Workflow System in the ATLAS Experiment
journal, May 2015
- Borodin, M.; De, K.; Navarro, J. Garcia
- Journal of Physics: Conference Series, Vol. 608
EOS as the present and future solution for data storage at CERN
journal, December 2015
- Peters, Aj; Sindrilaru, Ea; Adde, G.
- Journal of Physics: Conference Series, Vol. 664, Issue 4
Managing ATLAS data on a petabyte-scale with DQ2
journal, July 2008
- Branco, M.; Cameron, D.; Gaidioz, B.
- Journal of Physics: Conference Series, Vol. 119, Issue 6
The ATLAS Data Acquisition System in LHC Run 2
journal, October 2017
- Panduro Vazquez, William
- Journal of Physics: Conference Series, Vol. 898
The new CERN tape software - getting ready for total performance
journal, December 2015
- Cano, E.; Murray, S.; Kruse, D. F.
- Journal of Physics: Conference Series, Vol. 664, Issue 4
DIRAC in Large Particle Physics Experiments
journal, October 2017
- Stagni, F.; Tsaregorodtsev, A.; Arrabito, L.
- Journal of Physics: Conference Series, Vol. 898
ROOT — An object oriented data analysis framework
journal, April 1997
- Brun, Rene; Rademakers, Fons
- Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 389, Issue 1-2
AthenaMT: upgrading the ATLAS software framework for the many-core world with multi-threading
journal, October 2017
- Leggett, Charles; Baines, John; Bold, Tomasz
- Journal of Physics: Conference Series, Vol. 898
AGIS: The ATLAS Grid Information System
journal, December 2012
- Anisenkov, Alexey; Belov, Sergey; Di Girolamo, Alessandro
- Journal of Physics: Conference Series, Vol. 396, Issue 3
Caching Servers for ATLAS
journal, October 2017
- Gardner, R. W.; Hanushevsky, A.; Vukotic, I.
- Journal of Physics: Conference Series, Vol. 898
Dynamic federation of grid and cloud storage
journal, September 2016
- Furano, Fabrizio; Keeble, Oliver; Field, Laurence
- Physics of Particles and Nuclei Letters, Vol. 13, Issue 5
A Survey of Software-Defined Networking: Past, Present, and Future of Programmable Networks
journal, October 2014
- Nunes, Bruno Astuto A.; Mendonca, Marc; Nguyen, Xuan-Nam
- IEEE Communications Surveys & Tutorials, Vol. 16, Issue 3
Search for TeV-scale gravity signatures in high-mass final states with leptons and jets with the ATLAS detector at
journal, September 2016
- Aaboud, M.; Aad, G.; Abbott, B.
- Physics Letters B, Vol. 760
DPM evolution: a disk operations management engine for DPM
journal, October 2017
- Manzi, A.; Furano, F.; Keeble, O.
- Journal of Physics: Conference Series, Vol. 898
Storage resource manager version 2.2: design, implementation, and testing experience
journal, July 2008
- Donno, F.; Abadie, L.; Badino, P.
- Journal of Physics: Conference Series, Vol. 119, Issue 6
LHC Machine
journal, August 2008
- Evans, Lyndon; Bryant, Philip
- Journal of Instrumentation, Vol. 3, Issue 08
The ATLAS Tier-0: Overview and operational experience
journal, April 2010
- Elsing, Markus; Goossens, Luc; Nairz, Armin
- Journal of Physics: Conference Series, Vol. 219, Issue 7
The ATLAS Experiment at the CERN Large Hadron Collider
journal, August 2008
- Collaboration, The ATLAS; Aad, G.; Abat, E.
- Journal of Instrumentation, Vol. 3, Issue 08
The CMS experiment at the CERN LHC
journal, August 2008
- Collaboration, The CMS; Chatrchyan, S.; Hmayakyan, G.
- Journal of Instrumentation, Vol. 3, Issue 08
Unified Monitoring Architecture for IT and Grid Services
journal, October 2017
- Aimar, A.; Corman, A. Aguado; Andrade, P.
- Journal of Physics: Conference Series, Vol. 898
ATLAS Distributed Computing Experience and Performance During the LHC Run-2
journal, October 2017
- Filipčič, A.
- Journal of Physics: Conference Series, Vol. 898
Distributed computing in practice: the Condor experience
journal, January 2005
- Thain, Douglas; Tannenbaum, Todd; Livny, Miron
- Concurrency and Computation: Practice and Experience, Vol. 17, Issue 2-4, p. 323-356
Globus Toolkit Version 4: Software for Service-Oriented Systems
journal, July 2006
- Foster, Ian
- Journal of Computer Science and Technology, Vol. 21, Issue 4
AGIS: The ATLAS Grid Information System
journal, June 2014
- Anisenkov, A.; Girolamo, A. Di; Klimentov, A.
- Journal of Physics: Conference Series, Vol. 513, Issue 3
Distributed computing in practice: the Condor experience
journal, January 2005
- Thain, Douglas; Tannenbaum, Todd; Livny, Miron
- Concurrency and Computation: Practice and Experience, Vol. 17, Issue 2-4, p. 323-356
AGIS: The ATLAS Grid Information System
journal, December 2012
- Anisenkov, Alexey; Belov, Sergey; Di Girolamo, Alessandro
- Journal of Physics: Conference Series, Vol. 396, Issue 3
ATLAS Replica Management in Rucio: Replication Rules and Subscriptions
journal, June 2014
- Barisits, M.; Serfon, C.; Garonne, V.
- Journal of Physics: Conference Series, Vol. 513, Issue 4
Machine learning of network metrics in ATLAS Distributed Data Management
journal, October 2017
- Lassnig, Mario; Toler, Wesley; Vamosi, Ralf
- Journal of Physics: Conference Series, Vol. 898
DPM evolution: a disk operations management engine for DPM
journal, October 2017
- Manzi, A.; Furano, F.; Keeble, O.
- Journal of Physics: Conference Series, Vol. 898
DIRAC in Large Particle Physics Experiments
journal, October 2017
- Stagni, F.; Tsaregorodtsev, A.; Arrabito, L.
- Journal of Physics: Conference Series, Vol. 898
Figures/Tables have been extracted from DOE-funded journal article accepted manuscripts.