U.S. Department of Energy
Office of Scientific and Technical Information

dCache: The Storage System of Choice for Data-Intensive Applications

Journal Article · Computing and Software for Big Science
The ever-increasing volumes of data produced by modern scientific facilities like EuXFEL and LHC place significant stress on the data management infrastructure operated by laboratories and research centers. The challenges to be addressed span the entire data life cycle, from ingest and efficient data analysis to long-term preservation, which typically involves large tape libraries. dCache, a storage system developed in collaboration between the Deutsches Elektronen-Synchrotron (DESY), Fermi National Accelerator Laboratory, and the Nordic e-Infrastructure Collaboration (NeIC), is designed to manage a large number of disk servers and to facilitate transparent data migration to and from archival storage. Its multifaceted approach offers a unified method to support a variety of scientific use cases with the same storage infrastructure, including high-throughput data ingest, data sharing over wide area networks, efficient access from HPC clusters, and long-term data preservation on tertiary storage. Initially developed for high energy physics (HEP) experiments, dCache is now used by various scientific communities, including astrophysics, biomedical research, and life sciences, each with its own requirements. This paper presents the architecture, deployment strategies, performance and scalability enhancements, and recent advancements in dCache that address the needs of these communities. Finally, we touch on the development and release process, which ensures the software’s high quality.
Research Organization:
Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), High Energy Physics (HEP)
Grant/Contract Number:
89243024CSC000002
OSTI ID:
3006987
Report Number(s):
FERMILAB-PUB--25-0857-CSAID; oai:inspirehep.net:3089579
Journal Information:
Computing and Software for Big Science, Vol. 9, Issue 1; ISSN 2510-2036; ISSN 2510-2044
Publisher:
Springer
Country of Publication:
United States
Language:
English

Similar Records

dCache: Inter-disciplinary storage system
Conference · Thu Dec 31 23:00:00 EST 2020 · EPJ Web Conf. · OSTI ID:1854764

dCache project status and update
Conference · Tue Dec 31 23:00:00 EST 2024 · EPJ Web Conf. · OSTI ID:3009879

dCache - Keeping up With the Evolution of Science
Conference · Tue Dec 31 23:00:00 EST 2019 · EPJ Web Conf. · OSTI ID:1842718
