Label-based Virtual Directories In dCache
- DESY
- Fermilab
- Natl. Supercomputing Ctr., Tianjin
Traditional filesystems organize data in directories. These directories are typically a collection of files whose grouping is based on a single criterion, e.g., the starting date of an experiment, experiment name, beamline ID, measurement device, or instrument. However, each file in a directory can belong to several logical groups, such as a special event type, experiment condition, or a part of a selected dataset. dCache is a storage system developed to store large amounts of scientific data, used by many HEP and Photon Science experiments. With recent developments in dCache, we have introduced a concept of file tagging, which dynamically groups files with the same label into virtual directories. The file labels can be added, removed, renamed, and deleted through the admin interface or via REST API. The files in virtual directories are exposed through all protocols supported by dCache. This contribution will describe the details of the implementation for file tagging in dCache and present our future development plans on automatic metadata extractions, a feature that will significantly simplify data management. Additionally, we are exploring the future use of virtual directories as a way to translate scientific data catalogs into filesystem views for direct data analysis.
- Research Organization:
- Natl. Supercomputing Ctr., Tianjin; DESY; Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States)
- Sponsoring Organization:
- US Department of Energy
- DOE Contract Number:
- 89243024CSC000002
- OSTI ID:
- 3009885
- Report Number(s):
- FERMILAB-CONF-25-0940-CSAID; oai:inspirehep.net:3065643
- Resource Type:
- Conference paper
- Conference Information:
- Journal Name: EPJ Web Conf.
- Country of Publication:
- United States
- Language:
- English
Similar Records
dCache project status and update
dCache: The Storage System of Choice for Data-Intensive Applications
dCache - Joining the noWORM Storage Club
Conference
·
Tue Dec 31 23:00:00 EST 2024
· EPJ Web Conf.
·
OSTI ID:3009879
dCache: The Storage System of Choice for Data-Intensive Applications
Journal Article
·
Mon Nov 24 19:00:00 EST 2025
· Computing and Software for Big Science
·
OSTI ID:3006987
dCache - Joining the noWORM Storage Club
Conference
·
Mon Dec 31 23:00:00 EST 2018
· EPJ Web Conf.
·
OSTI ID:1581427