skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data

Abstract

The purpose of this workshop is to identify priority research directions in the area of data management for high-performance and scientific computing above and beyond HPC’s traditional "the parallel file system is the data-management system" model. Supporting the breadth of the DOE mission, including the explosion of AI uses and the growing needs of experimental and observational science, motivates revisiting our assumptions about data management. There are many facets of this topic to explore including: (1) Interfaces for accessing data that resides on traditional persistent storage as well as memory devices; (2) Storage-system architecture design that supports scientific workflows on varied hierarchical storage and networking devices; (3) Devising metadata management infrastructure to support FAIR principles (Findability, Accessibility, Interoperability, and Reusability); (4) Capturing provenance information about scientific data; (5) Utilizing AI to learn I/O patterns of emerging workloads for efficient data management; (6) Providing data management support for AI and complex workflows; and (7) Understanding the overlap between traditional storage systems and I/O (SSIO) efforts and data management. While the program committee has identified these topics as important areas for discussion, we welcome position papers from the community that propose additional topics of interest for discussion at the workshop. The workshopmore » agenda will include breakout sessions for discussing these and selected topic areas to inform priority research directions for data management for high-performance and scientific computing.« less

Authors:
 [1];  [2];  [3];  [1];  [4];  [5]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. Harvard Univ., Cambridge, MA (United States)
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  4. Argonne National Lab. (ANL), Argonne, IL (United States)
  5. Univ. of California, Merced, CA (United States)
Publication Date:
Research Org.:
US Department of Energy (USDOE), Washington DC (United States). Office of Science
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
OSTI Identifier:
1843500
Resource Type:
Technical Report
Resource Relation:
Conference: ASCR Workshop on the Management and Storage of Scientific Data, Held Virtually (United States), 24-25, 27 Jan 2022; Related Information: https://www.orau.gov/MgmtStgeonScData
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Byna, Suren, Idreos, Stratos, Jones, Terry, Mohror, Kathryn, Ross, Rob, and Rusu, Florin. Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data. United States: N. p., 2022. Web. doi:10.2172/1843500.
Byna, Suren, Idreos, Stratos, Jones, Terry, Mohror, Kathryn, Ross, Rob, & Rusu, Florin. Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data. United States. https://doi.org/10.2172/1843500
Byna, Suren, Idreos, Stratos, Jones, Terry, Mohror, Kathryn, Ross, Rob, and Rusu, Florin. 2022. "Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data". United States. https://doi.org/10.2172/1843500. https://www.osti.gov/servlets/purl/1843500.
@article{osti_1843500,
title = {Position Papers for the ASCR Workshop on the Management and Storage of Scientific Data},
author = {Byna, Suren and Idreos, Stratos and Jones, Terry and Mohror, Kathryn and Ross, Rob and Rusu, Florin},
abstractNote = {The purpose of this workshop is to identify priority research directions in the area of data management for high-performance and scientific computing above and beyond HPC’s traditional "the parallel file system is the data-management system" model. Supporting the breadth of the DOE mission, including the explosion of AI uses and the growing needs of experimental and observational science, motivates revisiting our assumptions about data management. There are many facets of this topic to explore including: (1) Interfaces for accessing data that resides on traditional persistent storage as well as memory devices; (2) Storage-system architecture design that supports scientific workflows on varied hierarchical storage and networking devices; (3) Devising metadata management infrastructure to support FAIR principles (Findability, Accessibility, Interoperability, and Reusability); (4) Capturing provenance information about scientific data; (5) Utilizing AI to learn I/O patterns of emerging workloads for efficient data management; (6) Providing data management support for AI and complex workflows; and (7) Understanding the overlap between traditional storage systems and I/O (SSIO) efforts and data management. While the program committee has identified these topics as important areas for discussion, we welcome position papers from the community that propose additional topics of interest for discussion at the workshop. The workshop agenda will include breakout sessions for discussing these and selected topic areas to inform priority research directions for data management for high-performance and scientific computing.},
doi = {10.2172/1843500},
url = {https://www.osti.gov/biblio/1843500}, journal = {},
number = ,
volume = ,
place = {United States},
year = {Sat Jan 01 00:00:00 EST 2022},
month = {Sat Jan 01 00:00:00 EST 2022}
}