U.S. Department of Energy
Office of Scientific and Technical Information

The ATree: A data structure to support very large scientific databases

Technical Report · DOI: https://doi.org/10.2172/638241 · OSTI ID: 638241

The datasets generated by satellite observations and supercomputer simulations are overwhelming conventional methods of storage and access, leading to unreasonably long delays in data analysis. The main problem the authors address is slow access to the small subsets needed for scientific visualization and analysis when those subsets must be retrieved from large datasets in archival storage. The first goal is to minimize the amount of storage that has to be read when a subset of the data is needed. A second goal is to make data subsets more accessible by applying data reduction and indexing methods to them. The reduced format allows larger datasets to be stored on local disk for analysis. Data indexing permits efficient manipulation of the data and thus improves the productivity of the researcher. The report describes a data structure, the ATree, that meets the demands of interactive scientific applications. The ATree is suitable for storing data abstracts as well as original data. It allows quick access to a subset of interest and is suitable for feature-based queries. It intrinsically partitions the data and organizes the chunks in a linear sequence on secondary or tertiary storage. It can store data at various resolutions and incorporates hierarchical compression methods.
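The abstract does not give the ATree's actual layout, so the following is only an illustrative sketch of the general idea it describes: partition a dataset into chunks, place the chunks in a linear sequence, and read only the chunks that overlap a requested subset. The Z-order (Morton) key used for the linear ordering is an assumption chosen for illustration, not the report's method, and the names `morton_key`, `build_chunk_store`, and `read_subset` are hypothetical.

```python
def morton_key(row, col, bits=16):
    """Interleave the bits of (row, col) into a single Z-order key.

    Assumption: Z-order stands in for whatever linear ordering the ATree uses.
    """
    key = 0
    for i in range(bits):
        key |= ((row >> i) & 1) << (2 * i + 1)
        key |= ((col >> i) & 1) << (2 * i)
    return key


def build_chunk_store(grid, chunk):
    """Split a 2D grid (list of lists) into chunk x chunk tiles.

    Tiles are keyed by the Z-order position of their tile coordinates;
    sorting by key gives the linear sequence in which they would be stored.
    """
    tiles = {}
    for r in range(0, len(grid), chunk):
        for c in range(0, len(grid[0]), chunk):
            tile = [line[c:c + chunk] for line in grid[r:r + chunk]]
            tiles[morton_key(r // chunk, c // chunk)] = tile
    return dict(sorted(tiles.items()))


def read_subset(store, chunk, r0, r1, c0, c1):
    """Return grid[r0:r1][c0:c1] and the number of tiles that were read."""
    out = [[None] * (c1 - c0) for _ in range(r1 - r0)]
    touched = 0
    # Visit only the tiles whose extent overlaps the query window.
    for cr in range(r0 // chunk, (r1 - 1) // chunk + 1):
        for cc in range(c0 // chunk, (c1 - 1) // chunk + 1):
            tile = store[morton_key(cr, cc)]
            touched += 1
            row_lo, row_hi = max(r0, cr * chunk), min(r1, cr * chunk + len(tile))
            col_lo, col_hi = max(c0, cc * chunk), min(c1, cc * chunk + len(tile[0]))
            for r in range(row_lo, row_hi):
                for c in range(col_lo, col_hi):
                    out[r - r0][c - c0] = tile[r - cr * chunk][c - cc * chunk]
    return out, touched


# Hypothetical 64 x 64 dataset split into 16 tiles of 16 x 16 cells.
grid = [[r * 64 + c for c in range(64)] for r in range(64)]
store = build_chunk_store(grid, 16)
subset, touched = read_subset(store, 16, 10, 20, 30, 40)
```

In this toy setup the 10 x 10 query window overlaps 4 of the 16 tiles, so only a quarter of the stored data is read. That is the kind of saving the abstract's first goal refers to, although the ATree itself may partition and order the data differently.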

Research Organization:
Univ. of Maryland, College Park, MD (United States)
Sponsoring Organization:
USDOE Office of Energy Research, Washington, DC (United States)
DOE Contract Number:
FG05-92ER25141
OSTI ID:
638241
Report Number(s):
DOE/ER/25141--1; CAR-TR--760; CS-TR--3435; ON: DE98007349
Country of Publication:
United States
Language:
English

Similar Records

MOSIQS: Persistent Memory Object Storage With Metadata Indexing and Querying for Scientific Computing
Journal Article · Jun 8, 2021 · IEEE Access · OSTI ID: 1820827

TokSearch: A search engine for fusion experimental data
Journal Article · Apr 1, 2018 · Fusion Engineering and Design · OSTI ID: 1436502

Physical database support for scientific and statistical database management
Conference · May 1, 1986 · OSTI ID: 5563783