skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: AnalyzeThis: An Analysis Workflow-Aware Storage System

Conference ·
OSTI ID:1265619

The need for novel data analysis is urgent in the face of a data deluge from modern applications. Traditional approaches to data analysis incur significant data movement costs, moving data back and forth between the storage system and the processor. Emerging Active Flash devices enable processing on the flash, where the data already resides. An array of such Active Flash devices allows us to revisit how analysis workflows interact with storage systems. By seamlessly blending together the flash storage and data analysis, we create an analysis workflow-aware storage system, AnalyzeThis. Our guiding principle is that analysis-awareness be deeply ingrained in each and every layer of the storage, elevating data analyses as first-class citizens, and transforming AnalyzeThis into a potent analytics-aware appliance. We implement the AnalyzeThis storage system atop an emulation platform of the Active Flash array. Our results indicate that AnalyzeThis is viable, expediting workflow execution and minimizing data movement.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1265619
Resource Relation:
Conference: 2015 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC'15), Austin, TX, USA, 20151115, 20151120
Country of Publication:
United States
Language:
English

Similar Records

An Analysis Workflow-Aware Storage System for Multi-Core Active Flash Arrays
Journal Article · Tue Aug 14 00:00:00 EDT 2018 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1265619

AnalyzeThis: an analysis workflow-aware storage system
Conference · Thu Jan 01 00:00:00 EST 2015 · Proceedings of SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis · OSTI ID:1265619

An Integrated Indexing and Search Service for Distributed File Systems
Journal Article · Mon Apr 27 00:00:00 EDT 2020 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1265619