OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: AnalyzeThis: an analysis workflow-aware storage system, In: SC '15 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

Abstract

The need for novel data analysis is urgent in the face of a data deluge from modern applications. Traditional approaches to data analysis incur significant data movement costs, moving data back and forth between the storage system and the processor. Emerging Active Flash devices enable processing on the flash, where the data already resides. An array of such Active Flash devices allows us to revisit how analysis workflows interact with storage systems. By seamlessly blending together the flash storage and data analysis, we create an analysis workflow-aware storage system, AnalyzeThis. Our guiding principle is that analysis-awareness be deeply ingrained in each and every layer of the storage, elevating data analyses as first-class citizens, and transforming AnalyzeThis into a potent analytics-aware appliance. We implement the AnalyzeThis storage system atop an emulation platform of the Active Flash array. Our results indicate that AnalyzeThis is viable, expediting workflow execution and minimizing data movement.

Authors:
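Sim, Hyogi [1]; Kim, Youngjae [2]; Vazhkudai, Sudharshan S. [2]; Tiwari, Devesh [2]; Anwar, Ali [1]; Butt, Ali R. [1]; Ramakrishnan, Lavanya [3]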
  1. Virginia Tech
  2. Oak Ridge National Laboratory
  3. Lawrence Berkeley Laboratory
Publication Date:
Research Org.:
Oak Ridge National Laboratory, Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1567399
DOE Contract Number:  
AC05-00OR22725; AC02-05CH11231
Resource Type:
Conference
Journal Name:
PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS
Additional Journal Information:
Conference: International Conference for High Performance Computing, Networking, Storage and Analysis, Austin, Texas, November 15-20, 2015
Country of Publication:
United States
Language:
English
Subject:
Computer Science; Engineering

Citation Formats

Sim, Hyogi, Kim, Youngjae, Vazhkudai, Sudharshan S., Tiwari, Devesh, Anwar, Ali, Butt, Ali R., and Ramakrishnan, Lavanya. AnalyzeThis: an analysis workflow-aware storage system, In: SC '15 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. United States: N. p., 2015. Web. doi:10.1145/2807591.2807622.
Sim, Hyogi, Kim, Youngjae, Vazhkudai, Sudharshan S., Tiwari, Devesh, Anwar, Ali, Butt, Ali R., & Ramakrishnan, Lavanya. AnalyzeThis: an analysis workflow-aware storage system, In: SC '15 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. United States. doi:10.1145/2807591.2807622.
Sim, Hyogi, Kim, Youngjae, Vazhkudai, Sudharshan S., Tiwari, Devesh, Anwar, Ali, Butt, Ali R., and Ramakrishnan, Lavanya. 2015. "AnalyzeThis: an analysis workflow-aware storage system, In: SC '15 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis". United States. doi:10.1145/2807591.2807622.
@article{osti_1567399,
title = {AnalyzeThis: an analysis workflow-aware storage system, In: SC '15 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis},
author = {Sim, Hyogi and Kim, Youngjae and Vazhkudai, Sudharshan S. and Tiwari, Devesh and Anwar, Ali and Butt, Ali R. and Ramakrishnan, Lavanya},
abstractNote = {The need for novel data analysis is urgent in the face of a data deluge from modern applications. Traditional approaches to data analysis incur significant data movement costs, moving data back and forth between the storage system and the processor. Emerging Active Flash devices enable processing on the flash, where the data already resides. An array of such Active Flash devices allows us to revisit how analysis workflows interact with storage systems. By seamlessly blending together the flash storage and data analysis, we create an analysis workflow-aware storage system, AnalyzeThis. Our guiding principle is that analysis-awareness be deeply ingrained in each and every layer of the storage, elevating data analyses as first-class citizens, and transforming AnalyzeThis into a potent analytics-aware appliance. We implement the AnalyzeThis storage system atop an emulation platform of the Active Flash array. Our results indicate that AnalyzeThis is viable, expediting workflow execution and minimizing data movement.},
doi = {10.1145/2807591.2807622},
journal = {PROCEEDINGS OF SC15: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS},
number = {},
volume = {},
place = {United States},
year = {2015},
month = {1}
}
