Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Feature-based analysis of large-scale spatio-temporal sensor data on hybrid architectures

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [1];  [1];  [1];  [1];  [2];  [3]
  1. Emory Univ., Atlanta, GA (United States). Center for Comprehensive Informatics and Biomedical Informatics Dept.
  2. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Scientific Data Group
  3. Emory Univ., Atlanta, GA (United States). Center for Comprehensive Informatics and Biomedical Informatics Dept.; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Scientific Data Group

The analysis of large sensor datasets for structural and functional features has applications in many domains, including weather and climate modeling, characterization of subsurface reservoirs, and biomedicine. The vast amount of data obtained from state-of-the-art sensors and the computational cost of analysis operations create a barrier to such analyses. In this paper, we describe middleware system support to take advantage of large clusters of hybrid CPU–GPU nodes to address the data and compute-intensive requirements of feature-based analyses of large spatio-temporal datasets.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
OSTI ID:
1565093
Journal Information:
International Journal of High Performance Computing Applications, Journal Name: International Journal of High Performance Computing Applications Journal Issue: 3 Vol. 27; ISSN 1094-3420
Publisher:
SAGECopyright Statement
Country of Publication:
United States
Language:
English

References (21)

Optimizing retrieval and processing of multi-dimensional scientific datasets conference January 2000
Biomedical image analysis on a cooperative cluster of GPUs and multicores conference January 2008
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures journal November 2010
Distributed processing of very large datasets with DataCutter journal October 2001
DataStager: scalable data staging services for petascale applications journal June 2010
:{Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data journal January 2003
A scalable gaussian process analysis algorithm for biomass monitoring journal July 2011
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)
  • Lofstead, Jay F.; Klasky, Scott; Schwan, Karsten
  • Proceedings of the 6th international workshop on Challenges of large applications in distributed environments - CLADE '08 https://doi.org/10.1145/1383529.1383533
conference January 2008
Cluster I/O with River: making the fast case common
  • Arpaci-Dusseau, Remzi H.; Anderson, Eric; Treuhaft, Noah
  • Proceedings of the sixth workshop on I/O in parallel and distributed systems - IOPADS '99 https://doi.org/10.1145/301816.301823
conference January 1999
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community journal September 2011
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms
  • Teodoro, George; Pan, Tony; Kurc, Tahsin M.
  • 2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on Parallel and Distributed Processing https://doi.org/10.1109/IPDPS.2013.11
conference May 2013
Run-time optimizations for replicated dataflows on heterogeneous environments
  • Teodoro, George; Hartley, Timothy D. R.; Catalyurek, Umit
  • Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing - HPDC '10 https://doi.org/10.1145/1851476.1851479
conference January 2010
MapReduce: a flexible data processing tool journal January 2010
Morphological signatures and genomic correlates in glioblastoma
  • Cooper, Lee A. D.; Kong, Jun; Wang, Fusheng
  • 2011 8th IEEE International Symposium on Biomedical Imaging (ISBI 2011), 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro https://doi.org/10.1109/ISBI.2011.5872714
conference March 2011
A K-Means Cluster Analysis Computer Program With Cross-Tabulations and Next-Nearest-Neighbor Analysis journal April 1980
DAGuE: A generic distributed DAG engine for High Performance Computing journal January 2012
Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems
  • Teodoro, George; Kurc, Tahsin M.; Pan, Tony
  • 2012 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2012 IEEE 26th International Parallel and Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2012.101
conference May 2012
DataSpaces: an interaction and coordination framework for coupled simulation workflows conference January 2010
A parallel software infrastructure for structured adaptive mesh methods conference January 1995
An Integrative Approach for In Silico Glioma Research journal October 2010
Enabling Interoperation of High Performance, Scientific Computing Applications: Modeling Scientific Data with the Sets and Fields (SAF) Modeling System book January 2001