skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Efficient, Parallel At-scale Correlation Analysis for Atom Probe Tomography on Hybrid Architectures

Abstract

Atom probe tomography (APT) is a material probing technique that has undergone dramatic improvements in its capability to map individual atoms within a material sample resulting in data files with hundreds of millions of atoms. Understanding the nano-structural features hidden in these massive amounts of atomic data is a crucial analysis task for materials scientists. However, fast analysis capabilities for large APT workloads remains a critical bottleneck. In this paper, we present the design, implementation and detailed performance evaluations of a parallel software capable of efficiently performing extremely time-consuming correlation analyses of massive high density APT data. Starting with shared memory implementations to motivate our design choices, we extend the implementation to hybrid architectures keeping realistic APT workloads in mind. Detailed performance analyses of three different parallel implementations of the software are supported by empirical results on a Cray XC30 and a Cray XC40 architecture. Its usefulness is demonstrated by reducing the turnaround time of an end-to-end APT correlation analysis on 100 millions atoms by three orders of magnitude using 2048 MPI ranks on 1024 nodes (24 cores per node) of a Cray XC30. The software reported here equips material scientists for the first time with a high-speed scalable capabilitymore » for efficient and timely analyses of massive APT data.« less

Authors:
 [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]
  1. ORNL
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1468218
DOE Contract Number:  
AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: 32nd IEEE International Parallel & Distributed Processing Symposium (IPDPS 2018) - Vancouver, , Canada - 5/21/2018 8:00:00 AM-5/25/2018 8:00:00 AM
Country of Publication:
United States
Language:
English

Citation Formats

Lu, Hao, Seal, Sudip K., Guo, Wei, and Poplawsky, Jonathan D. Efficient, Parallel At-scale Correlation Analysis for Atom Probe Tomography on Hybrid Architectures. United States: N. p., 2018. Web. doi:10.1109/IPDPS.2018.00016.
Lu, Hao, Seal, Sudip K., Guo, Wei, & Poplawsky, Jonathan D. Efficient, Parallel At-scale Correlation Analysis for Atom Probe Tomography on Hybrid Architectures. United States. doi:10.1109/IPDPS.2018.00016.
Lu, Hao, Seal, Sudip K., Guo, Wei, and Poplawsky, Jonathan D. Tue . "Efficient, Parallel At-scale Correlation Analysis for Atom Probe Tomography on Hybrid Architectures". United States. doi:10.1109/IPDPS.2018.00016. https://www.osti.gov/servlets/purl/1468218.
@article{osti_1468218,
title = {Efficient, Parallel At-scale Correlation Analysis for Atom Probe Tomography on Hybrid Architectures},
author = {Lu, Hao and Seal, Sudip K. and Guo, Wei and Poplawsky, Jonathan D.},
abstractNote = {Atom probe tomography (APT) is a material probing technique that has undergone dramatic improvements in its capability to map individual atoms within a material sample resulting in data files with hundreds of millions of atoms. Understanding the nano-structural features hidden in these massive amounts of atomic data is a crucial analysis task for materials scientists. However, fast analysis capabilities for large APT workloads remains a critical bottleneck. In this paper, we present the design, implementation and detailed performance evaluations of a parallel software capable of efficiently performing extremely time-consuming correlation analyses of massive high density APT data. Starting with shared memory implementations to motivate our design choices, we extend the implementation to hybrid architectures keeping realistic APT workloads in mind. Detailed performance analyses of three different parallel implementations of the software are supported by empirical results on a Cray XC30 and a Cray XC40 architecture. Its usefulness is demonstrated by reducing the turnaround time of an end-to-end APT correlation analysis on 100 millions atoms by three orders of magnitude using 2048 MPI ranks on 1024 nodes (24 cores per node) of a Cray XC30. The software reported here equips material scientists for the first time with a high-speed scalable capability for efficient and timely analyses of massive APT data.},
doi = {10.1109/IPDPS.2018.00016},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2018},
month = {5}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: