skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scalable and accurate multi-GPU-based image reconstruction of large-scale ptychography data

Journal Article · · Scientific Reports

Abstract While the advances in synchrotron light sources, together with the development of focusing optics and detectors, allow nanoscale ptychographic imaging of materials and biological specimens, the corresponding experiments can yield terabyte-scale volumes of data that can impose a heavy burden on the computing platform. Although graphics processing units (GPUs) provide high performance for such large-scale ptychography datasets, a single GPU is typically insufficient for analysis and reconstruction. Several works have considered leveraging multiple GPUs to accelerate the ptychographic reconstruction. However, most of these works utilize only the Message Passing Interface to handle the communications between GPUs. This approach poses inefficiency for a hardware configuration that has multiple GPUs in a single node, especially while reconstructing a single large projection, since it provides no optimizations to handle the heterogeneous GPU interconnections containing both low-speed (e.g., PCIe) and high-speed links (e.g., NVLink). In this paper, we provide an optimized intranode multi-GPU implementation that can efficiently solve large-scale ptychographic reconstruction problems. We focus on the maximum likelihood reconstruction problem using a conjugate gradient (CG) method for the solution and propose a novel hybrid parallelization model to address the performance bottlenecks in the CG solver. Accordingly, we have developed a tool, called PtyGer ( Pty chographic G PU(multipl e )-based r econstruction), implementing our hybrid parallelization model design. A comprehensive evaluation verifies that PtyGer can fully preserve the original algorithm’s accuracy while achieving outstanding intranode GPU scalability.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); USDOE Office of Science (SC), Basic Energy Sciences (BES); USDOE National Nuclear Security Administration (NNSA); US Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA)
Grant/Contract Number:
AC02-06CH11357; 89233218CNA000001; D2019-1903270004
OSTI ID:
1860447
Alternate ID(s):
OSTI ID: 1901717
Journal Information:
Scientific Reports, Journal Name: Scientific Reports Vol. 12 Journal Issue: 1; ISSN 2045-2322
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (58)

Phase retrieval with transverse translation diversity: a nonlinear optimization approach journal January 2008
Parallel ptychographic reconstruction journal January 2014
S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters
  • Awan, Ammar Ahmad; Hamidouche, Khaled; Hashmi, Jahanzeb Maqbool
  • PPoPP '17: 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming https://doi.org/10.1145/3018743.3018769
conference January 2017
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing
  • Wang, Linnan; Wu, Wei; Xu, Zenglin
  • ICS '16: 2016 International Conference on Supercomputing, Proceedings of the 2016 International Conference on Supercomputing https://doi.org/10.1145/2925426.2926256
conference June 2016
Maximum-likelihood refinement for coherent diffractive imaging journal June 2012
AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-Based Multi-and Many-Core Processors conference May 2016
XDesign : an open-source software package for designing X-ray imaging phantoms and experiments journal February 2017
Beyond crystallography: Diffractive imaging using coherent x-ray light sources journal April 2015
Movable Aperture Lensless Transmission Microscopy: A Novel Phase Retrieval Algorithm journal July 2004
The conjugate gradient method in extremal problems journal January 1969
Multi-GPU Graph Analytics conference May 2017
Transmission microscopy without lenses for objects of unlimited size journal February 2007
High-throughput ptychography using Eiger-scanning X-ray nano-imaging of extended regions journal January 2014
Topology-aware optimizations for multi-GPU ptychographic image reconstruction
  • Yu, Xiaodong; Bicer, Tekin; Kettimuthu, Rajkumar
  • ICS '21: 2021 International Conference on Supercomputing, Proceedings of the ACM International Conference on Supercomputing https://doi.org/10.1145/3447818.3460380
conference June 2021
High-Resolution Scanning X-ray Diffraction Microscopy journal July 2008
The Velociprobe: An ultrafast hard X-ray nanoprobe for high-resolution ptychographic imaging journal August 2019
Image Quality Assessment: From Error Visibility to Structural Similarity journal April 2004
Relaxed averaged alternating reflections for diffraction imaging journal November 2004
Keyhole coherent diffractive imaging journal March 2008
GPU acceleration of regular expression matching for large datasets: exploring the implementation space conference January 2013
X-ray ptychography journal December 2017
Stepping up to Summit journal March 2018
Probe retrieval in ptychographic coherent diffractive imaging journal March 2009
Optimization of Collective Communication Operations in MPICH journal February 2005
High-Performance Multi-Mode Ptychography Reconstruction on Distributed GPUs conference August 2018
Rotation-as-fast-axis scanning-probe x-ray tomography: the importance of angular diversity for fly-scan modes journal January 2018
A phase retrieval algorithm for shifting illumination journal November 2004
An optimum demodulator for poisson processes: Photon source detectors journal January 1963
cuART: Fine-Grained Algebraic Reconstruction Technique for Computed Tomography Images on GPUs conference May 2016
Joint ptycho-tomography reconstruction through alternating direction method of multipliers journal January 2019
An improved ptychographical phase retrieval algorithm for diffractive imaging journal September 2009
PtychoShelves , a versatile high-level framework for high-performance analysis of ptychographic data journal March 2020
Iterative least-squares solver for generalized maximum-likelihood ptychography journal January 2018
Ptychography at the Linac Coherent Light Source in a parasitic geometry journal September 2020
An Enhanced Image Reconstruction Tool for Computed Tomography on GPUs conference May 2017
Convergence Properties of Nonlinear Conjugate Gradient Methods journal January 2000
Memory access patterns: the missing piece of the multi-GPU puzzle
  • Ben-Nun, Tal; Levy, Ely; Barak, Amnon
  • SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/2807591.2807611
conference November 2015
Exploring different automata representations for efficient regular expression matching on GPUs journal August 2013
Beugung im inhomogenen Primärstrahlwellenfeld. I. Prinzip einer Phasenmessung von Elektronenbeungungsinterferenzen journal July 1969
Comparing Managed Memory and ATS with and without Prefetching on NVIDIA Volta GPUs conference November 2019
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU journal July 2017
Ptychopy: GPU framework for ptychographic data analysis conference September 2021
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect journal January 2020
Ptychography & lensless X-ray imaging journal January 2008
Coherent X-Ray Diffraction Imaging journal January 2012
Simultaneous X-ray fluorescence and ptychographic microscopy of Cyclotella meneghiniana journal January 2012
MemXCT: memory-centric X-ray CT reconstruction with massive parallelization
  • Hidayetoğlu, Mert; Biçer, Tekin; de Gonzalo, Simon Garcia
  • SC '19: The International Conference for High Performance Computing, Networking, Storage, and Analysis, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/3295500.3356220
conference November 2019
Further improvements to the ptychographical iterative engine journal January 2017
Coherent lensless X-ray imaging journal November 2010
GPU-Based Static Data-Flow Analysis for Fast and Scalable Android App Vetting conference May 2020
PyNX.Ptycho : a computing library for X-ray coherent diffraction imaging of nanostructures journal September 2016
Demystifying automata processing: GPUs, FPGAs or Micron's AP? conference January 2017
A Nonlinear Conjugate Gradient Method with a Strong Global Convergence Property journal January 1999
NV-group: link-efficient reduction for distributed deep learning on modern dense GPU systems
  • Chu, Ching-Hsiang; Kousha, Pouya; Awan, Ammar Ahmad
  • ICS '20: 2020 International Conference on Supercomputing, Proceedings of the 34th ACM International Conference on Supercomputing https://doi.org/10.1145/3392717.3392771
conference June 2020
GPU-Aware MPI on RDMA-Enabled Clusters: Design, Implementation and Evaluation journal October 2014
GPU-Based Iterative Medical CT Image Reconstructions journal March 2018
Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations journal October 2017
Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation conference May 2019

Similar Records

Multi-beam X-ray ptychography for high-throughput coherent diffraction imaging
Journal Article · Wed Nov 11 00:00:00 EST 2020 · Scientific Reports · OSTI ID:1860447

Evaluating On-Node GPU Interconnects for Deep Learning Workloads
Conference · Mon Jan 01 00:00:00 EST 2018 · OSTI ID:1860447

Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
Journal Article · Wed Jan 01 00:00:00 EST 2020 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1860447

Related Subjects