Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Fast GPU 3D diffeomorphic image registration

Journal Article · · Journal of Parallel and Distributed Computing
 [1];  [2];  [2];  [3];  [4]
  1. Univ. of Stuttgart (Germany); Univ. of Stuttgart (Germany)
  2. Univ. of Texas, Austin, TX (United States)
  3. Univ. of Stuttgart (Germany)
  4. Univ. of Houston, TX (United States)
3D image registration is one of the most fundamental and computationally expensive operations in medical image analysis. Here, we present a mixed-precision, Gauss–Newton–Krylov solver for diffeomorphic registration of two images. Our work extends the publicly available CLAIRE library to GPU architectures. Despite the importance of image registration, only a few implementations of large deformation diffeomorphic registration packages support GPUs. Our contributions are new algorithms to significantly reduce the run time of the two main computational kernels in CLAIRE: calculation of derivatives and scattered-data interpolation. Additionally, we deploy (i) highly-optimized, mixed-precision GPU-kernels for the evaluation of scattered-data interpolation, (ii) replace Fast-Fourier-Transform (FFT)-based first-order derivatives with optimized 8th-order finite differences, and (iii) compare with state-of-the-art CPU and GPU implementations. As a highlight, we demonstrate that we can register clinical images in less than 6 s on a single NVIDIA Tesla V100. This amounts to over 20 speed-up over the current version of CLAIRE and over 30 speed-up over existing GPU implementations.
Research Organization:
Duke Univ., Durham, NC (United States); Univ. of Texas, Austin, TX (United States)
Sponsoring Organization:
National Institutes of Health (NIH); National Science Foundation (NSF); US Air Force Office of Scientific Research (AFOSR); USDOE; USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
NA0003969; SC0019393
OSTI ID:
1850434
Alternate ID(s):
OSTI ID: 1874748
Journal Information:
Journal of Parallel and Distributed Computing, Journal Name: Journal of Parallel and Distributed Computing Journal Issue: C Vol. 149; ISSN 0743-7315
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (68)

Parallel Lagrange--Newton--Krylov--Schur Methods for PDE-Constrained Optimization. Part II: The Lagrange--Newton Solver and Its Application to Optimal Control of Steady Viscous Flows journal January 2005
Diffeomorphic Matching and Dynamic Deformable Surfaces in 3d Medical Imaging journal January 2010
Fast Predictive Image Registration preprint January 2016
Fourier spectral methods for fractional-in-space reaction-diffusion equations journal April 2014
Image Sequence Interpolation Using Optimal Control journal March 2011
Stability and Convergence Analysis of Fully Discrete Fourier Collocation Spectral Method for 3-D Viscous Burgers’ Equation journal July 2012
Registration of Anatomical Images Using Paths of Diffeomorphisms Parameterized with Stationary Vector Field Flows journal February 2009
Diffeomorphic 3D Image Registration via Geodesic Shooting Using an Efficient Adjoint Calculation journal August 2011
Geodesics, Parallel Transport & One-Parameter Subgroups for Diffeomorphic Image Registration journal December 2012
A survey of medical image registration journal March 1998
Fast free-form deformation using graphics processing units journal June 2010
A survey of medical image registration on graphics hardware journal December 2011
Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs journal April 2016
Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain journal February 2008
Medical image processing on the GPU – Past, present and future journal December 2013
Geodesic shape regression with multiple geometries and sparse parameters journal July 2017
Probabilistic modeling of anatomical variability using a low dimensional parameterization of diffeomorphisms journal October 2017
A fast diffeomorphic image registration algorithm journal October 2007
Diffeomorphic demons: Efficient non-parametric image registration journal March 2009
Evolutions equations in computational anatomy journal March 2009
A reproducible evaluation of ANTs similarity metric performance in brain image registration journal February 2011
Diffeomorphic registration using geodesic shooting and Gauss–Newton optimisation journal April 2011
LCC-Demons: A robust and accurate symmetric diffeomorphic registration algorithm journal November 2013
Morphometry of anatomical shape complexes with dense deformations and sparse parameters journal November 2014
Quicksilver: Fast predictive image registration – A deep learning approach journal September 2017
An optimal control approach to the optical flow problem journal January 2016
Sequential Quadratic Programming journal January 1995
Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms journal February 2005
Efficient GPU-Based Texture Interpolation using Uniform B-Splines journal January 2008
Efficient Cubic B-spline Image Interpolation on a GPU journal October 2012
Medical image registration journal February 2001
Implementation and evaluation of various demons deformable image registration algorithms on a GPU journal December 2009
On developing B-spline registration algorithms for multi-core processors journal October 2010
Ill-posed medicine—an introduction to image registration journal May 2008
Jacobi fields in groups of diffeomorphisms and applications journal January 2007
Variational problems on flows of diffeomorphisms for image matching journal January 1998
GPU Prefilter for Accurate Cubic B-spline Interpolation journal December 2010
Nonrigid registration using free-form deformations: application to breast MR images journal January 1999
Deformable templates using large deformation kinematics journal January 1996
Multi-GPU acceleration of DARTEL (early detection of Alzheimer) conference September 2014
An optimal control approach for deformable registration
  • Hart, G. L.; Zach, C.; Niethammer, M.
  • 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR Workshops), 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops https://doi.org/10.1109/CVPR.2009.5204344
conference June 2009
An optimal control approach for deformable registration
  • Hart, Gabriel L.; Zach, Christopher; Niethammer, Marc
  • 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops https://doi.org/10.1109/CVPRW.2009.5204344
conference June 2009
Accelerating multi-scale flows for LDDKBM diffeomorphic registration conference November 2011
A Survey of Medical Image Registration on Multicore and the GPU journal March 2010
Distributed-Memory Large Deformation Diffeomorphic 3D Image Registration
  • Mang, Andreas; Gholami, Amir; Biros, George
  • SC16: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2016.71
conference November 2016
elastix: A Toolbox for Intensity-Based Medical Image Registration journal January 2010
Simultaneous Multi-scale Registration Using Large Deformation Diffeomorphic Metric Mapping journal October 2011
Deformable Medical Image Registration: A Survey journal July 2013
Learning a Probabilistic Model for Diffeomorphic Registration journal September 2019
VoxelMorph: A Learning Framework for Deformable Medical Image Registration journal August 2019
Inexact Newton Methods journal April 1982
Choosing the Forcing Terms in an Inexact Newton Method journal January 1996
A Hyperelastic Regularization Energy for Image Registration journal January 2013
An Inexact Newton--Krylov Algorithm for Constrained Diffeomorphic Image Registration journal January 2015
Constrained $H^1$-Regularization Schemes for Diffeomorphic Image Registration journal January 2016
Asymptotically Compatible Fourier Spectral Approximations of Nonlocal Allen--Cahn Equations journal January 2016
A Semi-Lagrangian Two-Level Preconditioned Newton--Krylov Solver for Constrained Diffeomorphic Image Registration journal January 2017
A Matrix-Free Approach to Parallel and Memory-Efficient Deformable Image Registration journal January 2018
CLAIRE: A Distributed-Memory Solver for Constrained Large Deformation Diffeomorphic Image Registration journal January 2019
Optimal Control Formulation for Determining Optical Flow journal January 2003
Parallel Lagrange--Newton--Krylov--Schur Methods for PDE-Constrained Optimization. Part I: The Krylov--Schur Solver journal January 2005
Parallel Lagrange--Newton--Krylov--Schur Methods for PDE-Constrained Optimization. Part II: The Lagrange--Newton Solver and Its Application to Optimal Control of Steady Viscous Flows journal January 2005
Accelerating 3d Non-Rigid Registration Using Graphics Hardware journal January 2008
Roofline: an insightful visual performance model for multicore architectures journal April 2009
A GPU approach for accelerating 3D deformable registration (DARTEL) on brain biomedical images conference January 2013
A framework for scalable biophysics-based image analysis
  • Gholami, Amir; Mang, Andreas; Scheufele, Klaudius
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '17 https://doi.org/10.1145/3126908.3126930
conference January 2017
Non-rigid image registration: theory and practice journal December 2004
Methods of conjugate gradients for solving linear systems journal December 1952

Cited By (1)

Diffeomorphic Shape Matching by Operator Splitting in 3D Cardiology Imaging journal January 2021

Similar Records

CLAIRE—Parallelized Diffeomorphic Image Registration for Large-Scale Biomedical Imaging Applications
Journal Article · Thu Sep 15 20:00:00 EDT 2022 · Journal of Imaging · OSTI ID:1888050

LOGAN: High-Performance X-Drop Pairwise Alignment on GPU (LOGAN) v1.0
Software · Mon Nov 04 19:00:00 EST 2019 · OSTI ID:code-32464

LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment
Journal Article · Sun May 17 20:00:00 EDT 2020 · Proceedings - IEEE International Parallel and Distributed Processing Symposium (IPDPS) · OSTI ID:1650093