High performance graphics processor based computed tomography reconstruction algorithms for nuclear and other large scale applications.

Jimenez, Edward S.; Orr, Laurel J.; Thompson, Kyle R.

doi:10.2172/1121931

High performance graphics processor based computed tomography reconstruction algorithms for nuclear and other large scale applications.

Technical Report · Sun Sep 01 04:00:00 EDT 2013

DOI:https://doi.org/10.2172/1121931· OSTI ID:1121931

Jimenez, Edward S. ^[1]; Orr, Laurel J. ^[1]; Thompson, Kyle R. ^[1]

Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

The goal of this work is to develop a fast computed tomography (CT) reconstruction algorithm based on graphics processing units (GPU) that achieves significant improvement over traditional central processing unit (CPU) based implementations. The main challenge in developing a CT algorithm that is capable of handling very large datasets is parallelizing the algorithm in such a way that data transfer does not hinder performance of the reconstruction algorithm. General Purpose Graphics Processing (GPGPU) is a new technology that the Science and Technology (S&T) community is starting to adopt in many fields where CPU-based computing is the norm. GPGPU programming requires a new approach to algorithm development that utilizes massively multi-threaded environments. Multi-threaded algorithms in general are difficult to optimize since performance bottlenecks occur that are non-existent in single-threaded algorithms such as memory latencies. If an efficient GPU-based CT reconstruction algorithm can be developed; computational times could be improved by a factor of 20. Additionally, cost benefits will be realized as commodity graphics hardware could potentially replace expensive supercomputers and high-end workstations. This project will take advantage of the CUDA programming environment and attempt to parallelize the task in such a way that multiple slices of the reconstruction volume are computed simultaneously. This work will also take advantage of the GPU memory by utilizing asynchronous memory transfers, GPU texture memory, and (when possible) pinned host memory so that the memory transfer bottleneck inherent to GPGPU is amortized. Additionally, this work will take advantage of GPU-specific hardware (i.e. fast texture memory, pixel-pipelines, hardware interpolators, and varying memory hierarchy) that will allow for additional performance improvements.

Research Organization:: Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)

Sponsoring Organization:: USDOE National Nuclear Security Administration (NNSA)

DOE Contract Number:: AC04-94AL85000

OSTI ID:: 1121931

Report Number(s):: SAND2013--8059; 482802

Country of Publication:: United States

Language:: English

Similar Records

Development of a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI)

Journal Article · Sun Mar 20 00:00:00 EDT 2011 · Journal of Computational Physics · OSTI ID:21499779

Optimizing High Performance Markov Clustering for Pre-Exascale Architectures

Journal Article · Thu Apr 30 20:00:00 EDT 2020 · Proceedings - IEEE International Parallel and Distributed Processing Symposium (IPDPS) · OSTI ID:1650092

RACB: Resource Aware Cache Bypass on GPUs

Conference · Wed Oct 01 00:00:00 EDT 2014 · 2014 International Symposium on Computer Architecture and High Performance Computing Workshop; 22-24 Oct. 2014; Paris, France · OSTI ID:1567596

Related Subjects

97 MATHEMATICS AND COMPUTING

High performance graphics processor based computed tomography reconstruction algorithms for nuclear and other large scale applications.

Citation Formats

Similar Records

Related Subjects