Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces
Abstract
We present the first high-performance computing implementation of the meso-scale phase field dislocation dynamics (PFDD) model on a graphics processing unit (GPU)-based platform. The implementation takes advantage of the portable OpenACC standard directive pragmas along with Nvidia's compute unified device architecture (CUDA) fast Fourier transform (FFT) library called CUFFT to execute the FFT computations within the PFDD formulation on the same GPU platform. The overall implementation is termed ACCPFDD-CUFFT. The package is entirely performance portable due to the use of OPENACC-CUDA inter-operability, in which calls to CUDA functions are replaced with the OPENACC data regions for a host central processing unit (CPU) and device (GPU). A comprehensive benchmark study has been conducted, which compares a number of FFT routines, the Numerical Recipes FFT (FOURN), Fastest Fourier Transform in the West (FFTW), and the CUFFT. The last one exploits the advantages of the GPU hardware for FFT calculations. The novel ACCPFDD-CUFFT implementation is verified using the analytical solutions for the stress field around an infinite edge dislocation and subsequently applied to simulate the interaction and motion of dislocations through a bi-phase copper-nickel (Cu–Ni) interface. It is demonstrated that the ACCPFDD-CUFFT implementation on a single TESLA K80 GPU offers a 27.6X speedupmore »
- Authors:
-
- Univ. of New Hampshire, Durham, NH (United States)
- Univ. of California, Santa Barbara, CA (United States)
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Publication Date:
- Research Org.:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1402642
- Alternate Identifier(s):
- OSTI ID: 1549390
- Report Number(s):
- LA-UR-17-25058
Journal ID: ISSN 0965-9978
- Grant/Contract Number:
- AC52-06NA25396; 20160156ER
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Advances in Engineering Software
- Additional Journal Information:
- Journal Volume: 115; Journal ID: ISSN 0965-9978
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, and Knezevic, Marko. Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces. United States: N. p., 2017.
Web. doi:10.1016/j.advengsoft.2017.09.010.
Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, & Knezevic, Marko. Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces. United States. https://doi.org/10.1016/j.advengsoft.2017.09.010
Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, and Knezevic, Marko. Sat .
"Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces". United States. https://doi.org/10.1016/j.advengsoft.2017.09.010. https://www.osti.gov/servlets/purl/1402642.
@article{osti_1402642,
title = {Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces},
author = {Eghtesad, Adnan and Germaschewski, Kai and Beyerlein, Irene J. and Hunter, Abigail and Knezevic, Marko},
abstractNote = {We present the first high-performance computing implementation of the meso-scale phase field dislocation dynamics (PFDD) model on a graphics processing unit (GPU)-based platform. The implementation takes advantage of the portable OpenACC standard directive pragmas along with Nvidia's compute unified device architecture (CUDA) fast Fourier transform (FFT) library called CUFFT to execute the FFT computations within the PFDD formulation on the same GPU platform. The overall implementation is termed ACCPFDD-CUFFT. The package is entirely performance portable due to the use of OPENACC-CUDA inter-operability, in which calls to CUDA functions are replaced with the OPENACC data regions for a host central processing unit (CPU) and device (GPU). A comprehensive benchmark study has been conducted, which compares a number of FFT routines, the Numerical Recipes FFT (FOURN), Fastest Fourier Transform in the West (FFTW), and the CUFFT. The last one exploits the advantages of the GPU hardware for FFT calculations. The novel ACCPFDD-CUFFT implementation is verified using the analytical solutions for the stress field around an infinite edge dislocation and subsequently applied to simulate the interaction and motion of dislocations through a bi-phase copper-nickel (Cu–Ni) interface. It is demonstrated that the ACCPFDD-CUFFT implementation on a single TESLA K80 GPU offers a 27.6X speedup relative to the serial version and a 5X speedup relative to the 22-multicore Intel Xeon CPU E5-2699 v4 @ 2.20 GHz version of the code.},
doi = {10.1016/j.advengsoft.2017.09.010},
journal = {Advances in Engineering Software},
number = ,
volume = 115,
place = {United States},
year = {2017},
month = {10}
}
Web of Science