DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces

Abstract

We present the first high-performance computing implementation of the meso-scale phase field dislocation dynamics (PFDD) model on a graphics processing unit (GPU)-based platform. The implementation takes advantage of the portable OpenACC standard directive pragmas along with Nvidia's compute unified device architecture (CUDA) fast Fourier transform (FFT) library called CUFFT to execute the FFT computations within the PFDD formulation on the same GPU platform. The overall implementation is termed ACCPFDD-CUFFT. The package is entirely performance portable due to the use of OPENACC-CUDA inter-operability, in which calls to CUDA functions are replaced with the OPENACC data regions for a host central processing unit (CPU) and device (GPU). A comprehensive benchmark study has been conducted, which compares a number of FFT routines, the Numerical Recipes FFT (FOURN), Fastest Fourier Transform in the West (FFTW), and the CUFFT. The last one exploits the advantages of the GPU hardware for FFT calculations. The novel ACCPFDD-CUFFT implementation is verified using the analytical solutions for the stress field around an infinite edge dislocation and subsequently applied to simulate the interaction and motion of dislocations through a bi-phase copper-nickel (Cu–Ni) interface. It is demonstrated that the ACCPFDD-CUFFT implementation on a single TESLA K80 GPU offers a 27.6X speedupmore » relative to the serial version and a 5X speedup relative to the 22-multicore Intel Xeon CPU E5-2699 v4 @ 2.20 GHz version of the code.« less

Authors:
 [1];  [1];  [2];  [3];  [1]
  1. Univ. of New Hampshire, Durham, NH (United States)
  2. Univ. of California, Santa Barbara, CA (United States)
  3. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Publication Date:
Research Org.:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1402642
Alternate Identifier(s):
OSTI ID: 1549390
Report Number(s):
LA-UR-17-25058
Journal ID: ISSN 0965-9978
Grant/Contract Number:  
AC52-06NA25396; 20160156ER
Resource Type:
Accepted Manuscript
Journal Name:
Advances in Engineering Software
Additional Journal Information:
Journal Volume: 115; Journal ID: ISSN 0965-9978
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, and Knezevic, Marko. Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces. United States: N. p., 2017. Web. doi:10.1016/j.advengsoft.2017.09.010.
Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, & Knezevic, Marko. Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces. United States. https://doi.org/10.1016/j.advengsoft.2017.09.010
Eghtesad, Adnan, Germaschewski, Kai, Beyerlein, Irene J., Hunter, Abigail, and Knezevic, Marko. Sat . "Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces". United States. https://doi.org/10.1016/j.advengsoft.2017.09.010. https://www.osti.gov/servlets/purl/1402642.
@article{osti_1402642,
title = {Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces},
author = {Eghtesad, Adnan and Germaschewski, Kai and Beyerlein, Irene J. and Hunter, Abigail and Knezevic, Marko},
abstractNote = {We present the first high-performance computing implementation of the meso-scale phase field dislocation dynamics (PFDD) model on a graphics processing unit (GPU)-based platform. The implementation takes advantage of the portable OpenACC standard directive pragmas along with Nvidia's compute unified device architecture (CUDA) fast Fourier transform (FFT) library called CUFFT to execute the FFT computations within the PFDD formulation on the same GPU platform. The overall implementation is termed ACCPFDD-CUFFT. The package is entirely performance portable due to the use of OPENACC-CUDA inter-operability, in which calls to CUDA functions are replaced with the OPENACC data regions for a host central processing unit (CPU) and device (GPU). A comprehensive benchmark study has been conducted, which compares a number of FFT routines, the Numerical Recipes FFT (FOURN), Fastest Fourier Transform in the West (FFTW), and the CUFFT. The last one exploits the advantages of the GPU hardware for FFT calculations. The novel ACCPFDD-CUFFT implementation is verified using the analytical solutions for the stress field around an infinite edge dislocation and subsequently applied to simulate the interaction and motion of dislocations through a bi-phase copper-nickel (Cu–Ni) interface. It is demonstrated that the ACCPFDD-CUFFT implementation on a single TESLA K80 GPU offers a 27.6X speedup relative to the serial version and a 5X speedup relative to the 22-multicore Intel Xeon CPU E5-2699 v4 @ 2.20 GHz version of the code.},
doi = {10.1016/j.advengsoft.2017.09.010},
journal = {Advances in Engineering Software},
number = ,
volume = 115,
place = {United States},
year = {Sat Oct 14 00:00:00 EDT 2017},
month = {Sat Oct 14 00:00:00 EDT 2017}
}

Journal Article:

Citation Metrics:
Cited by: 10 works
Citation information provided by
Web of Science

Save / Share: