GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials
Abstract
The Locally Self-consistent Multiple Scattering (LSMS) code solves the first principles Density Functional theory Kohn–Sham equation for a wide range of materials with a special focus on metals, alloys and metallic nano-structures. It has traditionally exhibited near perfect scalability on massively parallel high performance computer architectures. In this paper, we present our efforts to exploit GPUs to accelerate the LSMS code to enable first principles calculations of O(100,000) atoms and statistical physics sampling of finite temperature properties. We reimplement the scattering matrix calculation for GPUs with a block matrix inversion algorithm that only uses accelerator memory. Finally, using the Cray XK7 system Titan at the Oak Ridge Leadership Computing Facility we achieve a sustained performance of 14.5PFlop/s and a speedup of 8.6 compared to the CPU only code.
- Authors:
-
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- NVIDIA Corporation, Santa Clara, CA (United States)
- Publication Date:
- Research Org.:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
- Sponsoring Org.:
- USDOE Office of Science (SC), Basic Energy Sciences (BES); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Contributing Org.:
- NVIDIA Corporation, Santa Clara, CA (United States)
- OSTI Identifier:
- 1335344
- Alternate Identifier(s):
- OSTI ID: 1396465
- Grant/Contract Number:
- AC05-00OR22725
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Computer Physics Communications
- Additional Journal Information:
- Journal Volume: 211; Journal ID: ISSN 0010-4655
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 36 MATERIALS SCIENCE; First-principles; Monte-Carlo; Phase transitions
Citation Formats
Eisenbach, Markus, Larkin, Jeff, Lutjens, Justin, Rennich, Steven, and Rogers, James H. GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials. United States: N. p., 2016.
Web. doi:10.1016/j.cpc.2016.07.013.
Eisenbach, Markus, Larkin, Jeff, Lutjens, Justin, Rennich, Steven, & Rogers, James H. GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials. United States. https://doi.org/10.1016/j.cpc.2016.07.013
Eisenbach, Markus, Larkin, Jeff, Lutjens, Justin, Rennich, Steven, and Rogers, James H. Tue .
"GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials". United States. https://doi.org/10.1016/j.cpc.2016.07.013. https://www.osti.gov/servlets/purl/1335344.
@article{osti_1335344,
title = {GPU acceleration of the Locally Selfconsistent Multiple Scattering code for first principles calculation of the ground state and statistical physics of materials},
author = {Eisenbach, Markus and Larkin, Jeff and Lutjens, Justin and Rennich, Steven and Rogers, James H.},
abstractNote = {The Locally Self-consistent Multiple Scattering (LSMS) code solves the first principles Density Functional theory Kohn–Sham equation for a wide range of materials with a special focus on metals, alloys and metallic nano-structures. It has traditionally exhibited near perfect scalability on massively parallel high performance computer architectures. In this paper, we present our efforts to exploit GPUs to accelerate the LSMS code to enable first principles calculations of O(100,000) atoms and statistical physics sampling of finite temperature properties. We reimplement the scattering matrix calculation for GPUs with a block matrix inversion algorithm that only uses accelerator memory. Finally, using the Cray XK7 system Titan at the Oak Ridge Leadership Computing Facility we achieve a sustained performance of 14.5PFlop/s and a speedup of 8.6 compared to the CPU only code.},
doi = {10.1016/j.cpc.2016.07.013},
journal = {Computer Physics Communications},
number = ,
volume = 211,
place = {United States},
year = {Tue Jul 12 00:00:00 EDT 2016},
month = {Tue Jul 12 00:00:00 EDT 2016}
}
Web of Science
Works referenced in this record:
Inhomogeneous Electron Gas
journal, November 1964
- Hohenberg, P.; Kohn, W.
- Physical Review, Vol. 136, Issue 3B, p. B864-B871
Self-Consistent Equations Including Exchange and Correlation Effects
journal, November 1965
- Kohn, W.; Sham, L. J.
- Physical Review, Vol. 140, Issue 4A, p. A1133-A1138
Calculating condensed matter properties using the KKR-Green's function method—recent developments and applications
journal, August 2011
- Ebert, H.; Ködderitzsch, D.; Minár, J.
- Reports on Progress in Physics, Vol. 74, Issue 9
Order- N Multiple Scattering Approach to Electronic Structure Calculations
journal, October 1995
- Wang, Yang; Stocks, G. M.; Shelton, W. A.
- Physical Review Letters, Vol. 75, Issue 15
Magnetic anisotropy of monoatomic iron chains embedded in copper
journal, March 2002
- Eisenbach, Markus; Györffy, Balazs L.; Stocks, G. Malcolm
- Physical Review B, Vol. 65, Issue 14
On calculating the magnetic state of nanostructures
journal, February 2007
- Stocks, G.; Eisenbach, M.; Ujfalussy, B.
- Progress in Materials Science, Vol. 52, Issue 2-3
Equation of State Calculations by Fast Computing Machines
journal, June 1953
- Metropolis, Nicholas; Rosenbluth, Arianna W.; Rosenbluth, Marshall N.
- The Journal of Chemical Physics, Vol. 21, Issue 6
First principles calculation of finite temperature magnetism in Fe and Fe 3 C
journal, April 2011
- Eisenbach, M.; Nicholson, D. M.; Rusanu, A.
- Journal of Applied Physics, Vol. 109, Issue 7
First principles approach to the magneto caloric effect: Application toNi 2 MnGa
journal, April 2011
- Nicholson, D. M.; Odbadrakh, Kh.; Rusanu, A.
- Journal of Applied Physics, Vol. 109, Issue 7
Density-functional Monte-Carlo simulation of CuZn order-disorder transition
journal, January 2016
- Khan, S. N.; Eisenbach, Markus
- Physical Review B, Vol. 93, Issue 2
Onsager cavity fields in itinerant-electron paramagnets
journal, July 1992
- Staunton, J. B.; Gyorffy, B. L.
- Physical Review Letters, Vol. 69, Issue 2
Massively parallel Wang–Landau sampling on multiple GPUs
journal, August 2012
- Yin, Junqi; Landau, D. P.
- Computer Physics Communications, Vol. 183, Issue 8