DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: An efficient MPI/OpenMP parallelization of the Hartree–Fock–Roothaan method for the first generation of Intel® Xeon Phi™ processor architecture

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [2];  [3];  [4]
  1. Lomonosov Moscow State Univ., Moscow (Russian Federation)
  2. RSC Technologies, Moscow (Russian Federation)
  3. Intel Corporation, Schaumburg, IL (United States)
  4. Argonne National Lab. (ANL), Argonne, IL (United States)

The Hartree–Fock method in the General Atomic and Molecular Structure System (GAMESS) quantum chemistry package represents one of the most irregular algorithms in computation today. Major steps in the calculation are the irregular computation of electron repulsion integrals and the building of the Fock matrix. These are the central components of the main self consistent field (SCF) loop, the key hot spot in electronic structure codes. By threading the Message Passing Interface (MPI) ranks in the official release of the GAMESS code, we not only speed up the main SCF loop (4× to 6× for large systems) but also achieve a significant (>2>2×) reduction in the overall memory footprint. These improvements are a direct consequence of memory access optimizations within the MPI ranks. We benchmark our implementation against the official release of the GAMESS code on the Intel® Xeon Phi™ supercomputer. Scaling numbers are reported on up to 7680 cores on Intel Xeon Phi coprocessors.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES) (SC-22), Scientific User Facilities Division; Intel Corporation
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1401981
Alternate ID(s):
OSTI ID: 1402492
Journal Information:
International Journal of High Performance Computing Applications, Vol. 2017; ISSN 1094-3420
Publisher:
SAGECopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 12 works
Citation information provided by
Web of Science

References (23)

Acceleration of the GAMESS-UK electronic structure package on graphical processing units journal May 2011
Efficient electronic integrals and their generalized derivatives for object oriented implementations of electronic structure calculations journal December 2008
MPI/OpenMP Hybrid Parallel Algorithm for Hartree−Fock Calculations journal March 2010
The Performance Characterization of the RSC PetaStream Module book January 2014
General atomic and molecular electronic structure system journal November 1993
Quantum Chemistry on Graphical Processing Units. 1. Strategies for Two-Electron Integral Evaluation journal January 2008
The Heuristic Static Load-Balancing Algorithm Applied to the Community Earth System Model conference May 2014
Thread-level parallelization and optimization of NWChem for the Intel MIC architecture
  • Shan, Hongzhang; Williams, Samuel; de Jong, Wibe
  • Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores - PMAM '15 https://doi.org/10.1145/2712386.2712391
conference January 2015
Macroscale superlubricity enabled by graphene nanoscroll formation journal May 2015
Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms book January 2014
Extending the Power of Quantum Chemistry to Large Systems with the Fragment Molecular Orbital Method journal August 2007
Parallelization of SCF calculations within Q-Chem journal June 2000
Toward high-performance computational chemistry: II. A scalable self-consistent field program journal January 1996
A parallel distributed data CPHF algorithm for analytic Hessians journal January 2007
A New Scalable Parallel Algorithm for Fock Matrix Construction
  • Liu, Xing; Patel, Aftab; Chow, Edmond
  • 2014 IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2014 IEEE 28th International Parallel and Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2014.97
conference May 2014
The Distributed Data Interface in GAMESS journal June 2000
Libcint: An efficient general integral library for Gaussian basis functions journal June 2015
Horizontal vectorization of electron repulsion integrals journal September 2016
Quantum Chemistry on Graphical Processing Units. 2. Direct Self-Consistent-Field Implementation journal March 2009
New Multithreaded Hybrid CPU/GPU Approach to Hartree–Fock journal September 2012
NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations journal September 2010
Efficient recursive computation of molecular integrals over Cartesian Gaussian functions journal April 1986
Evaluation of molecular integrals over Gaussian basis functions journal July 1976

Cited By (1)

Multithreaded parallelization of the energy and analytic gradient in the fragment molecular orbital method journal March 2019