Title: Optimization of Condensed Matter Physics Application with OpenMP Tasking Model

Conference

The Density Matrix Renormalization Group (DMRG++) is a condensed matter physics application used to study the superconductivity properties of materials. Its main computation is the calculation of the Hamiltonian matrix, which requires sparse matrix-vector multiplications. This paper presents task-based parallelization and optimization strategies for the Hamiltonian algorithm. The algorithm is implemented as a mini-application in C++ and parallelized with OpenMP. The optimization leverages tasking features, such as dependencies and priorities, included in the OpenMP 4.5 standard. The code refactoring targets performance as much as programmability. The optimized version achieves a speedup of 8.0× with 8 threads and 20.5× with 40 threads on a Power9 compute node while reducing the memory consumption to 90 MB with respect to the original code, by adding fewer than ten OpenMP directives.
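As a rough illustration of the tasking features the abstract names, the sketch below applies OpenMP 4.5 `depend` and `priority` clauses to a blocked sparse matrix-vector product. It is not the paper's mini-application: the `CsrMatrix` layout, the function name `quadratic_form_tasks`, the `block_rows` parameter, and the choice of computing x·(A x) as a follow-up step are all assumptions made for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CSR (compressed sparse row) matrix; not the DMRG++ data layout.
struct CsrMatrix {
    std::size_t rows = 0;
    std::vector<std::size_t> row_ptr;  // size rows + 1
    std::vector<std::size_t> col_idx;  // column index of each nonzero
    std::vector<double> values;        // value of each nonzero
};

// Computes y = A * x with one task per block of rows, then returns x . y.
// Each reduction task depends on the SpMV task that produced its block of y.
double quadratic_form_tasks(const CsrMatrix& a, const std::vector<double>& x,
                            std::size_t block_rows, std::vector<double>& y) {
    assert(block_rows > 0 && a.row_ptr.size() == a.rows + 1);
    y.assign(a.rows, 0.0);
    const std::size_t nblocks = (a.rows + block_rows - 1) / block_rows;
    std::vector<double> partial(nblocks, 0.0);
    double* yp = y.data();
    double* pp = partial.data();

    #pragma omp parallel
    #pragma omp single
    for (std::size_t b = 0; b < nblocks; ++b) {
        const std::size_t begin = b * block_rows;
        const std::size_t end = std::min(begin + block_rows, a.rows);
        // Priority hint: blocks with more nonzeros are suggested to run first.
        const int prio = static_cast<int>(a.row_ptr[end] - a.row_ptr[begin]);

        // Producer task: writes rows [begin, end) of y.
        #pragma omp task depend(out: yp[begin]) priority(prio)
        for (std::size_t r = begin; r < end; ++r) {
            double sum = 0.0;
            for (std::size_t k = a.row_ptr[r]; k < a.row_ptr[r + 1]; ++k)
                sum += a.values[k] * x[a.col_idx[k]];
            yp[r] = sum;
        }

        // Consumer task: starts only after the producer of this block finishes.
        #pragma omp task depend(in: yp[begin])
        {
            double s = 0.0;
            for (std::size_t r = begin; r < end; ++r)
                s += x[r] * yp[r];
            pp[b] = s;
        }
    }
    // All tasks have completed at the implicit barrier ending the parallel region.
    double total = 0.0;
    for (std::size_t b = 0; b < nblocks; ++b)
        total += partial[b];
    return total;
}
```

The sketch adds four directives to otherwise serial code, and it still compiles and runs serially when OpenMP is disabled. Note that `priority` is only a hint, and the default maximum task priority is 0, so the clause has no effect unless `OMP_MAX_TASK_PRIORITY` is set to a positive value.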

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); USDOE Office of Science (SC), Basic Energy Sciences (BES)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1564203
Resource Relation:
Journal Volume: 11718; Conference: 15th International Workshop on OpenMP (IWOMP 2019), Auckland, New Zealand, September 11-13, 2019
Country of Publication:
United States
Language:
English
