Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Porting DMRG++ Scientific Application to OpenPOWER

Conference ·
With the rapidly changing microprocessor designs and architectural diversity (multi-cores, many-cores, accelerators) for the next generation HPC systems, scientific applications must adapt to the hardware, to exploit the different types of parallelism and resources available in the architecture. To get the benefit of all the in-node hardware threads, it is important to use a single programming model to map and coordinate the available work to the different heterogeneous execution units in the node (e.g., multi-core hardware threads (latency optimized), accelerators (bandwidth optimized), etc.).Our goal is to show that we can manage the node complexity of these systems by using OpenMP for in-node parallelization by exploiting different “programming styles” supported by OpenMP 4.5 to program CPU cores and accelerators. Finding out the suitable programming-style (e.g., SPMD style, multi-level tasks, accelerator programming, nested parallelism, or a combination of these) using the latest features of OpenMP to maximize performance and achieve performance portability across heterogeneous and homogeneous systems is still an open research problem.We developed a mini-application, Kronecker Product (KP), from the original DMRG++ application (sparse matrix algebra) computational motif to experiment with different OpenMP programming styles on an OpenPOWER architecture and present their results in this paper.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1528734
Country of Publication:
United States
Language:
English

References (4)

The density matrix renormalization group for strongly correlated electron systems: A generic implementation journal September 2009
The ubiquitous Kronecker product journal November 2000
Scheduling Dynamic OpenMP Applications over Multicore Architectures book January 2008
Automatic thread distribution for nested parallelism in OpenMP conference January 2005

Similar Records

MiniApp for Density Matrix Renormalization Group Hamiltonian Application Kernel
Conference · Sat Sep 01 00:00:00 EDT 2018 · OSTI ID:1484995

First experience of vectorizing electromagnetic physics models for detector simulation
Conference · Tue Dec 22 23:00:00 EST 2015 · J.Phys.Conf.Ser. · OSTI ID:1250786

Experiences with High-Level Programming Directives for Porting Applications to GPUs
Conference · Sat Dec 31 23:00:00 EST 2011 · OSTI ID:1049778

Related Subjects