Numerical eigen-spectrum slicing, accurate orthogonal eigen-basis, and mixed-precision eigenvalue refinement using OpenMP data-dependent tasks and accelerator offload
- MIT Lincoln Lab, LLS, CMIT Lincoln Laboratory, Lexington, MA, USA; Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, USA
- Technical Development, Synopsys, Inc., Sunnyvale, CA, USA
- ML Compilers and AI Accelerators, Meta, Inc., Menlo Park, CA, USA
- Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, USA
- Innovative Computing Laboratory, University of Tennessee, Knoxville, TN, USA; Computer Science and Mathematics, Oak Ridge National Laboratory, Oak Ridge, TN, USA; Applied Mathematics, University of Manchester, Manchester, UK
Performing a variety of numerical computations efficiently and, at the same time, portably requires both an overarching design and a number of implementation strategies. Both are exemplified below as we present the transition of the PLASMA numerical library from dependence-driven large tasks to fine-grained tasking and offload to hardware accelerators, while keeping its core dependence set: OpenMP source-code pragmas and runtime for most system-level functionality, and basic low-level numerical kernels provided directly by hardware vendors or by open-source projects with vendor contributions. We also present new algorithmic methods and their efficient parallel implementations, including fine-grained tasking for eigen-spectrum slicing and offload for mixed-precision eigenvalue refinement. We provide performance, scaling, and numerical results showing sizable gains over the available solutions from both open-source and vendor-provided packages.
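To make the idea of eigen-spectrum slicing concrete, the following is a minimal sketch and not the PLASMA implementation. It assumes a symmetric tridiagonal matrix given by its diagonal `d` and off-diagonal `e`, counts eigenvalues below a shift with a Sturm sequence, and locates each eigenvalue by bisection. The index range is cut into slices that are solved independently; a Python thread pool stands in here for the fine-grained OpenMP tasks the abstract mentions. All function names (`sturm_count`, `kth_eigenvalue`, `sliced_eigenvalues`) and parameters are hypothetical and chosen only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

PIVMIN = 1e-300  # assumed safeguard against division by a zero pivot


def sturm_count(d, e, x):
    """Count eigenvalues strictly less than x for the symmetric
    tridiagonal matrix with diagonal d and off-diagonal e."""
    count = 0
    q = 1.0
    for i in range(len(d)):
        coupling = (e[i - 1] * e[i - 1]) / q if i > 0 else 0.0
        q = d[i] - x - coupling
        if abs(q) <= PIVMIN:
            q = -PIVMIN  # treat a vanishing pivot as slightly negative
        if q < 0.0:
            count += 1
    return count


def gershgorin_bounds(d, e):
    """Interval guaranteed to contain every eigenvalue."""
    n = len(d)
    radius = [
        (abs(e[i - 1]) if i > 0 else 0.0) + (abs(e[i]) if i < n - 1 else 0.0)
        for i in range(n)
    ]
    lo = min(d[i] - radius[i] for i in range(n))
    hi = max(d[i] + radius[i] for i in range(n))
    return lo, hi


def kth_eigenvalue(d, e, k, tol=1e-12):
    """Bisection for the k-th smallest eigenvalue (k is 0-based)."""
    lo, hi = gershgorin_bounds(d, e)
    for _ in range(200):  # hard cap so the loop always terminates
        if hi - lo <= tol:
            break
        mid = 0.5 * (lo + hi)
        if sturm_count(d, e, mid) > k:
            hi = mid  # more than k eigenvalues lie below mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def sliced_eigenvalues(d, e, num_slices=4, tol=1e-12):
    """Split the eigenvalue indices into slices and solve each slice
    independently (assumes len(d) >= 1 and num_slices >= 1)."""
    n = len(d)
    step = -(-n // num_slices)  # ceiling division
    slices = [range(s, min(s + step, n)) for s in range(0, n, step)]

    def solve_slice(indices):
        return [kth_eigenvalue(d, e, k, tol) for k in indices]

    with ThreadPoolExecutor() as pool:
        parts = list(pool.map(solve_slice, slices))
    return [value for part in parts for value in part]
```

Because each slice depends only on the read-only inputs `d` and `e`, the slices have no data dependences on one another, which is the property that makes this kind of computation a natural fit for task-based parallelism.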
- Sponsoring Organization:
- USDOE
- OSTI ID:
- 2447519
- Journal Information:
- Journal Name: International Journal of High Performance Computing Applications; Journal Volume: 38; Journal Issue: 6; ISSN 1094-3420
- Publisher:
- SAGE Publications
- Country of Publication:
- United States
- Language:
- English
Similar Records
Experimental Characterization of OpenMP Offloading Memory Operations and Unified Shared Memory Support
Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead