A Step towards Energy Efficient Computing: Redesigning a Hydrodynamic Application on CPU-GPU, In: 2014 IEEE 28th International Parallel and Distributed Processing Symposium

Dong, Tingxing; Dobrev, Veselin; Kolev, Tzanio; Rieben, Robert; Tomov, Stanimire; Dongarra, Jack

doi:10.1109/IPDPS.2014.103

A Step towards Energy Efficient Computing: Redesigning a Hydrodynamic Application on CPU-GPU, In: 2014 IEEE 28th International Parallel and Distributed Processing Symposium

Conference · Thu May 01 04:00:00 EDT 2014 · 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM

DOI:https://doi.org/10.1109/IPDPS.2014.103· OSTI ID:1567367

Dong, Tingxing; Dobrev, Veselin; Kolev, Tzanio; Rieben, Robert; Tomov, Stanimire; Dongarra, Jack

Power and energy consumption are becoming an increasing concern in high performance computing. Compared to multi-core CPUs, GPUs have a much better performance per watt. In this paper we discuss efforts to redesign the most computation intensive parts of BLAST, an application that solves the equations for compressible hydrodynamics with high order finite elements, using GPUs BLAST, Dobrev. In order to exploit the hardware parallelism of GPUs and achieve high performance, we implemented custom linear algebra kernels. We intensively optimized our CUDA kernels by exploiting the memory hierarchy, which exceed the vendor's library routines substantially in performance. We proposed an auto tuning technique to adapt our CUDA kernels to the orders of the finite element method. Compared to a previous base implementation, our redesign and optimization lowered the energy consumption of the GPU in two aspects: 60% less time to solution and 10% less power required. Compared to the CPU-only solution, our GPU accelerated BLAST obtained a 2.5× overall speedup and 1.42× energy efficiency (green up) using 4th order (Q_4) finite elements, and a 1.9× speedup and 1.27× green up using 2nd order (Q2) finite elements.

🛈

OSTI does not have a digital full text copy available. For more information, please see document availability, search WorldCat, or search Google Scholar.

Research Organization:: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Sponsoring Organization:: USDOE Office of Science (SC)

DOE Contract Number:: AC52-07NA27344

OSTI ID:: 1567367

Conference Information:: Journal Name: 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM

Country of Publication:: United States

Language:: English

Similar Records

Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer

Journal Article · Sun Nov 30 23:00:00 EST 2014 · Journal of Computational Physics · OSTI ID:22382148

A 3D front tracking method on a CPU/GPU system

Journal Article · Thu Jan 20 23:00:00 EST 2011 · OSTI ID:1048813

High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster

Journal Article · Fri Oct 01 00:00:00 EDT 2010 · Journal of Computational Physics · OSTI ID:21418106

Related Subjects

Computer Science

A Step towards Energy Efficient Computing: Redesigning a Hydrodynamic Application on CPU-GPU, In: 2014 IEEE 28th International Parallel and Distributed Processing Symposium

Citation Formats

Similar Records

Related Subjects