Parallel Application Performance on Two Generations of Intel Xeon HPC Platforms

Chang, Christopher H.; Long, Hai; Sides, Scott; Vaidhynathan, Deepthi; Jones, Wesley

doi:10.2172/1226160


Technical Report · Thu Oct 15 00:00:00 EDT 2015

DOI: https://doi.org/10.2172/1226160 · OSTI ID: 1226160

Chang, Christopher H. ^[1]; Long, Hai ^[1]; Sides, Scott ^[1]; Vaidhynathan, Deepthi ^[1]; Jones, Wesley ^[1]

National Renewable Energy Laboratory (NREL), Golden, CO (United States)

Two next-generation node configurations hosting the Haswell microarchitecture were tested with a suite of microbenchmarks and application examples, and compared with a current Ivy Bridge production node on NREL's Peregrine high-performance computing cluster. A primary conclusion from this study is that the additional cores are of little value to individual task performance: limitations to application parallelism, or resource contention among concurrently running but independent tasks, limit effective utilization of these added cores. Hyperthreading generally impacts throughput negatively, but can improve performance in the absence of detailed attention to runtime workflow configuration. The observations offer some guidance to procurement of future HPC systems at NREL. First, raw core count must be balanced with available resources, particularly memory bandwidth. Balance-of-system will determine value more than processor capability alone. Second, hyperthreading continues to be largely irrelevant to the workloads that are commonly seen, and were tested here, at NREL. Finally, perhaps the most impactful enhancement to productivity might occur through enabling multiple concurrent jobs per node. Given the right type and size of workload, more may be achieved by doing many slow things at once than by doing fast things in order.
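The closing point of the abstract, that several concurrent jobs per node can outperform running jobs one after another, can be illustrated with a minimal sketch based on Amdahl's law. This is not the method or data of the report: the parallel fraction, the core count, and the function names `amdahl_speedup` and `node_throughput` are hypothetical, chosen only for illustration. The model also ignores the memory-bandwidth contention between concurrent jobs that the abstract identifies as a limiting factor, so it gives an optimistic upper bound.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Ideal Amdahl's-law speedup over a single core.

    Assumes no contention for shared resources such as memory bandwidth.
    """
    return 1.0 / ((1.0 - parallel_fraction) + parallel_fraction / cores)


def node_throughput(parallel_fraction: float, node_cores: int,
                    concurrent_jobs: int) -> float:
    """Jobs completed per unit of single-core runtime when the node's
    cores are split evenly among `concurrent_jobs` independent jobs."""
    if node_cores % concurrent_jobs != 0:
        raise ValueError("concurrent_jobs must divide node_cores evenly")
    cores_per_job = node_cores // concurrent_jobs
    return concurrent_jobs * amdahl_speedup(parallel_fraction, cores_per_job)


if __name__ == "__main__":
    P = 0.90         # hypothetical parallel fraction, not taken from the report
    NODE_CORES = 24  # hypothetical core count, not taken from the report
    for jobs in (1, 2, 4, 8):
        rate = node_throughput(P, NODE_CORES, jobs)
        print(f"{jobs} concurrent job(s): relative throughput {rate:.2f}")
```

Under these assumed values, each individual job runs more slowly as the node is shared among more jobs, yet the modeled aggregate throughput rises. Real workloads would gain less wherever concurrent jobs compete for memory bandwidth.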


Research Organization: National Renewable Energy Laboratory (NREL), Golden, CO (United States)

Sponsoring Organization: USDOE Office of Energy Efficiency and Renewable Energy (EERE)

DOE Contract Number: AC36-08GO28308

OSTI ID: 1226160

Report Number(s): NREL/TP-2C00-64268

Country of Publication: United States

Language: English

Similar Records

RADICAL-Pilot and PMIx/PRRTE: Executing Heterogeneous Workloads at Large Scale on Partitioned HPC Resources

Conference · Sun Jan 01 00:00:00 EST 2023 · OSTI ID:1226160

Titov, Mikhail; Turilli, Matteo; Merzky, Andre; +3 more

RADICAL-Pilot and PMIx/PRRTE: Executing Heterogeneous Workloads at Large Scale on Partitioned HPC Resources

Journal Article · Thu Jan 12 00:00:00 EST 2023 · Lecture Notes in Computer Science · OSTI ID:1226160

Titov, Mikhail; Turilli, Matteo; Merzky, Andre; +3 more

Roofline Analysis in the Intel® Advisor to Deliver Optimized Performance for applications on Intel® Xeon Phi™ Processor

Conference · Tue May 23 00:00:00 EDT 2017 · OSTI ID:1226160

Koskela, Tuomas S.; Lobet, Mathieu; Deslippe, Jack; +1 more

Related Subjects

97 MATHEMATICS AND COMPUTING
benchmarking
Haswell
Peregrine
STREAM
multiply
VASP
Gaussian
LAMMPS
Amber
