Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Performance and Energy Usage of Workloads on KNL and Haswell Architectures. In: High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation

Conference · · Lecture Notes in Computer Science
 [1];  [2];  [2];  [2];  [2]
  1. Clemson Univ., SC (United States)
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Manycore architectures are an energy-efficient step towards exascale computing within a constrained power budget. The Intel Knights Landing (KNL) manycore chip is a specific example of this and has seen early adoption by a number of HPC facilities. It is therefore important to understand the performance and energy usage characteristics of KNL. In this paper, we evaluate the performance and energy efficiency of KNL in contrast to the Xeon (Haswell) architecture for applications representative of the workload of users at NERSC. We consider the optimal MPI/OpenMP configuration of each application and use the results to characterize KNL in contrast to Haswell. As well as traditional DDR memory, KNL contains MCDRAM and we also evaluate its efficacy. Our results show that, averaged over our benchmarks, KNL is 1.84 × more energy efficient than Haswell and has 1.27 × greater performance.
Research Organization:
Lawrence Berkeley National Laboratory-National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1546612
Conference Information:
Journal Name: Lecture Notes in Computer Science Journal Volume: 10724
Country of Publication:
United States
Language:
English

References (13)

Castro: a new Compressible Astrophysical Solver. i. Hydrodynamics and Self-Gravity journal May 2010
Argonne applications for the IBM Blue Gene/Q, Mira journal January 2013
Effective Holistic Performance Measurement at Petascale Using IPM book January 2011
Determination of performance characteristics of scientific applications on IBM Blue Gene/Q journal January 2013
Runtime Power Limiting of Parallel Applications on Intel Xeon Phi Processors conference November 2016
The impact of hyper-threading on processor resource utilization in production applications conference December 2011
Evaluating and Optimizing the NERSC Workload on Knights Landing
  • Barnes, Taylor; Cook, Brandon; Deslippe, Jack
  • 2016 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS) https://doi.org/10.1109/PMBS.2016.010
conference November 2016
Nyx: A MASSIVELY PARALLEL AMR CODE FOR COMPUTATIONAL COSMOLOGY journal February 2013
Metrics for Energy-Aware Software Optimisation book January 2017
Introducing FIRESTARTER: A processor stress test utility
  • Hackenberg, Daniel; Oldenburg, Roland; Molka, Daniel
  • 2013 International Green Computing Conference (IGCC), 2013 International Green Computing Conference Proceedings https://doi.org/10.1109/IGCC.2013.6604507
conference June 2013
Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL conference May 2017
Performance Analysis and Workload Characterization with IPM book January 2010
Effective Performance Measurement at Petascale Using IPM conference December 2010

Similar Records

Electronic Structure Theory Calculations Using Modern Architectures: KNL vs Haswell
Journal Article · Tue Oct 26 00:00:00 EDT 2021 · Journal of Chemical Theory and Computation · OSTI ID:1873417

A Locality-Based Threading Algorithm for the Configuration-Interaction Method
Journal Article · Mon Jul 03 00:00:00 EDT 2017 · IEEE International Symposium on Parallel and Distributed Processing, Workshops and Phd Forum · OSTI ID:1393243

MILC staggered conjugate gradient performance on Intel KNL
Conference · Thu Nov 03 00:00:00 EDT 2016 · Proceedings of Science (POS) · OSTI ID:1398438