Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Preparing NERSC users for Cori, a Cray XC40 system with Intel many integrated cores

Journal Article · · Concurrency and Computation. Practice and Experience
DOI:https://doi.org/10.1002/cpe.4291· OSTI ID:1459400
The newest NERSC supercomputer Cori is a Cray XC40 system consisting of 2,388 Intel Xeon Haswell nodes and 9,688 Intel Xeon-Phi “Knights Landing” (KNL) nodes. Compared to the Xeon-based clusters NERSC users are familiar with, optimal performance on Cori requires consideration of KNL mode settings; process, thread, and memory affinity; fine-grain parallelization; vectorization; and use of the high-bandwidth MCDRAM memory. This paper describes our efforts preparing NERSC users for KNL through the NERSC Exascale Science Application Program, Web documentation, and user training. We discuss how we configured the Cori system for usability and productivity, addressing programming concerns, batch system configurations, and default KNL cluster and memory modes. Here, system usage data, job completion analysis, programming and running jobs issues, and a few successful user stories on KNL are presented.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1459400
Journal Information:
Concurrency and Computation. Practice and Experience, Journal Name: Concurrency and Computation. Practice and Experience Journal Issue: 1 Vol. 30; ISSN 1532-0626
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English

References (5)

Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set journal July 1996
BerkeleyGW: A massively parallel computer package for the calculation of the quasiparticle and optical properties of materials and nanostructures journal June 2012
Evaluating and Optimizing the NERSC Workload on Knights Landing
  • Barnes, Taylor; Cook, Brandon; Deslippe, Jack
  • 2016 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS) https://doi.org/10.1109/PMBS.2016.010
conference November 2016
Roofline: an insightful visual performance model for multicore architectures journal April 2009
Hopper Workload Analysis report May 2014

Cited By (2)

Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars journal September 2018
MCtandem: an efficient tool for large-scale peptide identification on many integrated core (MIC) architecture journal July 2019