skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Early Evaluation of the Cray XC40 Xeon Phi System ‘Theta’ at Argonne

Authors:
; ; ; ; ;
Publication Date:
Research Org.:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Org.:
Argonne National Laboratory - Argonne Leadership Computing Facility; USDOE Office of Science - Office of Basic Energy Sciences - Scientific User Facilities Division
OSTI Identifier:
1393541
DOE Contract Number:
AC02-06CH11357
Resource Type:
Conference
Resource Relation:
Conference: 2017 Cray Users Group meeting, 05/08/17 - 05/11/17, Redmond, WA, US
Country of Publication:
United States
Language:
English

Citation Formats

Parker, Scott, Morozov, Vitali, Chunduri, Sudheer, Harms, Kevin, Knight, Chris, and Kumaran, Kalyan. Early Evaluation of the Cray XC40 Xeon Phi System ‘Theta’ at Argonne. United States: N. p., 2017. Web.
Parker, Scott, Morozov, Vitali, Chunduri, Sudheer, Harms, Kevin, Knight, Chris, & Kumaran, Kalyan. Early Evaluation of the Cray XC40 Xeon Phi System ‘Theta’ at Argonne. United States.
Parker, Scott, Morozov, Vitali, Chunduri, Sudheer, Harms, Kevin, Knight, Chris, and Kumaran, Kalyan. Mon . "Early Evaluation of the Cray XC40 Xeon Phi System ‘Theta’ at Argonne". United States. doi:. https://www.osti.gov/servlets/purl/1393541.
@article{osti_1393541,
title = {Early Evaluation of the Cray XC40 Xeon Phi System ‘Theta’ at Argonne},
author = {Parker, Scott and Morozov, Vitali and Chunduri, Sudheer and Harms, Kevin and Knight, Chris and Kumaran, Kalyan},
abstractNote = {},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Mon May 08 00:00:00 EDT 2017},
month = {Mon May 08 00:00:00 EDT 2017}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share:
  • Abstract not provided.
  • Power consumption of extreme-scale supercomputers has become a key performance bottleneck. Yet current practices do not leverage power management opportunities, instead running at ''maximum power''. This is not sustainable. Future systems will need to manage power as a critical resource, directing where it has greatest benefit. Power capping is one mechanism for managing power budgets, however its behavior is not well understood. This paper presents an empirical evaluation of several key HPC workloads running under a power cap on a Cray XC40 system, and provides a comparison of this technique with p-state control, demonstrating the performance differences of each. Thesemore » results show: 1. Maximum performance requires ensuring the cap is not reached; 2. Performance slowdown under a cap can be attributed to cascading delays which result in unsynchronized performance variability across nodes; and, 3. Due to lag in reaction time, considerable time is spent operating above the set cap. This work provides a timely and much needed comparison of HPC application performance under a power cap and attempts to enable users and system administrators to understand how to best optimize application performance on power-constrained HPC systems.« less
  • Ab-initio Molecular Dynamics (AIMD) methods are an important class of algorithms, as they enable scientists to understand the chemistry and dynamics of molecular and condensed phase systems while retaining a first-principles-based description of their interactions. Many-core architectures such as the Intel® Xeon Phi™ processor are an interesting and promising target for these algorithms, as they can provide the computational power that is needed to solve interesting problems in chemistry. In this paper, we describe the efforts of refactoring the existing AIMD plane-wave method of NWChem from an MPI-only implementation to a scalable, hybrid code that employs MPI and OpenMP tomore » exploit the capabilities of current and future many-core architectures. We describe the optimizations required to get close to optimal performance for the multiplication of the tall-and-skinny matrices that form the core of the computational algorithm. We present strong scaling results on the complete AIMD simulation for a test case that simulates 256 water molecules and that strong-scales well on a cluster of 1024 nodes of Intel Xeon Phi processors. We compare the performance obtained with a cluster of dual-socket Intel® Xeon® E5–2698v3 processors.« less