skip to main content

Title: Wilson Dslash Kernel From Lattice QCD Optimization

Lattice Quantum Chromodynamics (LQCD) is a numerical technique used for calculations in Theoretical Nuclear and High Energy Physics. LQCD is traditionally one of the first applications ported to many new high performance computing architectures and indeed LQCD practitioners have been known to design and build custom LQCD computers. Lattice QCD kernels are frequently used as benchmarks (e.g. 168.wupwise in the SPEC suite) and are generally well understood, and as such are ideal to illustrate several optimization techniques. In this chapter we will detail our work in optimizing the Wilson-Dslash kernels for Intel Xeon Phi, however, as we will show the technique gives excellent performance on regular Xeon Architecture as well.
Authors:
 [1] ;  [2] ;  [3] ;  [3]
  1. Jefferson Lab, Newport News, VA
  2. Parallel Computing Lab, Intel Corporation, California, USA
  3. Parallel Computing Lab, Intel Corporation, India
Publication Date:
OSTI Identifier:
1223094
Report Number(s):
JLAB-IT-15-01; DOE/OR/23177-3581
DOE Contract Number:
AC05-06OR23177
Resource Type:
Book
Research Org:
Thomas Jefferson National Accelerator Facility, Newport News, VA (United States)
Sponsoring Org:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21); USDOE Office of Science (SC), Nuclear Physics (NP) (SC-26); USDOE Office of Science (SC), High Energy Physics (HEP) (SC-25)
Country of Publication:
United States
Language:
English
Subject:
Lattice QCD, Wilson-Dslash, Parallel Computing, Optimization, Blocking, Code-Generators