skip to main content

Title: Wilson Dslash Kernel From Lattice QCD Optimization

Lattice Quantum Chromodynamics (LQCD) is a numerical technique used for calculations in Theoretical Nuclear and High Energy Physics. LQCD is traditionally one of the first applications ported to many new high performance computing architectures and indeed LQCD practitioners have been known to design and build custom LQCD computers. Lattice QCD kernels are frequently used as benchmarks (e.g. 168.wupwise in the SPEC suite) and are generally well understood, and as such are ideal to illustrate several optimization techniques. In this chapter we will detail our work in optimizing the Wilson-Dslash kernels for Intel Xeon Phi, however, as we will show the technique gives excellent performance on regular Xeon Architecture as well.
 [1] ;  [2] ;  [3] ;  [3]
  1. Jefferson Lab, Newport News, VA
  2. Parallel Computing Lab, Intel Corporation, California, USA
  3. Parallel Computing Lab, Intel Corporation, India
Publication Date:
OSTI Identifier:
Report Number(s):
JLAB-IT-15-01; DOE/OR/23177-3581
DOE Contract Number:
Resource Type:
Research Org:
Thomas Jefferson National Accelerator Facility, Newport News, VA (United States)
Sponsoring Org:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21); USDOE Office of Science (SC), Nuclear Physics (NP) (SC-26); USDOE Office of Science (SC), High Energy Physics (HEP) (SC-25)
Country of Publication:
United States
Lattice QCD, Wilson-Dslash, Parallel Computing, Optimization, Blocking, Code-Generators