Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Optimized Lattice QCD kernels for a Pentium 4 Cluster

Technical Report ·
DOI:https://doi.org/10.2172/954827· OSTI ID:954827

Soon, a new cluster of parallel Pentium 4 machines will be set up at JLAB to run Lattice QCD calculations. I discuss the rationale for optimized Lattice QCD routines, and how the features of the Pentium 4 enable new optimized routines to run much faster than normal C routines. I describe the optimization strategies used in SU(3) linear algebra routines, and in both single-node and parallel implementations of the Wilson-Dirac Operator. Finally, I show single node performance timings for the parallel version of the Wilson-Dirac operator.

Research Organization:
Thomas Jefferson Lab National Accelerator Facility
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-84ER40150
OSTI ID:
954827
Report Number(s):
JLAB-THY-01-29
Country of Publication:
United States
Language:
English

Similar Records

Lattice QCD production on commodity clusters at Fermilab
Conference · Tue Sep 30 00:00:00 EDT 2003 · OSTI ID:815581

FermiQCD: A tool kit for parallel lattice QCD applications
Journal Article · Thu Feb 28 23:00:00 EST 2002 · Nuclear Physics. B, Proceedings Supplements · OSTI ID:1155661

Loop representations of the quark determinant in lattice QCD
Journal Article · Wed Sep 01 00:00:00 EDT 1999 · Physical Review, D · OSTI ID:362731