Optimized Lattice QCD kernels for a Pentium 4 Cluster
A new cluster of parallel Pentium 4 machines will soon be set up at JLAB to run Lattice QCD calculations. I discuss the rationale for optimized Lattice QCD routines, and how the features of the Pentium 4 enable new optimized routines to run much faster than normal C routines. I describe the optimization strategies used in SU(3) linear algebra routines, and in both single-node and parallel implementations of the Wilson-Dirac operator. Finally, I show single-node performance timings for the parallel version of the Wilson-Dirac operator.
- Research Organization: Thomas Jefferson National Accelerator Facility
- Sponsoring Organization: USDOE
- DOE Contract Number: AC05-84ER40150
- OSTI ID: 954827
- Report Number(s): JLAB-THY-01-29
- Country of Publication: United States
- Language: English
Similar Records
- Lattice QCD production on commodity clusters at Fermilab · Conference · Tue Sep 30 00:00:00 EDT 2003 · OSTI ID: 815581
- FermiQCD: A tool kit for parallel lattice QCD applications · Journal Article · Thu Feb 28 23:00:00 EST 2002 · Nuclear Physics. B, Proceedings Supplements · OSTI ID: 1155661
- Loop representations of the quark determinant in lattice QCD · Journal Article · Wed Sep 01 00:00:00 EDT 1999 · Physical Review, D · OSTI ID: 362731