OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: National Computational Infrastructure for Lattice Gauge Theory SciDAC-2 Closeout Report

Abstract

As part of this project, researchers from Vanderbilt University, Fermi National Laboratory and Illinois Institute of Technology developed a real-time, fault-tolerant cluster monitoring framework. This framework is open source and is available for download upon request. It has also been used at Fermi Laboratory, Vanderbilt University and Mississippi State University on projects other than LQCD. The goal of the scientific workflow project is to investigate and develop domain-specific workflow tools for LQCD that help orchestrate, in parallel, computational campaigns consisting of many loosely coupled batch processing jobs. Major requirements for an LQCD workflow system include: a system to manage input metadata (e.g. physics parameters such as masses); a system to manage and permit the reuse of templates describing workflows; a system to capture data provenance information; a system to manage produced data; a means of monitoring workflow progress and status; a means of resuming or extending a stopped workflow; and fault tolerance features to enhance the reliability of running workflows. Requirements for an LQCD workflow system are available in documentation.

Authors:
Sun, Xian-He
Publication Date:
2013-08-01
Research Org.:
Illinois Institute of Technology, Chicago, IL (United States)
Sponsoring Org.:
USDOE Office of Science (SC), High Energy Physics (HEP)
OSTI Identifier:
1089000
Report Number(s):
Final
DOE Contract Number:  
FC02-06ER41442
Resource Type:
Technical Report
Country of Publication:
United States
Language:
English
Subject:
72 PHYSICS OF ELEMENTARY PARTICLES AND FIELDS; 97 MATHEMATICS AND COMPUTING

Citation Formats

Sun, Xian-He. National Computational Infrastructure for Lattice Gauge Theory SciDAC-2 Closeout Report. United States: N. p., 2013. Web. doi:10.2172/1089000.
Sun, Xian-He. National Computational Infrastructure for Lattice Gauge Theory SciDAC-2 Closeout Report. United States. https://doi.org/10.2172/1089000
Sun, Xian-He. 2013. "National Computational Infrastructure for Lattice Gauge Theory SciDAC-2 Closeout Report". United States. https://doi.org/10.2172/1089000. https://www.osti.gov/servlets/purl/1089000.
@techreport{osti_1089000,
title = {National Computational Infrastructure for Lattice Gauge Theory SciDAC-2 Closeout Report},
author = {Sun, Xian-He},
abstractNote = {As part of this project, researchers from Vanderbilt University, Fermi National Laboratory and Illinois Institute of Technology developed a real-time, fault-tolerant cluster monitoring framework. This framework is open source and is available for download upon request. It has also been used at Fermi Laboratory, Vanderbilt University and Mississippi State University on projects other than LQCD. The goal of the scientific workflow project is to investigate and develop domain-specific workflow tools for LQCD that help orchestrate, in parallel, computational campaigns consisting of many loosely coupled batch processing jobs. Major requirements for an LQCD workflow system include: a system to manage input metadata (e.g. physics parameters such as masses); a system to manage and permit the reuse of templates describing workflows; a system to capture data provenance information; a system to manage produced data; a means of monitoring workflow progress and status; a means of resuming or extending a stopped workflow; and fault tolerance features to enhance the reliability of running workflows. Requirements for an LQCD workflow system are available in documentation.},
doi = {10.2172/1089000},
url = {https://www.osti.gov/biblio/1089000},
institution = {Illinois Institute of Technology},
address = {Chicago, IL (United States)},
year = {2013},
month = aug
}