Reproducibility in a multiprocessor system
Abstract
Fixing a problem is usually greatly aided if the problem is reproducible. To ensure reproducibility of a multiprocessor system, the following aspects are proposed; a deterministic system start state, a single system clock, phase alignment of clocks in the system, system-wide synchronization events, reproducible execution of system components, deterministic chip interfaces, zero-impact communication with the system, precise stop of the system and a scan of the system state.
- Inventors:
- Issue Date:
- Research Org.:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1108971
- Patent Number(s):
- 8595554
- Application Number:
- 12/774,475
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Bellofatto, Ralph A, Chen, Dong, Coteus, Paul W, Eisley, Noel A, Gara, Alan, Gooding, Thomas M, Haring, Rudolf A, Heidelberger, Philip, Kopcsay, Gerard V, Liebsch, Thomas A, Ohmacht, Martin, Reed, Don D, Senger, Robert M, Steinmacher-Burow, Burkhard, and Sugawara, Yutaka. Reproducibility in a multiprocessor system. United States: N. p., 2013.
Web.
Bellofatto, Ralph A, Chen, Dong, Coteus, Paul W, Eisley, Noel A, Gara, Alan, Gooding, Thomas M, Haring, Rudolf A, Heidelberger, Philip, Kopcsay, Gerard V, Liebsch, Thomas A, Ohmacht, Martin, Reed, Don D, Senger, Robert M, Steinmacher-Burow, Burkhard, & Sugawara, Yutaka. Reproducibility in a multiprocessor system. United States.
Bellofatto, Ralph A, Chen, Dong, Coteus, Paul W, Eisley, Noel A, Gara, Alan, Gooding, Thomas M, Haring, Rudolf A, Heidelberger, Philip, Kopcsay, Gerard V, Liebsch, Thomas A, Ohmacht, Martin, Reed, Don D, Senger, Robert M, Steinmacher-Burow, Burkhard, and Sugawara, Yutaka. Tue .
"Reproducibility in a multiprocessor system". United States. https://www.osti.gov/servlets/purl/1108971.
@article{osti_1108971,
title = {Reproducibility in a multiprocessor system},
author = {Bellofatto, Ralph A and Chen, Dong and Coteus, Paul W and Eisley, Noel A and Gara, Alan and Gooding, Thomas M and Haring, Rudolf A and Heidelberger, Philip and Kopcsay, Gerard V and Liebsch, Thomas A and Ohmacht, Martin and Reed, Don D and Senger, Robert M and Steinmacher-Burow, Burkhard and Sugawara, Yutaka},
abstractNote = {Fixing a problem is usually greatly aided if the problem is reproducible. To ensure reproducibility of a multiprocessor system, the following aspects are proposed; a deterministic system start state, a single system clock, phase alignment of clocks in the system, system-wide synchronization events, reproducible execution of system components, deterministic chip interfaces, zero-impact communication with the system, precise stop of the system and a scan of the system state.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2013},
month = {11}
}
Works referenced in this record:
Packaging the Blue Gene/L supercomputer
journal, March 2005
- Coteus, P.; Bickford, H. R.; Cipolla, T. M.
- IBM Journal of Research and Development, Vol. 49, Issue 2.3
Blue Gene/L compute chip: Control, test, and bring-up infrastructure
journal, March 2005
- Haring, R. A.; Bellofatto, R.; Bright, A. A.
- IBM Journal of Research and Development, Vol. 49, Issue 2.3
Blue Gene/L compute chip: Synthesis, timing, and physical design
journal, March 2005
- Bright, A. A.; Haring, R. A.; Dombrowa, M. B.
- IBM Journal of Research and Development, Vol. 49, Issue 2.3
Durable memory RS/6000 system design
conference, January 1994
- Abbott, M.; Har, D.; Herger, L.
- Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing