Buffered coscheduling for parallel programming and enhanced fault tolerance
Abstract
A computer-implemented method schedules processor jobs on a network of parallel machine processors or distributed system processors. Control information communications generated by each process performed by each processor during a defined time interval are accumulated in buffers, where adjacent time intervals are separated by strobe intervals for a global exchange of control information. A global exchange of the control information communications at the end of each defined time interval is performed during an intervening strobe interval so that each processor is informed by all of the other processors of the number of incoming jobs to be received by each processor in a subsequent time interval. The buffered coscheduling method of this invention also enhances the fault tolerance of a network of parallel machine processors or distributed system processors.
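The buffer-then-strobe cycle described in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: the `Processor` class, `post_send`, `strobe`, and `deliver` are hypothetical names introduced here, and the global exchange is modeled as a plain in-memory count matrix rather than a real network operation.

```python
from collections import defaultdict


class Processor:
    """Hypothetical model of one node (names are illustrative, not from the patent)."""

    def __init__(self, rank, n_procs):
        self.rank = rank
        self.n_procs = n_procs
        self.outgoing = []           # buffered descriptors: (dest, payload)
        self.expected_incoming = {}  # src rank -> count, filled in by strobe()

    def post_send(self, dest, payload):
        # During a time interval, requests are only buffered, never transmitted.
        self.outgoing.append((dest, payload))

    def outgoing_counts(self):
        # Control information: how many buffered jobs target each destination.
        counts = [0] * self.n_procs
        for dest, _ in self.outgoing:
            counts[dest] += 1
        return counts


def strobe(processors):
    """Assumed model of the global exchange at the end of a time interval.

    Afterwards every processor knows how many jobs each peer will send it
    in the subsequent interval. Returns the matrix[src][dest] of counts.
    """
    matrix = [p.outgoing_counts() for p in processors]
    for p in processors:
        p.expected_incoming = {
            src: matrix[src][p.rank]
            for src in range(len(processors))
            if matrix[src][p.rank] > 0
        }
    return matrix


def deliver(processors):
    """Subsequent interval: transmit what was buffered, then clear the buffers."""
    inbox = defaultdict(list)
    for p in processors:
        for dest, payload in p.outgoing:
            inbox[dest].append((p.rank, payload))
        p.outgoing = []
    return dict(inbox)
```

For example, with three processors where rank 0 buffers two jobs for rank 1 and rank 2 buffers one job for rank 1, calling `strobe` leaves rank 1 with `expected_incoming == {0: 2, 2: 1}` before anything is actually transmitted by `deliver`.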
- Inventors:
- Petrini, Fabrizio; Feng, Wu-chun
- Los Alamos, NM
- Issue Date:
- Research Org.:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 908604
- Patent Number(s):
- 6993764
- Application Number:
- 09/895,570
- Assignee:
- The Regents of the University of California (Los Alamos, NM)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- W-7405-ENG-36
- Resource Type:
- Patent
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Petrini, Fabrizio, and Feng, Wu-chun. Buffered coscheduling for parallel programming and enhanced fault tolerance. United States: N. p., 2006. Web.
Petrini, Fabrizio, & Feng, Wu-chun. Buffered coscheduling for parallel programming and enhanced fault tolerance. United States.
Petrini, Fabrizio, and Feng, Wu-chun. 2006. "Buffered coscheduling for parallel programming and enhanced fault tolerance". United States. https://www.osti.gov/servlets/purl/908604.
@article{osti_908604,
title = {Buffered coscheduling for parallel programming and enhanced fault tolerance},
author = {Petrini, Fabrizio and Feng, Wu-chun},
abstractNote = {A computer-implemented method schedules processor jobs on a network of parallel machine processors or distributed system processors. Control information communications generated by each process performed by each processor during a defined time interval are accumulated in buffers, where adjacent time intervals are separated by strobe intervals for a global exchange of control information. A global exchange of the control information communications at the end of each defined time interval is performed during an intervening strobe interval so that each processor is informed by all of the other processors of the number of incoming jobs to be received by each processor in a subsequent time interval. The buffered coscheduling method of this invention also enhances the fault tolerance of a network of parallel machine processors or distributed system processors.},
place = {United States},
year = {2006},
month = {1}
}
Works referenced in this record:
Scheduling with implicit information in distributed systems
journal, June 1998
- Arpaci-Dusseau, Andrea C.; Culler, David E.; Mainwaring, Alan M.
- ACM SIGMETRICS Performance Evaluation Review, Vol. 26, Issue 1
All-to-all personalized communication in a wormhole-routed torus
journal, May 1996
- Tseng, Yu-Chee; Gupta, S. K. S.
- IEEE Transactions on Parallel and Distributed Systems, Vol. 7, Issue 5, p. 498-505
Simultaneous multithreading: a platform for next-generation processors
journal, September 1997
- Eggers, S. J.; Emer, J. S.; Levy, H. M.
- IEEE Micro, Vol. 17, Issue 5
Concurrent event handling through multithreading
journal, September 1999
- Keckler, S. W.; Chang, A.; Lee, W. S.; Chatterjee, S.; Dally, W. J.
- IEEE Transactions on Computers, Vol. 48, Issue 9, p. 903-916