Improving efficiency of a global barrier operation in a parallel computer
Abstract
Performing a global barrier operation in a parallel computer that includes compute nodes coupled for data communications, where each compute node executes tasks, with one task on each compute node designated as a master task, including: for each task on each compute node until all master tasks have joined a global barrier: determining whether the task is a master task; if the task is not a master task, joining a single local barrier; if the task is a master task, joining the global barrier and the single local barrier only after all other tasks on the compute node have joined the single local barrier.
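The two-level scheme in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the patented implementation: Python threads stand in for tasks, `threading.Barrier` stands in for both the local and the global barrier, and the names `Node`, `join_barrier`, and `run_demo` are hypothetical. Task 0 on each simulated node is assumed to be the master.

```python
import threading


class Node:
    """One simulated compute node (hypothetical helper, not from the patent)."""

    def __init__(self, num_tasks):
        self.num_tasks = num_tasks
        # The single local barrier shared by every task on this node.
        self.local_barrier = threading.Barrier(num_tasks)
        # Number of non-master tasks that have reached the local barrier.
        self.arrived = 0
        self.cond = threading.Condition()


def join_barrier(node, is_master, global_barrier):
    """Apply the master / non-master rule from the abstract to one task."""
    if not is_master:
        # Non-master: announce arrival, then join the single local barrier.
        # (Assumption: arrival is recorded just before the wait call.)
        with node.cond:
            node.arrived += 1
            node.cond.notify_all()
        node.local_barrier.wait()
        return
    # Master: wait until every other task on the node has arrived.
    with node.cond:
        node.cond.wait_for(lambda: node.arrived == node.num_tasks - 1)
        node.arrived = 0  # reset so the same node can be reused
    # Only the master takes part in the cross-node barrier.
    global_barrier.wait()
    # Joining the local barrier last releases the node's other tasks.
    node.local_barrier.wait()


def run_demo(num_nodes, tasks_per_node):
    """Run one barrier round and return the ordered enter/exit log."""
    global_barrier = threading.Barrier(num_nodes)  # one party per master
    nodes = [Node(tasks_per_node) for _ in range(num_nodes)]
    log = []

    def task(node_id, task_id):
        log.append(("enter", node_id, task_id))
        join_barrier(nodes[node_id], task_id == 0, global_barrier)
        log.append(("exit", node_id, task_id))

    threads = [
        threading.Thread(target=task, args=(n, t))
        for n in range(num_nodes)
        for t in range(tasks_per_node)
    ]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return log


if __name__ == "__main__":
    events = run_demo(num_nodes=3, tasks_per_node=4)
    print(len(events), "events recorded")
```

In this sketch only one task per node joins the global barrier, so cross-node traffic scales with the number of nodes rather than the number of tasks. No task can leave until every master has joined the global barrier, because each master joins its local barrier last.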
- Issue Date:
- Research Org.:
- International Business Machines Corp., Armonk, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1327915
- Patent Number(s):
- 9459934
- Application Number:
- 13/683,726
- Assignee:
- International Business Machines Corporation (Armonk, NY)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- B554331
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2012 Nov 21
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
"Improving efficiency of a global barrier operation in a parallel computer". United States: N. p., 2016. Web. https://www.osti.gov/servlets/purl/1327915.
@article{osti_1327915,
title = {Improving efficiency of a global barrier operation in a parallel computer},
author = {},
abstractNote = {Performing a global barrier operation in a parallel computer that includes compute nodes coupled for data communications, where each compute node executes tasks, with one task on each compute node designated as a master task, including: for each task on each compute node until all master tasks have joined a global barrier: determining whether the task is a master task; if the task is not a master task, joining a single local barrier; if the task is a master task, joining the global barrier and the single local barrier only after all other tasks on the compute node have joined the single local barrier.},
doi = {},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2016},
month = {10}
}