DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Improving efficiency of a global barrier operation in a parallel computer

Abstract

Performing a global barrier operation in a parallel computer that includes compute nodes coupled for data communications, where each compute node executes tasks, with one task on each compute node designated as a master task, including: for each task on each compute node until all master tasks have joined a global barrier: determining whether the task is a master task; if the task is not a master task, joining a single local barrier; if the task is a master task, joining the global barrier and the single local barrier only after all other tasks on the compute node have joined the single local barrier.

Issue Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1327915
Patent Number(s):
9459934
Application Number:
13/683,726
Assignee:
International Business Machines Corporation (Armonk, NY
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
B554331
Resource Type:
Patent
Resource Relation:
Patent File Date: 2012 Nov 21
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

. Improving efficiency of a global barrier operation in a parallel computer. United States: N. p., 2016. Web.
. Improving efficiency of a global barrier operation in a parallel computer. United States.
. Tue . "Improving efficiency of a global barrier operation in a parallel computer". United States. https://www.osti.gov/servlets/purl/1327915.
@article{osti_1327915,
title = {Improving efficiency of a global barrier operation in a parallel computer},
author = {},
abstractNote = {Performing a global barrier operation in a parallel computer that includes compute nodes coupled for data communications, where each compute node executes tasks, with one task on each compute node designated as a master task, including: for each task on each compute node until all master tasks have joined a global barrier: determining whether the task is a master task; if the task is not a master task, joining a single local barrier; if the task is a master task, joining the global barrier and the single local barrier only after all other tasks on the compute node have joined the single local barrier.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2016},
month = {10}
}

Works referenced in this record:

Automatic generation and tuning of MPI collective communication routines
conference, January 2005


Building packet buffers using interleaved memories
conference, January 2005


Kernel-level single system image for petascale computing
journal, April 2006


Real-Time Performance Monitoring, Adaptive Control, and Interactive Steering of Computational Grids
journal, November 2000


Optimization of MPI collectives on clusters of large-scale SMP's
conference, January 1999


AM++: a generalized active message framework
conference, January 2010

  • Willcock, Jeremiah James; Hoefler, Torsten; Edmonds, Nicholas Gerard
  • Proceedings of the 19th international conference on Parallel architectures and compilation techniques - PACT '10
  • https://doi.org/10.1145/1854273.1854323

Extending the message passing interface (MPI)
conference, January 1995


Computing parallel prefix and reduction using coterie structures
conference, January 1992

  • Herbordt, M. C.; Weems, C. C.
  • [1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation
  • https://doi.org/10.1109/FMPC.1992.234895

Computing the Hough transform on a scan line array processor (image processing)
journal, March 1989


Efficient MPI Collective Operations for Clusters in Long-and-Fast Networks
conference, September 2006


Bandwidth Efficient All-reduce Operation on Tree Topologies
conference, March 2007


Optimizing threaded MPI execution on SMP clusters
conference, January 2001


Interleaved all-to-all reliable broadcast on meshes and hypercubes
journal, May 1994


Efficient algorithms for all-to-all communications in multiport message-passing systems
journal, January 1997


Performance analysis and optimization of MPI collective operations on multi-core clusters
journal, April 2009