Title: Executing a gather operation on a parallel computer

Patent · OSTI ID: 1039561

Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer of the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and storing contribution data gathered from that ranked node. Embodiments also include, repeatedly for each position in the result buffer: determining, by each compute node of the operational group, whether the current position in the result buffer corresponds with the rank of the compute node; if the current position corresponds with the rank of the compute node, contributing, by that compute node, the compute node's contribution data; if the current position does not correspond with the rank of the compute node, contributing, by that compute node, a value of zero for the contribution data; and storing, by the logical root in the current position in the result buffer, the results of a bitwise OR operation on the contribution data of all compute nodes of the operational group for the current position, the results being received through the global combining network.
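
The per-position loop described in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: the function names `contribution` and `gather_via_or` are hypothetical, contribution data is modeled as plain integers, and the global combining network is replaced by an in-memory bitwise OR over a list. The point it shows is that OR-ing one node's data with zeros from every other node leaves that node's data unchanged, so the root recovers each rank's contribution in its own buffer position.

```python
from functools import reduce
from operator import or_
from typing import List


def contribution(rank: int, position: int, data: int) -> int:
    """Value one compute node contributes for a given result-buffer position.

    A node contributes its own data only when the position matches its rank;
    otherwise it contributes zero, the identity element of bitwise OR.
    """
    return data if rank == position else 0


def gather_via_or(node_data: List[int]) -> List[int]:
    """Simulate the gather; node_data[r] is the contribution data of rank r.

    Hypothetical helper: a real system would perform the OR inside the
    global combining network rather than in a Python loop.
    """
    result_buffer = [0] * len(node_data)  # one position per ranked node
    for position in range(len(result_buffer)):
        contributions = [
            contribution(rank, position, data)
            for rank, data in enumerate(node_data)
        ]
        # Stand-in for the network's bitwise OR reduction across all nodes.
        result_buffer[position] = reduce(or_, contributions, 0)
    return result_buffer


# Four simulated compute nodes, ranks 0..3, each holding one integer.
gathered = gather_via_or([0x1A, 0x00, 0xFF, 0x07])
```

In this sketch `gathered` ends up equal to the input list, since each position receives exactly one non-masked contribution. The same idea extends to multi-word contributions by applying the OR word by word.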

Research Organization:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
B519700
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Number(s):
8,140,826
Application Number:
11/754,740
Resource Relation:
Patent File Date: 2007 May 29
Country of Publication:
United States
Language:
English
