OSTI.GOV: U.S. Department of Energy
Office of Scientific and Technical Information

Title: Executing a gather operation on a parallel computer

Abstract

Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer of the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and for storing contribution data gathered from that ranked node. Embodiments also include, repeatedly for each position in the result buffer: determining, by each compute node of an operational group, whether the current position in the result buffer corresponds with the rank of the compute node; if the current position in the result buffer corresponds with the rank of the compute node, contributing, by that compute node, the compute node's contribution data; if the current position in the result buffer does not correspond with the rank of the compute node, contributing, by that compute node, a value of zero for the contribution data; and storing, by the logical root in the current position in the result buffer, results of a bitwise OR operation of all the contribution data by all compute nodes of the operational group for the current position, the results received through the global combining network.
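
The per-position procedure in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: the function name `gather_via_or`, the use of Python integers as contribution data, and the `width_mask` parameter are all hypothetical, and a plain `reduce` over a list stands in for the global combining network.

```python
from functools import reduce
from operator import or_


def gather_via_or(contributions, width_mask=0xFFFFFFFF):
    """Simulate the gather: contributions[r] is the data of the node with rank r.

    Hypothetical helper for illustration; width_mask is an assumed word size.
    """
    num_nodes = len(contributions)
    # The logical root configures one result-buffer position per ranked node.
    result_buffer = [0] * num_nodes
    for position in range(num_nodes):
        # Every node contributes: its own data if its rank matches the
        # current position, otherwise a value of zero.
        per_node = [
            (contributions[rank] & width_mask) if rank == position else 0
            for rank in range(num_nodes)
        ]
        # Stand-in for the global combining network: bitwise OR of all
        # contributions. OR with zero leaves the matching node's data intact.
        result_buffer[position] = reduce(or_, per_node, 0)
    return result_buffer


gathered = gather_via_or([0x0A, 0xFF, 0x00, 0x1234])
print(gathered)  # [10, 255, 0, 4660]
```

Because zero is the identity for bitwise OR, each position ends up holding exactly the contribution of the node whose rank matches it, which is why the reduction behaves as a gather.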

Inventors:
 Archer, Charles J [1]; Ratterman, Joseph D [1]
  1. Rochester, MN
Publication Date:
Research Org.:
International Business Machines Corp., Armonk, NY (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1039561
Patent Number(s):
8,140,826
Application Number:
11/754,740
Assignee:
International Business Machines Corporation (Armonk, NY)
DOE Contract Number:  
B519700
Resource Type:
Patent
Resource Relation:
Patent File Date: 2007 May 29
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Archer, Charles J, and Ratterman, Joseph D. Executing a gather operation on a parallel computer. United States: N. p., 2012. Web.
Archer, Charles J, & Ratterman, Joseph D. Executing a gather operation on a parallel computer. United States.
Archer, Charles J, and Ratterman, Joseph D. "Executing a gather operation on a parallel computer". United States. https://www.osti.gov/servlets/purl/1039561.
@article{osti_1039561,
title = {Executing a gather operation on a parallel computer},
author = {Archer, Charles J and Ratterman, Joseph D},
abstractNote = {Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer of the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and for storing contribution data gathered from that ranked node. Embodiments also include, repeatedly for each position in the result buffer: determining, by each compute node of an operational group, whether the current position in the result buffer corresponds with the rank of the compute node; if the current position in the result buffer corresponds with the rank of the compute node, contributing, by that compute node, the compute node's contribution data; if the current position in the result buffer does not correspond with the rank of the compute node, contributing, by that compute node, a value of zero for the contribution data; and storing, by the logical root in the current position in the result buffer, results of a bitwise OR operation of all the contribution data by all compute nodes of the operational group for the current position, the results received through the global combining network.},
url = {https://www.osti.gov/servlets/purl/1039561},
place = {United States},
year = {2012},
month = {3}
}

Works referenced in this record:

Computing parallel prefix and reduction using coterie structures
conference, January 1992

  • Herbordt, M. C.; Weems, C. C.
  • [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation
  • DOI: 10.1109/FMPC.1992.234895

Universality of mixed action extrapolation formulae
journal, April 2009

  • Chen, Jiunn-Wei; Walker-Loud, André; O'Connell, Donal
  • Journal of High Energy Physics, Vol. 2009, Issue 04
  • DOI: 10.1088/1126-6708/2009/04/090

Coprocessor design to support MPI primitives in configurable multiprocessors
journal, April 2007

  • Ziavras, Sotirios G.; Gerbessiotis, Alexandros V.; Bafna, Rohan
  • Integration, the VLSI Journal, Vol. 40, Issue 3, p. 235-252
  • DOI: 10.1016/j.vlsi.2005.10.001

Computing the Hough transform on a scan line array processor (image processing)
journal, March 1989

  • Fisher, A. L.; Highnam, P. T.
  • IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 11, Issue 3
  • DOI: 10.1109/34.21795

Extending the message passing interface (MPI)
conference, January 1995

  • Skjellum, A.; Doss, N. E.; Viswanathan, K.
  • Proceedings Scalable Parallel Libraries Conference
  • DOI: 10.1109/SPLC.1994.376998

Efficient MPI Collective Operations for Clusters in Long-and-Fast Networks
conference, September 2006

  • Matsuda, Motohiko; Kudoh, Tomohiro; Kodama, Yuetsu
  • 2006 IEEE International Conference on Cluster Computing
  • DOI: 10.1109/CLUSTR.2006.311848

Bandwidth Efficient All-reduce Operation on Tree Topologies
conference, March 2007

  • Patarasuk, Pitch; Yuan, Xin
  • 2007 IEEE International Parallel and Distributed Processing Symposium
  • DOI: 10.1109/IPDPS.2007.370405

Interleaved all-to-all reliable broadcast on meshes and hypercubes
journal, May 1994

  • Lee, Sunggu; Shin, K. G.
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 5, Issue 5
  • DOI: 10.1109/71.282556

Optimizing threaded MPI execution on SMP clusters
conference, January 2001

  • Tang, Hong; Yang, Tao
  • Proceedings of the 15th international conference on Supercomputing - ICS '01
  • DOI: 10.1145/377792.377895

Efficient algorithms for all-to-all communications in multiport message-passing systems
journal, January 1997

  • Bruck, J.; Kipnis, S.
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 8, Issue 11
  • DOI: 10.1109/71.642949