Executing a gather operation on a parallel computer

Archer, Charles J; Ratterman, Joseph D

Title: Executing a gather operation on a parallel computer

Patent · Tue Mar 20 00:00:00 EDT 2012

OSTI ID:1039561

Archer, Charles J ^[1]; Ratterman, Joseph D ^[1]

Rochester, MN

Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer or the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and for storing contribution data gathered from that ranked node. Embodiments also include repeatedly for each position in the result buffer: determining, by each compute node of an operational group, whether the current position in the result buffer corresponds with the rank of the compute node, if the current position in the result buffer corresponds with the rank of the compute node, contributing, by that compute node, the compute node's contribution data, if the current position in the result buffer does not correspond with the rank of the compute node, contributing, by that compute node, a value of zero for the contribution data, and storing, by the logical root in the current position in the result buffer, results of a bitwise OR operation of all the contribution data by all compute nodes of the operational group for the current position, the results received through the global combining network.

View Patent

Cite

Export

Save

Research Organization:: International Business Machines Corp., Armonk, NY (United States)

Sponsoring Organization:: USDOE

DOE Contract Number:: B519700

Assignee:: International Business Machines Corporation (Armonk, NY)

Patent Number(s):: 8,140,826

Application Number:: 11/754,740

OSTI ID:: 1039561

Resource Relation:: Patent File Date: 2007 May 29

Country of Publication:: United States

Language:: English

References (11)

Computing parallel prefix and reduction using coterie structures Herbordt, M. C.; Weems, C. C. [1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation https://doi.org/10.1109/FMPC.1992.234895	conference	January 1992
Universality of mixed action extrapolation formulae Chen, Jiunn-Wei; Walker-Loud, André; O'Connell, Donal Journal of High Energy Physics, Vol. 2009, Issue 04 https://doi.org/10.1088/1126-6708/2009/04/090	journal	April 2009
Coprocessor design to support MPI primitives in configurable multiprocessors Ziavras, Sotirios G.; Gerbessiotis, Alexandros V.; Bafna, Rohan Integration, the VLSI Journal, Vol. 40, Issue 3, p. 235-252 https://doi.org/10.1016/j.vlsi.2005.10.001	journal	April 2007
Computing the Hough transform on a scan line array processor (image processing) Fisher, A. L.; Highnam, P. T. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 11, Issue 3 https://doi.org/10.1109/34.21795	journal	March 1989
Extending the message passing interface (MPI) Skjellum, A.; Doss, N. E.; Viswanathan, K. Proceedings Scalable Parallel Libraries Conference https://doi.org/10.1109/SPLC.1994.376998	conference	January 1995
Efficient MPI Collective Operations for Clusters in Long-and-Fast Networks Matsuda, Motohiko; Kudoh, Tomohiro; Kodama, Yuetsu 2006 IEEE International Conference on Cluster Computing https://doi.org/10.1109/CLUSTR.2006.311848	conference	September 2006
Bandwidth Efficient All-reduce Operation on Tree Topologies Patarasuk, Pitch; Yuan, Xin 2007 IEEE International Parallel and Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2007.370405	conference	March 2007
Interleaved all-to-all reliable broadcast on meshes and hypercubes IEEE Transactions on Parallel and Distributed Systems, Vol. 5, Issue 5 https://doi.org/10.1109/71.282556	journal	May 1994
Optimizing threaded MPI execution on SMP clusters Tang, Hong; Yang, Tao Proceedings of the 15th international conference on Supercomputing - ICS '01 https://doi.org/10.1145/377792.377895	conference	January 2001
An All-Reduce Operation in Star Networks Using All-to-All Broadcast Communication Pattern Oh, Eunseuk; Choi, Hongsik; Primeaux, David Lecture Notes in Computer Science https://doi.org/10.1007/11428831_52	book	January 2005
Efficient algorithms for all-to-all communications in multiport message-passing systems Bruck, J.; Kipnis, S. IEEE Transactions on Parallel and Distributed Systems, Vol. 8, Issue 11 https://doi.org/10.1109/71.642949	journal	January 1997

Similar Records

Executing scatter operation to parallel computer nodes by repeatedly broadcasting content of send buffer partition corresponding to each node upon bitwise OR operation

Patent · Fri Nov 06 00:00:00 EST 2009 · OSTI ID:1039561

Archer, Charles J; Ratterman, Joseph D

Effecting a broadcast with an allreduce operation on a parallel computer

Patent · Tue Nov 02 00:00:00 EDT 2010 · OSTI ID:1039561

Almasi, Gheorghe; Archer, Charles J; Ratterman, Joseph D; +1 more

Identifying a largest logical plane from a plurality of logical planes formed of compute nodes of a subcommunicator in a parallel computer

Patent · Tue Jul 12 00:00:00 EDT 2016 · OSTI ID:1039561

Davis, Kristan D.; Faraj, Daniel A.

Related Subjects

97 MATHEMATICS AND COMPUTING

Title: Executing a gather operation on a parallel computer

Citation Formats

References (11)

Similar Records

Related Subjects