Performing an allreduce operation on a plurality of compute nodes of a parallel computer

Faraj, Ahmad

Abstract

Methods, apparatus, and products are disclosed for performing an allreduce operation on a plurality of compute nodes of a parallel computer, each node including at least two processing cores, that include: performing, for each node, a local reduction operation using allreduce contribution data for the cores of that node, yielding, for each node, a local reduction result for one or more representative cores for that node; establishing one or more logical rings among the nodes, each logical ring including only one of the representative cores from each node; performing, for each logical ring, a global allreduce operation using the local reduction result for the representative cores included in that logical ring, yielding a global allreduce result for each representative core included in that logical ring; and performing, for each node, a local broadcast operation using the global allreduce results for each representative core on that node.
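The four steps named in the abstract (local reduction, ring formation, per-ring global allreduce, local broadcast) can be sketched as a single-process simulation. This is only an illustration under stated assumptions, not the patented implementation: the function names `ring_allreduce` and `hierarchical_allreduce` are hypothetical, the reduction operator is assumed to be commutative and associative, the choice to give each ring a contiguous slice of the vector is an assumption, and the ring step uses a simple pass-around scheme rather than whatever algorithm the patent specifies.

```python
from functools import reduce
import operator


def ring_allreduce(values, op):
    """Simulate a simple allreduce over one logical ring.

    values[i] is the vector held by ring member i. In each of n-1 steps
    every member forwards the vector it last received to its right-hand
    neighbour and folds the incoming vector into its accumulator.
    (Assumed scheme; op must be commutative and associative.)
    """
    n = len(values)
    acc = [list(v) for v in values]
    in_flight = [list(v) for v in values]
    for _ in range(n - 1):
        # Member i receives what member (i - 1) mod n sent.
        in_flight = [in_flight[(i - 1) % n] for i in range(n)]
        acc = [[op(a, b) for a, b in zip(acc[i], in_flight[i])]
               for i in range(n)]
    return acc


def hierarchical_allreduce(contributions, num_rings=1, op=operator.add):
    """contributions[n][c] is the vector contributed by core c of node n.

    num_rings representative cores are used per node; ring r handles
    slice r of the vector (an assumption made for this sketch).
    """
    num_nodes = len(contributions)
    vec_len = len(contributions[0][0])
    bounds = [vec_len * r // num_rings for r in range(num_rings + 1)]

    # Step 1: local reduction on each node, one slice per representative.
    local = [
        [[reduce(op, (core[i] for core in node))
          for i in range(bounds[r], bounds[r + 1])]
         for r in range(num_rings)]
        for node in contributions
    ]

    # Steps 2 and 3: ring r contains representative r of every node;
    # run a global allreduce on each ring independently.
    per_ring = [
        ring_allreduce([local[n][r] for n in range(num_nodes)], op)
        for r in range(num_rings)
    ]

    # Step 4: local broadcast. Every core of a node receives the
    # concatenation of its representatives' global results.
    results = []
    for n, node in enumerate(contributions):
        full = [x for r in range(num_rings) for x in per_ring[r][n]]
        results.append([list(full) for _ in node])
    return results
```

For example, with two nodes of two cores each contributing `[1, 2, 3, 4]`, `[10, 20, 30, 40]`, `[100, 200, 300, 400]`, and `[1000, 2000, 3000, 4000]`, every core ends up holding `[1111, 2222, 3333, 4444]` whether one ring or two rings are used.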

Inventors: Faraj, Ahmad

Issue Date: February 12, 2013

Research Org.: International Business Machines Corp., Armonk, NY (United States)

Sponsoring Org.: USDOE

OSTI Identifier: 1082948

Patent Number(s): 8375197

Application Number: 12/124,763

Assignee: International Business Machines Corporation (Armonk, NY)

Patent Classifications (CPCs): G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING; G06F9/5061 - {Partitioning or combining of resources}

DOE Contract Number: B554331

Resource Type: Patent

Country of Publication: United States

Language: English

Subject: 97 MATHEMATICS AND COMPUTING

Citation Formats

Faraj, Ahmad. Performing an allreduce operation on a plurality of compute nodes of a parallel computer. United States: N. p., 2013. Web.

Faraj, Ahmad. Performing an allreduce operation on a plurality of compute nodes of a parallel computer. United States.

Faraj, Ahmad. 2013. "Performing an allreduce operation on a plurality of compute nodes of a parallel computer". United States. https://www.osti.gov/servlets/purl/1082948.
                    
@article{osti_1082948,
  title        = {Performing an allreduce operation on a plurality of compute nodes of a parallel computer},
  author       = {Faraj, Ahmad},
  abstractNote = {Methods, apparatus, and products are disclosed for performing an allreduce operation on a plurality of compute nodes of a parallel computer, each node including at least two processing cores, that include: performing, for each node, a local reduction operation using allreduce contribution data for the cores of that node, yielding, for each node, a local reduction result for one or more representative cores for that node; establishing one or more logical rings among the nodes, each logical ring including only one of the representative cores from each node; performing, for each logical ring, a global allreduce operation using the local reduction result for the representative cores included in that logical ring, yielding a global allreduce result for each representative core included in that logical ring; and performing, for each node, a local broadcast operation using the global allreduce results for each representative core on that node.},
  place        = {United States},
  year         = {2013},
  month        = feb
}
