Enabling communication concurrency through flexible MPI endpoints
- Intel Corporation, Hudson, MA (United States)
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Cisco Systems Inc., San Jose, CA (United States)
- International Business Machines Corporation, Rochester, MN (United States)
MPI defines a one-to-one relationship between MPI processes and ranks. This model captures many use cases effectively; however, it also limits communication concurrency and interoperability between MPI and programming models that utilize threads. This paper describes the MPI endpoints extension, which relaxes the longstanding one-to-one relationship between MPI processes and ranks. Using endpoints, an MPI implementation can map separate communication contexts to threads, allowing them to drive communication independently. Endpoints also make threads addressable in MPI operations, enhancing interoperability between MPI and other programming models. We illustrate these characteristics through several examples and an empirical study that contrasts current multithreaded MPI communication performance with the high degree of communication concurrency needed to achieve peak communication performance.
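The usage model described in the abstract is easiest to see in code. The sketch below follows the MPI_Comm_create_endpoints interface proposed in the authors' earlier endpoints work (cited in the references below); it is not part of the MPI standard, and the thread count and per-thread communication are illustrative assumptions.

```c
/* Minimal sketch of the endpoints usage model, assuming the
 * MPI_Comm_create_endpoints extension proposed by the authors.
 * This routine is NOT part of the MPI standard. */
#include <mpi.h>
#include <omp.h>

#define NUM_THREADS 4

int main(int argc, char **argv) {
    int provided;
    MPI_Comm ep_comm[NUM_THREADS];

    MPI_Init_thread(&argc, &argv, MPI_THREAD_MULTIPLE, &provided);

    /* Each MPI process requests NUM_THREADS endpoints on the parent
     * communicator; every endpoint receives its own rank in the
     * resulting endpoints communicator. */
    MPI_Comm_create_endpoints(MPI_COMM_WORLD, NUM_THREADS,
                              MPI_INFO_NULL, ep_comm);

    #pragma omp parallel num_threads(NUM_THREADS)
    {
        int tid = omp_get_thread_num();
        int ep_rank;

        /* Each thread attaches to its own endpoint communicator and
         * can drive communication independently of the others. */
        MPI_Comm_rank(ep_comm[tid], &ep_rank);

        /* ... per-thread sends/receives on ep_comm[tid] ... */

        MPI_Comm_free(&ep_comm[tid]);
    }

    MPI_Finalize();
    return 0;
}
```

Because each endpoint carries its own rank in the endpoints communicator, a thread can be named directly as the source or target of an MPI operation, which is what enables the interoperability with threaded programming models that the abstract describes.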
- Research Organization:
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1140752
- Report Number(s):
- SAND2014-0614J; 498469
- Journal Information:
- International Journal of High Performance Computing Applications, Vol. 28, Issue 4; ISSN 1094-3420
- Publisher:
- SAGE
- Country of Publication:
- United States
- Language:
- English
References (Web of Science)
- The impact of hybrid-core processors on MPI message rate (conference, January 2013)
- Hybrid PGAS runtime support for multicore nodes (conference, January 2010)
- Generalized communicators in the message passing interface (journal, June 2001)
- Enabling MPI interoperability through flexible communication endpoints (conference, January 2013)
- Hybrid parallel programming with MPI and unified parallel C (conference, January 2010)
- Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems (book, January 2010)
- Ownership passing: efficient distributed memory programming on multi-core systems (conference, January 2013)
- Dynamic Communicators in MPI (book, January 2009)
- MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory (journal, May 2013)
- Supporting Hybrid MPI and OpenSHMEM over InfiniBand: Design and Performance Evaluation (conference, September 2012)
- Unifying UPC and MPI runtimes: experience with MVAPICH (conference, January 2010)
- FG-MPI: Fine-grain MPI for multicore and clusters (conference, April 2010)
- PAMI: A Parallel Active Message Interface for the Blue Gene/Q Supercomputer (conference, May 2012)
- NUMA-aware shared-memory collective communication for MPI (conference, January 2013)
- Multi-threaded UPC runtime with network endpoints: Design alternatives and evaluation on multi-core architectures (conference, December 2011)
- Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes (conference, February 2009)
- Development of Mixed Mode MPI / OpenMP Applications (journal, January 2001)
- Extending MPI to accelerators (conference, January 2011)
- Network Endpoints for Clusters of SMPs (conference, October 2012)
- Compact and Efficient Implementation of the MPI Group Operations (book, January 2010)
- Evaluating NIC hardware requirements to achieve high message rate PGAS support on multi-core processors (conference, January 2007)
- MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters (journal, April 2011)
- Portable, MPI-interoperable coarray fortran (conference, January 2014)
Similar Records
Optimizing point‐to‐point communication between adaptive MPI endpoints in shared memory
Test suite for evaluating performance of multithreaded MPI communication.