Wide-area implementation of the message passing interface.
The Message Passing Interface (MPI) can be used as a portable, high-performance programming model for wide-area computing systems. The wide-area environment introduces challenging problems for the MPI implementor, due to the heterogeneity of both the underlying physical infrastructure and the software environment at different sites. In this article, we describe an MPI implementation that incorporates solutions to these problems. This implementation has been constructed by extending the Argonne MPICH implementation of MPI to use communication services provided by the Nexus communication library and authentication, resource allocation, process creation/management, and information services provided by the I-Soft system (initially) and the Globus metacomputing toolkit (work in progress). Nexus provides multimethod communication mechanisms that allow multiple communication methods to be used in a single computation with a uniform interface; I-Soft and Globus provided standard authentication, resource management, and process management mechanisms. We describe how these various mechanisms are supported in the Nexus implementation of MPI and present performance results for this implementation on multicomputers and networked systems. We also discuss how more advanced services provided by the Globus metacomputing toolkit are being used to construct a second-generation wide-area MPI.
- Research Organization:
- Argonne National Lab. (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC); National Science Foundation (NSF)
- DOE Contract Number:
- DE-AC02-06CH11357
- OSTI ID:
- 943164
- Report Number(s):
- ANL/MCS/JA-37949; PACOEJ; TRN: US201002%%643
- Journal Information:
- Parallel Comput., Vol. 24, Issue 1998; ISSN 0167-8191
- Country of Publication:
- United States
- Language:
- ENGLISH
Similar Records
MPICH-G2 : a grid-enabled implementation of the message passing interface.
A grid-enabled MPI : message passing in heterogeneous distributed computing systems.