Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Collective Framework and Performance Optimizations to Open MPI for Cray XT Platforms

Conference ·
OSTI ID:1035529
The performance and scalability of collective operations plays a key role in the performance and scalability of many scientific applications. Within the Open MPI code base we have developed a general purpose hierarchical collective operations framework called Cheetah, and applied it at large scale on the Oak Ridge Leadership Computing Facility's Jaguar (OLCF) platform, obtaining better performance and scalability than the native MPI implementation. This paper discuss Cheetah's design and implementation, and optimizations to the framework for Cray XT 5 platforms. Our results show that the Cheetah's Broadcast and Barrier perform better than the native MPI implementation. For medium data, the Cheetah's Broadcast outperforms the native MPI implementation by 93% for 49,152 processes problem size. For small and large data, it out performs the native MPI implementation by 10% and 9%, respectively, at 24,576 processes problem size. The Cheetah's Barrier performs 10% better than the native MPI implementation for 12,288 processes problem size.
Research Organization:
Oak Ridge National Laboratory (ORNL)
Sponsoring Organization:
SC USDOE - Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1035529
Country of Publication:
United States
Language:
English

Similar Records

Design and Implementation of Broadcast Algorithms for Extreme-Scale Systems
Conference · Fri Dec 31 23:00:00 EST 2010 · OSTI ID:1042820

OPAL: An Open-Source MPI-IO Library over Cray XT
Conference · Sun Dec 31 23:00:00 EST 2006 · OSTI ID:931023

An Evaluation of Open MPI's Matching Transport Layer on the Cray XT
Conference · Sun Dec 31 23:00:00 EST 2006 · OSTI ID:1067288