Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Development of parallel methods for a 1024-processor hypercube

Journal Article · · SIAM J. Sci. Stat. Comput.; (United States)
DOI:https://doi.org/10.1137/0909041· OSTI ID:6818878

The authors developed highly efficient parallel solutions for three practical, full-scale scientific problems: wave mechanics, fluid dynamics, and structural analysis. Several algorithmic techniques are used to keep communication and serial overhead small as both problem size and number of processors are varied. A new parameter, operation efficiency, is introduced that quantifies the tradeoff between communication and redundant computation. A 1024-processor MIMD ensemble is measured to be 502 to 637 times as fast as a single processor when problem size for the ensemble is fixed, and 1009 to 1020 times as fast as a single processor when problem size per processor is fixed. The latter measure, denoted scaled speedup, is developed and contrasted with the traditional measure of parallel speedup. The scaled-problem paradigm better reveals the capabilities of large ensembles, and permits detection of subtle hardware-induced load imbalances (such as error correction and data-dependent MFLOPS rates) that may become increasingly important as parallel processors increase in node counts. Sustained performance for the applications is 70 to 130 MFLOPS, validating the massively parallel ensemble approach as a practical alternative to more conventional processing methods. The techniques presented appear extensible to even higher levels of parallelism than the 1024-processor level explored here.

Research Organization:
Parallel Processing Div., 1413, Sandia National Labs., Albuquerque, NM (US)
OSTI ID:
6818878
Journal Information:
SIAM J. Sci. Stat. Comput.; (United States), Journal Name: SIAM J. Sci. Stat. Comput.; (United States) Vol. 9:4; ISSN SIJCD
Country of Publication:
United States
Language:
English