skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scientific Application Performance on Candidate PetaScalePlatforms

Abstract

After a decade where HEC (high-end computing) capability was dominated by the rapid pace of improvements to CPU clock frequency, the performance of next-generation supercomputers is increasingly differentiated by varying interconnect designs and levels of integration. Understanding the tradeoffs of these system designs, in the context of high-end numerical simulations, is a key step towards making effective petascale computing a reality. This work represents one of the most comprehensive performance evaluation studies to date on modern HEC systems, including the IBM Power5, AMD Opteron, IBM BG/L, and Cray X1E. A novel aspect of our study is the emphasis on full applications, with real input data at the scale desired by computational scientists in their unique domain. We examine six candidate ultra-scale applications, representing a broad range of algorithms and computational structures. Our work includes the highest concurrency experiments to date on five of our six applications, including 32K processor scalability for two of our codes and describe several successful optimizations strategies on BG/L, as well as improved X1E vectorization. Overall results indicate that our evaluated codes have the potential to effectively utilize petascale resources; however, several applications will require reengineering to incorporate the additional levels of parallelism necessary to achievemore » the vast concurrency of upcoming ultra-scale systems.« less

Authors:
; ; ; ; ; ; ; ; ; ;
Publication Date:
Research Org.:
Ernest Orlando Lawrence Berkeley NationalLaboratory, Berkeley, CA (US)
Sponsoring Org.:
USDOE Director. Office of Science. Advanced ScientificComputing Research
OSTI Identifier:
932484
Report Number(s):
LBNL-62952
R&D Project: K11121; BnR: KJ0101030; TRN: US200813%%77
DOE Contract Number:
DE-AC02-05CH11231
Resource Type:
Conference
Resource Relation:
Conference: International Parallel and Distributed ProcessingSymposium (IPDPS), Long Beach, CA, March 24-30,2007
Country of Publication:
United States
Language:
English
Subject:
42; ALGORITHMS; EVALUATION; PERFORMANCE; PROCESSING; SUPERCOMPUTERS

Citation Formats

Oliker, Leonid, Canning, Andrew, Carter, Jonathan, Iancu, Costin, Lijewski, Michael, Kamil, Shoaib, Shalf, John, Shan, Hongzang, Strohmaier, Erich, Ethier, Stephane, and Goodale, Tom. Scientific Application Performance on Candidate PetaScalePlatforms. United States: N. p., 2007. Web.
Oliker, Leonid, Canning, Andrew, Carter, Jonathan, Iancu, Costin, Lijewski, Michael, Kamil, Shoaib, Shalf, John, Shan, Hongzang, Strohmaier, Erich, Ethier, Stephane, & Goodale, Tom. Scientific Application Performance on Candidate PetaScalePlatforms. United States.
Oliker, Leonid, Canning, Andrew, Carter, Jonathan, Iancu, Costin, Lijewski, Michael, Kamil, Shoaib, Shalf, John, Shan, Hongzang, Strohmaier, Erich, Ethier, Stephane, and Goodale, Tom. Mon . "Scientific Application Performance on Candidate PetaScalePlatforms". United States. doi:. https://www.osti.gov/servlets/purl/932484.
@article{osti_932484,
title = {Scientific Application Performance on Candidate PetaScalePlatforms},
author = {Oliker, Leonid and Canning, Andrew and Carter, Jonathan and Iancu, Costin and Lijewski, Michael and Kamil, Shoaib and Shalf, John and Shan, Hongzang and Strohmaier, Erich and Ethier, Stephane and Goodale, Tom},
abstractNote = {After a decade where HEC (high-end computing) capability was dominated by the rapid pace of improvements to CPU clock frequency, the performance of next-generation supercomputers is increasingly differentiated by varying interconnect designs and levels of integration. Understanding the tradeoffs of these system designs, in the context of high-end numerical simulations, is a key step towards making effective petascale computing a reality. This work represents one of the most comprehensive performance evaluation studies to date on modern HEC systems, including the IBM Power5, AMD Opteron, IBM BG/L, and Cray X1E. A novel aspect of our study is the emphasis on full applications, with real input data at the scale desired by computational scientists in their unique domain. We examine six candidate ultra-scale applications, representing a broad range of algorithms and computational structures. Our work includes the highest concurrency experiments to date on five of our six applications, including 32K processor scalability for two of our codes and describe several successful optimizations strategies on BG/L, as well as improved X1E vectorization. Overall results indicate that our evaluated codes have the potential to effectively utilize petascale resources; however, several applications will require reengineering to incorporate the additional levels of parallelism necessary to achieve the vast concurrency of upcoming ultra-scale systems.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Mon Jan 01 00:00:00 EST 2007},
month = {Mon Jan 01 00:00:00 EST 2007}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: