OSTI.GOV U.S. Department of Energy
Office of Scientific and Technical Information

Title: ASC ATDM Level 2 Milestone #5325: Asynchronous Many-Task Runtime System Analysis and Assessment for Next Generation Platforms

Abstract

This report provides in-depth information and analysis to help create a technical road map for developing next-generation programming models and runtime systems that support Advanced Simulation and Computing (ASC) workload requirements. The focus herein is on asynchronous many-task (AMT) models and runtime systems, which are of great interest in the context of "exascale" computing, as they hold the promise to address key issues associated with future extreme-scale computer architectures. This report includes a thorough qualitative and quantitative examination of three best-of-class AMT runtime systems (Charm++, Legion, and Uintah), all of which are in use as part of the ASC Predictive Science Academic Alliance Program II (PSAAP-II) Centers. The studies focus on each of the runtimes' programmability, performance, and mutability. Through the experiments and analysis presented, several overarching findings emerge. From a performance perspective, AMT runtimes show tremendous potential for addressing extreme-scale challenges. Empirical studies show an AMT runtime can mitigate performance heterogeneity inherent to the machine itself and that Message Passing Interface (MPI) and AMT runtimes perform comparably under balanced conditions. From a programmability and mutability perspective, however, none of the runtimes in this study are currently ready for use in developing production-ready Sandia ASC applications. The report concludes by recommending a co-design path forward, wherein application, programming model, and runtime system developers work together to define requirements and solutions. Such a requirements-driven co-design approach benefits the high-performance computing community as a whole, with widespread community engagement mitigating risk for both application developers and runtime system developers.

Authors:
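Baker, Gavin Matthew [1]; Bettencourt, Matthew Tyler [1]; Bova, Steven W. [1]; Franko, Ken [1]; Gamell, Marc [1]; Grant, Ryan [1]; Hammond, Simon David [1]; Hollman, David S. [1]; Knight, Samuel [1]; Kolla, Hemanth [1]; Lin, Paul [1]; Olivier, Stephen Lecler [1]; Sjaardema, Gregory D. [1]; Slattengren, Nicole Lemaster [1]; Teranishi, Keita [1]; Wilke, Jeremiah J. [1]; Bennett, Janine Camille [1]; Clay, Robert L. [1]; Kale, Laxkimant [2]; Jain, Nikhil [2]; Mikida, Eric [2]; Aiken, Alex [3]; Bauer, Michael [3]; Lee, Wonchan [3]; Slaughter, Elliott [3]; Treichler, Sean [3]; Berzins, Martin [4]; Harman, Todd [4]; Humphreys, Alan [4]; Schmidt, John [4]; Sunderland, Dan [4]; Mccormick, Pat [5]; Gutierrez, Samuel [5]; Shulz, Martin [6]; Gamblin, Todd [6]; Bremer, Peer-Timo [6]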
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  2. Univ. of Illinois, Urbana Champaign, IL (United States)
  3. Stanford Univ., Stanford, CA (United States)
  4. Univ. of Utah, Salt Lake City, UT (United States)
  5. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
  6. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Publication Date:
Research Org.:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Org.:
USDOE National Nuclear Security Administration (NNSA)
OSTI Identifier:
1432926
Report Number(s):
SAND-2015-8312
662088
DOE Contract Number:  
AC04-94AL85000
Resource Type:
Technical Report
Country of Publication:
United States
Language:
English

Citation Formats

Baker, Gavin Matthew, Bettencourt, Matthew Tyler, Bova, Steven W., Franko, Ken, Gamell, Marc, Grant, Ryan, Hammond, Simon David, Hollman, David S., Knight, Samuel, Kolla, Hemanth, Lin, Paul, Olivier, Stephen Lecler, Sjaardema, Gregory D., Slattengren, Nicole Lemaster, Teranishi, Keita, Wilke, Jeremiah J., Bennett, Janine Camille, Clay, Robert L., Kale, Laxkimant, Jain, Nikhil, Mikida, Eric, Aiken, Alex, Bauer, Michael, Lee, Wonchan, Slaughter, Elliott, Treichler, Sean, Berzins, Martin, Harman, Todd, Humphreys, Alan, Schmidt, John, Sunderland, Dan, Mccormick, Pat, Gutierrez, Samuel, Shulz, Martin, Gamblin, Todd, and Bremer, Peer-Timo. ASC ATDM Level 2 Milestone #5325: Asynchronous Many-Task Runtime System Analysis and Assessment for Next Generation Platforms. United States: N. p., 2015. Web. doi:10.2172/1432926.
Baker, Gavin Matthew, Bettencourt, Matthew Tyler, Bova, Steven W., Franko, Ken, Gamell, Marc, Grant, Ryan, Hammond, Simon David, Hollman, David S., Knight, Samuel, Kolla, Hemanth, Lin, Paul, Olivier, Stephen Lecler, Sjaardema, Gregory D., Slattengren, Nicole Lemaster, Teranishi, Keita, Wilke, Jeremiah J., Bennett, Janine Camille, Clay, Robert L., Kale, Laxkimant, Jain, Nikhil, Mikida, Eric, Aiken, Alex, Bauer, Michael, Lee, Wonchan, Slaughter, Elliott, Treichler, Sean, Berzins, Martin, Harman, Todd, Humphreys, Alan, Schmidt, John, Sunderland, Dan, Mccormick, Pat, Gutierrez, Samuel, Shulz, Martin, Gamblin, Todd, & Bremer, Peer-Timo. ASC ATDM Level 2 Milestone #5325: Asynchronous Many-Task Runtime System Analysis and Assessment for Next Generation Platforms. United States. doi:10.2172/1432926.
Baker, Gavin Matthew, Bettencourt, Matthew Tyler, Bova, Steven W., Franko, Ken, Gamell, Marc, Grant, Ryan, Hammond, Simon David, Hollman, David S., Knight, Samuel, Kolla, Hemanth, Lin, Paul, Olivier, Stephen Lecler, Sjaardema, Gregory D., Slattengren, Nicole Lemaster, Teranishi, Keita, Wilke, Jeremiah J., Bennett, Janine Camille, Clay, Robert L., Kale, Laxkimant, Jain, Nikhil, Mikida, Eric, Aiken, Alex, Bauer, Michael, Lee, Wonchan, Slaughter, Elliott, Treichler, Sean, Berzins, Martin, Harman, Todd, Humphreys, Alan, Schmidt, John, Sunderland, Dan, Mccormick, Pat, Gutierrez, Samuel, Shulz, Martin, Gamblin, Todd, and Bremer, Peer-Timo. Tue . "ASC ATDM Level 2 Milestone #5325: Asynchronous Many-Task Runtime System Analysis and Assessment for Next Generation Platforms". United States. doi:10.2172/1432926. https://www.osti.gov/servlets/purl/1432926.
@article{osti_1432926,
title = {ASC ATDM Level 2 Milestone #5325: Asynchronous Many-Task Runtime System Analysis and Assessment for Next Generation Platforms},
author = {Baker, Gavin Matthew and Bettencourt, Matthew Tyler and Bova, Steven W. and Franko, Ken and Gamell, Marc and Grant, Ryan and Hammond, Simon David and Hollman, David S. and Knight, Samuel and Kolla, Hemanth and Lin, Paul and Olivier, Stephen Lecler and Sjaardema, Gregory D. and Slattengren, Nicole Lemaster and Teranishi, Keita and Wilke, Jeremiah J. and Bennett, Janine Camille and Clay, Robert L. and Kale, Laxkimant and Jain, Nikhil and Mikida, Eric and Aiken, Alex and Bauer, Michael and Lee, Wonchan and Slaughter, Elliott and Treichler, Sean and Berzins, Martin and Harman, Todd and Humphreys, Alan and Schmidt, John and Sunderland, Dan and Mccormick, Pat and Gutierrez, Samuel and Shulz, Martin and Gamblin, Todd and Bremer, Peer-Timo},
abstractNote = {This report provides in-depth information and analysis to help create a technical road map for developing next-generation programming models and runtime systems that support Advanced Simulation and Computing (ASC) workload requirements. The focus herein is on asynchronous many-task (AMT) models and runtime systems, which are of great interest in the context of "exascale" computing, as they hold the promise to address key issues associated with future extreme-scale computer architectures. This report includes a thorough qualitative and quantitative examination of three best-of-class AMT runtime systems (Charm++, Legion, and Uintah), all of which are in use as part of the ASC Predictive Science Academic Alliance Program II (PSAAP-II) Centers. The studies focus on each of the runtimes' programmability, performance, and mutability. Through the experiments and analysis presented, several overarching findings emerge. From a performance perspective, AMT runtimes show tremendous potential for addressing extreme-scale challenges. Empirical studies show an AMT runtime can mitigate performance heterogeneity inherent to the machine itself and that Message Passing Interface (MPI) and AMT runtimes perform comparably under balanced conditions. From a programmability and mutability perspective, however, none of the runtimes in this study are currently ready for use in developing production-ready Sandia ASC applications. The report concludes by recommending a co-design path forward, wherein application, programming model, and runtime system developers work together to define requirements and solutions. Such a requirements-driven co-design approach benefits the high-performance computing community as a whole, with widespread community engagement mitigating risk for both application developers and runtime system developers.},
doi = {10.2172/1432926},
journal = {},
number = {},
volume = {},
place = {United States},
year = {2015},
month = {9}
}
