Fenix, A Fault Tolerant Programming Framework for MPI Applications
- Rutgers University
- Sandia National Laboratories
- parashar@rutgers.edu
Fenix provides APIs that let users add fault tolerance to MPI-based parallel programs in a transparent manner. A Fenix-enabled program can run through process failures during execution by drawing on a pool of spare processes that Fenix manages.
- Short Name / Acronym:
- Fenix
- Project Type:
- Open Source, Publicly Available Repository
- Site Accession Number:
- SCR #2147; 7297
- Software Type:
- Scientific
- License(s):
- BSD 3-clause "New" or "Revised" License
- Programming Language(s):
- C, Fortran77/90, C++
- Research Organization:
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE
- Primary Award/Contract Number:
- AC04-94AL85000
- DOE Contract Number:
- AC04-94AL85000
- Code ID:
- 45238
- OSTI ID:
- 1336604
- Country of Origin:
- United States
Similar Records
Fault Tolerance Assistant (FTA): An Exception Handling Programming Model for MPI Applications
Technical Report · May 23, 2016 · OSTI ID: 1336604
Evaluating Online Global Recovery with Fenix Using Application-Aware In-Memory Checkpointing Techniques
Conference · August 1, 2016 · OSTI ID: 1336604
Specification of Fenix MPI Fault Tolerance library (V.0.9)
Technical Report · June 7, 2016 · OSTI ID: 1336604