---
code_id: 1155
site_ownership_code: "LLNL"
open_source: true
repository_link: "https://github.com/LLNL/scr"
project_type: "OS"
software_type: "S"
official_use_only: {}
developers:
- email: ""
  orcid: ""
  first_name: "A."
  last_name: "Moody"
  middle_name: ""
  affiliations: []
contributors: []
sponsoring_organizations:
- organization_name: "USDOE"
  funding_identifiers: []
  primary_award: "AC52-07NA27344"
  DOE: true
contributing_organizations: []
research_organizations:
- organization_name: "Lawrence Livermore National Laboratory"
  DOE: true
related_identifiers: []
release_date: "2009-02-23"
software_title: "The Scalable Checkpoint/Restart Library"
acronym: "SCR"
doi: "https://doi.org/10.11578/dc.20171025.1160"
description: "The Scalable Checkpoint/Restart (SCR) library provides an interface\
  \ that codes may use to write out and read in application-level checkpoints in\
  \ a scalable fashion. In the current implementation, checkpoint files are cached\
  \ in local storage (hard disk or RAM disk) on the compute nodes. This technique\
  \ provides scalable aggregate bandwidth and uses storage resources that are fully\
  \ dedicated to the job. This approach addresses the two common drawbacks of checkpointing\
  \ a large-scale application to a shared parallel file system, namely, limited bandwidth\
  \ and file system contention. In fact, on current platforms, SCR scales linearly\
  \ with the number of compute nodes. It has been benchmarked as high as 720 GB/s on\
  \ 1094 nodes of Atlas, which is nearly two orders of magnitude faster than the parallel\
  \ file system."
programming_languages: []
country_of_origin: "United States"
keywords: "ECP"
project_keywords:
- "ECP"
licenses:
- "Other"
proprietary_url: "https://github.com/LLNL/scr/blob/master/LICENSE.TXT"
recipient_org: "LLNL"
site_accession_number: "4349"
date_record_added: "2017-10-25"
date_record_updated: "2020-05-27"
is_file_certified: false
is_limited: false
links:
- rel: "citation"
  href: "https://www.osti.gov/doecode/biblio/1155"
