Exploration of cache behavior using HPSS per-file transfer logs
- LBNL Library
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) system at the National Energy Research Scientific Computing Center(NERSC) and analyzed them to assess workload behavior and gain some insight into which cache configurations would provide the best service to the users. We found, as expected, that the workload is distributed over file size with a declining number of files as the files get larger, so the amount of space consumed per file size increment is roughly constant up to file sizes of 1 GB. Sixty one percent of file accesses were write accesses. There are a significant number of files written which are never read -- backup files and similar files. For all sizes of files, access frequencies decline with the age of the files. HPSS uses the cache as an I/O buffer for incoming data. At our installation the cache behavior is dominated by the write traffic. Cache lifetimes tend to scale linearly with the size of the cache and inversely with the amount of data flow.
- Research Organization:
- Ernest Orlando Lawrence Berkeley National Lab., CA (US)
- Sponsoring Organization:
- USDOE Director, Office of Science. Office of Advanced Scientific Computing Research. Mathematical, Information, and Computational Sciences Division (US)
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 793764
- Report Number(s):
- LBNL--49330
- Country of Publication:
- United States
- Language:
- English
Similar Records
Large File System Backup: NERSC Global File System Experience
Effectiveness and predictability of in-network storage cache for Scientific Workflows
Using MPI File Caching to Improve Parallel Write Performance for Large-Scale Scientific Applications
Technical Report
·
Thu Oct 23 00:00:00 EDT 2008
·
OSTI ID:941678
Effectiveness and predictability of in-network storage cache for Scientific Workflows
Conference
·
Tue Feb 21 23:00:00 EST 2023
·
OSTI ID:2997068
Using MPI File Caching to Improve Parallel Write Performance for Large-Scale Scientific Applications
Conference
·
Sun Dec 31 23:00:00 EST 2006
·
OSTI ID:1054965