High Performance Storage System Scalability: Architecture, Implementation, and Experience
The High Performance Storage System (HPSS) provides scalable hierarchical storage management (HSM), archive, and file system services. Its design, implementation and current dominant use are focused on HSM and archive services. It is also a general-purpose, global, shared, parallel file system, potentially useful in other application domains. When HPSS design and implementation began over a decade ago, scientific computing power and storage capabilities at a site, such as a DOE national laboratory, was measured in a few 10s of gigaops, data archived in HSMs in a few 10s of terabytes at most, data throughput rates to an HSM in a few megabytes/s, and daily throughput with the HSM in a few gigabytes/day. At that time, the DOE national laboratories and IBM HPSS design team recognized that we were headed for a data storage explosion driven by computing power rising to teraops/petaops requiring data stored in HSMs to rise to petabytes and beyond, data transfer rates with the HSM to rise to gigabytes/s and higher, and daily throughput with a HSM in 10s of terabytes/day. This paper discusses HPSS architectural, implementation and deployment experiences that contributed to its success in meeting the above orders of magnitude scaling targets. We also discuss areas that need additional attention as we continue significant scaling into the future.
- Research Organization:
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- US Department of Energy (US)
- DOE Contract Number:
- W-7405-ENG-48
- OSTI ID:
- 15015911
- Report Number(s):
- UCRL-PROC-208872; TRN: US200509%%510
- Resource Relation:
- Conference: Presented at: 22nd IEEE 13th NASA Goddard Conference on Mass Storage Systems and Technologies, Monterey, CA (US), 04/11/2005--04/14/2005; Other Information: PBD: 5 Jan 2005
- Country of Publication:
- United States
- Language:
- English
Similar Records
POPM: A Distributed query system for high performance analysis of very large persistent object stores
dCache - Joining the noWORM Storage Club