skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Understanding I/O workload characteristics of a Peta-scale storage system

Journal Article · · Journal of Supercomputing
 [1];  [1]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). National Center for Computational Sciences

Understanding workload characteristics is critical for optimizing and improving the performance of current systems and software, and architecting new storage systems based on observed workload patterns. In this paper, we characterize the I/O workloads of scientific applications of one of the world's fastest high performance computing (HPC) storage cluster, Spider, at the Oak Ridge Leadership Computing Facility (OLCF). OLCF flagship petascale simulation platform, Titan, and other large HPC clusters, in total over 250 thousands compute cores, depend on Spider for their I/O needs. We characterize the system utilization, the demands of reads and writes, idle time, storage space utilization, and the distribution of read requests to write requests for the Peta-scale Storage Systems. From this study, we develop synthesized workloads, and we show that the read and write I/O bandwidth usage as well as the inter-arrival time of requests can be modeled as a Pareto distribution. We also study the I/O load imbalance problems using I/O performance data collected from the Spider storage system.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1185800
Journal Information:
Journal of Supercomputing, Vol. 71, Issue 3; ISSN 0920-8542
Publisher:
SpringerCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 12 works
Citation information provided by
Web of Science

References (8)

Workload Characterization and Performance Implications of Large-Scale Blog Servers journal November 2012
Characterization of storage workload traces from production Windows Servers conference October 2008
Evaluation of disk-level workloads at different time scales journal October 2009
Efficient management of idleness in storage systems journal June 2009
Internet Web servers: workload characterization and performance implications journal January 1997
ScalaTrace: Scalable compression and replay of communication traces for high-performance computing journal August 2009
Probabilistic Communication and I/O Tracing with Deterministic Replay at Scale conference September 2011
Understanding and improving computational science storage access through continuous characterization conference May 2011

Cited By (1)

Fair bandwidth allocating and strip-aware prefetching for concurrent read streams and striped RAIDs in distributed file systems journal May 2018