U.S. Department of Energy
Office of Scientific and Technical Information

The Spider Center Wide File System; From Concept to Reality

Conference
OSTI ID:1016038

The Leadership Computing Facility (LCF) at Oak Ridge National Laboratory (ORNL) has a diverse portfolio of computational resources, ranging from a petascale XT4/XT5 simulation system (Jaguar) to numerous other systems supporting development, visualization, and data analytics. To support the vastly different I/O needs of these systems, Spider, a Lustre-based center-wide file system, was designed and deployed to provide over 240 GB/s of aggregate throughput with over 10 petabytes of formatted capacity. A multi-stage InfiniBand network, dubbed the Scalable I/O Network (SION), with over 889 GB/s of bisection bandwidth, was deployed as part of Spider to provide connectivity to our simulation, development, visualization, and other platforms. To our knowledge, at the time of writing, Spider is the largest and fastest POSIX-compliant parallel file system in production. This paper details the overall architecture of the Spider system, the challenges in deploying and initially testing a file system of this scale, and novel solutions to these challenges, which offer key insights for future file system design.

Research Organization:
Oak Ridge National Laboratory (ORNL); Center for Computational Sciences
Sponsoring Organization:
SC USDOE - Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1016038
Country of Publication:
United States
Language:
English

Similar Records

Jaguar: The World's Most Powerful Computer
Conference · Wed Dec 31 23:00:00 EST 2008 · OSTI ID:965857

Lessons Learned in Deploying the World's Largest Scale Lustre File System
Conference · Thu Dec 31 23:00:00 EST 2009 · OSTI ID:1016043

A Next-Generation Parallel File System Environment for the OLCF
Conference · Sat Dec 31 23:00:00 EST 2011 · OSTI ID:1039646