Storing files in a parallel computing system using list-based index to identify replica files
Abstract
Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.
- Inventors:
- Issue Date:
- Research Org.:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1195932
- Patent Number(s):
- 9087075
- Application Number:
- 13/536,331
- Assignee:
- EMC Corporation (Hopkinton, MA)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC52-06NA25396
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2012 Jun 28
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States: N. p., 2015.
Web.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, & Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Tue .
"Storing files in a parallel computing system using list-based index to identify replica files". United States. https://www.osti.gov/servlets/purl/1195932.
@article{osti_1195932,
title = {Storing files in a parallel computing system using list-based index to identify replica files},
author = {Faibish, Sorin and Bent, John M. and Tzelnic, Percy and Zhang, Zhenhua and Grider, Gary},
abstractNote = {Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {7}
}
Works referenced in this record:
A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications
conference, January 2004
- Hong Tang, ; Gulbeden, A.
- Proceedings of the ACM/IEEE SC2004 Conference
PLFS: a checkpoint filesystem for parallel applications
conference, January 2009
- Bent, John; Gibson, Garth; Grider, Gary
Storage management system with file aggregation
patent, August 2000
- Cannon, David M.; Kaczmarski, Michael Allen
- US Patent Document 6,098,074
Leasing scheme for data-modifying operations
patent, June 2006
- Ghemawat, Sanjay; Gobioff, Howard; Leung, Shun-Tak
- US Patent Document 7,065,618
Filesystem-aware block storage system, apparatus, and method
patent, January 2011
- Terry, Julian M.; Clarkson, Neil A.; Barrall, Geoffrey S.
- US Patent Document 7,873,782
Method and system of providing replica files within a fileset
patent, August 2011
- Shah, Aalop; Borate, Milind; Rajan, Basant
- US Patent Document 7,996,361
Systems and methods for managing portions of files in multi-tier storage systems
patent, January 2013
- Mamidi, Murthy V.; Malige, Raghupathi; Ravi, Gautham
- US Patent Document 8,352,429
Distributed file system and method of operating a distributed file system
patent-application, May 2006
- Tichy, Walter; Isaila, Florin
- US Patent Application 10/491459; 20060101025
Maintaining active-only copy storage pools
patent-application, February 2007
- Cannon, David Maxwell; Martin, Howard Newton
- US Patent Application 11/224852; 20070043789
Works referencing / citing this record:
Storing files in a parallel computing system using list-based index to identify replica files
patent, July 2015
- Faibish, Sorin; Bent, John M.; Tzelnic, Percy
- US Patent Document 9,087,075