OSTI.GOV
U.S. Department of Energy, Office of Scientific and Technical Information

Title: Scalable augmented enumeration and metadata operations for large filesystems

Patent · OSTI ID: 1860146

Systems, apparatus, and methods are disclosed for performing scalable operations in a file system. Metadata entries in a namespace or directory tree are sharded across multiple file metadata servers. An augmented enumeration operation, such as listing a directory, is parallelized across the multiple file metadata servers, transparently to clients. Exemplary augmentation features can include filtering and sorting. Augmentation features can be executed concurrently with enumeration, prior to enumeration, after enumeration, or as a combination of these, and can utilize pre-built index structures or holding structures for intermediate results. Augmented enumeration operations can also include no-output operations such as changing file attributes or deleting a file, and cumulative operations such as counting total disk space usage. The parallelization is compatible with tree-level parallelization and storage-level parallelization. Disclosed technologies can be applied to other fields requiring scalable enumeration, such as database and network applications.
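
The abstract's core idea, a directory listing fanned out across sharded metadata servers with filtering and sorting applied along the way, can be illustrated with a minimal sketch. This is not the patented implementation: the class names (`MetadataShard`, `ShardedNamespace`), the CRC32-based placement, and the in-memory dictionaries standing in for metadata servers are all assumptions made for illustration. Each shard filters and sorts its own entries, and the client-facing call merges the pre-sorted partial results; a cumulative operation (total size) is shown the same way.

```python
import heapq
import zlib
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    name: str
    size: int


class MetadataShard:
    """Hypothetical in-memory stand-in for one file metadata server."""

    def __init__(self):
        self.entries = {}  # directory path -> list of Entry held by this shard

    def add(self, directory, entry):
        self.entries.setdefault(directory, []).append(entry)

    def enumerate(self, directory, predicate):
        # Filter and sort locally, so the augmentation work is spread over shards.
        matches = [e for e in self.entries.get(directory, []) if predicate(e)]
        return sorted(matches, key=lambda e: e.name)

    def usage(self, directory):
        return sum(e.size for e in self.entries.get(directory, []))


class ShardedNamespace:
    """Presents one namespace to the caller; sharding stays hidden from clients."""

    def __init__(self, num_shards):
        self.shards = [MetadataShard() for _ in range(num_shards)]

    def _shard_for(self, directory, name):
        # Assumed placement policy: hash of the full path modulo shard count.
        key = f"{directory}/{name}".encode()
        return self.shards[zlib.crc32(key) % len(self.shards)]

    def create(self, directory, name, size):
        self._shard_for(directory, name).add(directory, Entry(name, size))

    def list_dir(self, directory, predicate=lambda e: True):
        # Query every shard in parallel, then merge the already-sorted partials.
        with ThreadPoolExecutor(max_workers=len(self.shards)) as pool:
            partials = list(
                pool.map(lambda s: s.enumerate(directory, predicate), self.shards)
            )
        return list(heapq.merge(*partials, key=lambda e: e.name))

    def disk_usage(self, directory):
        # Cumulative operation: each shard reports a partial sum.
        with ThreadPoolExecutor(max_workers=len(self.shards)) as pool:
            return sum(pool.map(lambda s: s.usage(directory), self.shards))
```

Because each shard returns its matches already sorted, the final step is a k-way merge rather than a full sort, which keeps the client-side cost proportional to the size of the filtered result.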

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC52-06NA25396
Assignee:
Triad National Security, LLC (Los Alamos, NM)
Patent Number(s):
11,194,763
Application Number:
15/713,495
OSTI ID:
1860146
Resource Relation:
Patent File Date: 09/22/2017
Country of Publication:
United States
Language:
English
