Exploiting Internal Parallelism for Address Translation in Solid-State Drives

Xie, Wei; Chen, Yong; Roth, Philip C.

doi:10.1145/3239564

Title: Exploiting Internal Parallelism for Address Translation in Solid-State Drives

Abstract

Solid-state Drives (SSDs) have changed the landscape of storage systems and present a promising storage solution for data-intensive applications due to their low latency, high bandwidth, and low power consumption compared to traditional hard disk drives. SSDs achieve these desirable characteristics using internal parallelism—parallel access to multiple internal flash memory chips—and a Flash Translation Layer (FTL) that determines where data are stored on those chips so that they do not wear out prematurely. However, current state-of-the-art cache-based FTLs like the Demand-based Flash Translation Layer (DFTL) do not allow IO schedulers to take full advantage of internal parallelism, because they impose a tight coupling between the logical-to-physical address translation and the data access. In this study to address this limitation, we introduce a new FTL design called Parallel-DFTL that works with the DFTL to decouple address translation operations from data accesses. Parallel-DFTL separates address translation and data access operations into different queues, allowing the SSD to use concurrent flash accesses for both types of operations. We also present a Parallel-LRU cache replacement algorithm to improve the concurrency of address translation operations. To compare Parallel-DFTL against existing FTL approaches, we present a Parallel-DFTL performance model and compare its predictions against those formore »« less

Authors:

Xie, Wei ^[1]; Chen, Yong ^[1];

^[2]

Texas Tech Univ., Lubbock, TX (United States)
; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

Publication Date:: Sat Dec 15 00:00:00 EST 2018

Research Org.:: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

Sponsoring Org.:: USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)

OSTI Identifier:: 1490593

Grant/Contract Number:: AC05-00OR22725

Resource Type:: Accepted Manuscript

Journal Name:: ACM Transactions on Storage

Additional Journal Information:: Journal Volume: 14; Journal Issue: 4; Journal ID: ISSN 1553-3077

Publisher:: Association for Computing Machinery (ACM)

Country of Publication:: United States

Language:: English

Subject:: 97 MATHEMATICS AND COMPUTING; Flash translation layer; SSD; parallelism; DFTL; address translation

Citation Formats


                    Xie, Wei, Chen, Yong, and Roth, Philip C. Exploiting Internal Parallelism for Address Translation in Solid-State Drives.  United States: N. p., 2018. 
Web.  doi:10.1145/3239564.

Copy to clipboard


                    Xie, Wei, Chen, Yong, & Roth, Philip C. Exploiting Internal Parallelism for Address Translation in Solid-State Drives.  United States.  https://doi.org/10.1145/3239564

Copy to clipboard


                    Xie, Wei, Chen, Yong, and Roth, Philip C. Sat .  
"Exploiting Internal Parallelism for Address Translation in Solid-State Drives".  United States.  https://doi.org/10.1145/3239564.  https://www.osti.gov/servlets/purl/1490593.

Copy to clipboard


                    
@article{osti_1490593,

  title        = {Exploiting Internal Parallelism for Address Translation in Solid-State Drives},

  author       = {Xie, Wei and Chen, Yong and Roth, Philip C.},

  abstractNote = {Solid-state Drives (SSDs) have changed the landscape of storage systems and present a promising storage solution for data-intensive applications due to their low latency, high bandwidth, and low power consumption compared to traditional hard disk drives. SSDs achieve these desirable characteristics using internal parallelism—parallel access to multiple internal flash memory chips—and a Flash Translation Layer (FTL) that determines where data are stored on those chips so that they do not wear out prematurely. However, current state-of-the-art cache-based FTLs like the Demand-based Flash Translation Layer (DFTL) do not allow IO schedulers to take full advantage of internal parallelism, because they impose a tight coupling between the logical-to-physical address translation and the data access. In this study to address this limitation, we introduce a new FTL design called Parallel-DFTL that works with the DFTL to decouple address translation operations from data accesses. Parallel-DFTL separates address translation and data access operations into different queues, allowing the SSD to use concurrent flash accesses for both types of operations. We also present a Parallel-LRU cache replacement algorithm to improve the concurrency of address translation operations. To compare Parallel-DFTL against existing FTL approaches, we present a Parallel-DFTL performance model and compare its predictions against those for DFTL and an ideal page-mapping approach. We also implemented the Parallel-DFTL approach in an SSD simulator using real device parameters, and used trace-driven simulation to evaluate Parallel-DFTL’s efficacy. Our evaluation results show that Parallel-DFTL improved the overall performance by up to 32% for the real IO workloads we tested, and by up to two orders of magnitude with synthetic test workloads. Finally, we also found that Parallel-DFTL is able to achieve reasonable performance with a very small cache size and that it provides the best benefit for those workloads with large request size or with high write ratio.},

  doi          = {10.1145/3239564},

  journal      = {ACM Transactions on Storage},

  number       = 4,

  volume       = 14,

  place        = {United States},

  year         = {Sat Dec 15 00:00:00 EST 2018},

  month        = {Sat Dec 15 00:00:00 EST 2018}

}

Copy to clipboard

Journal Article:

Free Publicly Available Full Text

Accepted Manuscript (DOE)

Publisher's Version of Record

https://doi.org/10.1145/3239564

Other availability

Search WorldCat to find libraries that may hold this journal

Citation Metrics:

Cited by: 5 works

Citation information provided by
Web of Science

Save / Share:

Export Metadata

Save to My Library

Works referenced in this record:

Achieving page-mapping FTL performance at block-mapping FTL cost by hiding address translation
conference, May 2010

Hu, Yang; Jiang, Hong; Feng, Dan
2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
DOI: 10.1109/MSST.2010.5496970

Hot data identification for flash-based storage systems using multiple bloom filters
conference, May 2011

Park, Dongchul; Du, David H. C.
2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST)
DOI: 10.1109/MSST.2011.5937216

FlashSim: A Simulator for NAND Flash-Based Solid-State Drives
conference, September 2009

Kim, Youngjae; Tauras, Brendan; Gupta, Aayush
2009 First International Conference on Advances in System Simulation
DOI: 10.1109/SIMUL.2009.17

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
conference, January 2013

Van Houdt, Benny
Proceedings of the ACM SIGMETRICS/international conference on Measurement and modeling of computer systems - SIGMETRICS '13
DOI: 10.1145/2465529.2465543

LazyFTL: a page-level flash translation layer optimized for NAND flash memory
conference, January 2011

Ma, Dongzhe; Feng, Jianhua; Li, Guoliang
Proceedings of the 2011 international conference on Management of data - SIGMOD '11
DOI: 10.1145/1989323.1989325

Efficient identification of hot data for flash memory storage systems
journal, February 2006

Hsieh, Jen-Wei; Kuo, Tei-Wei; Chang, Li-Pin
ACM Transactions on Storage, Vol. 2, Issue 1
DOI: 10.1145/1138041.1138043

Sprinkler: Maximizing resource utilization in many-chip solid state disks
conference, February 2014

Jung, Myoungsoo; Kandemir, Mahmut T.
2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA)
DOI: 10.1109/HPCA.2014.6835961

Analytic modeling of SSD write performance
conference, January 2012

Desnoyers, Peter
Proceedings of the 5th Annual International Systems and Storage Conference on - SYSTOR '12
DOI: 10.1145/2367589.2367603

Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity
conference, January 2011

Hu, Yang; Jiang, Hong; Feng, Dan
Proceedings of the international conference on Supercomputing - ICS '11
DOI: 10.1145/1995896.1995912

A space-efficient flash translation layer for CompactFlash systems
journal, May 2002

Jesung Kim, ; Noh, S. H.
IEEE Transactions on Consumer Electronics, Vol. 48, Issue 2
DOI: 10.1109/TCE.2002.1010143

Hydra: A Block-Mapped Parallel Flash Memory Solid-State Disk Architecture
journal, July 2010

Seong, Yoon Jae; Nam, Eyee Hyun; Yoon, Jin Hyuk
IEEE Transactions on Computers, Vol. 59, Issue 7
DOI: 10.1109/TC.2010.63

CBM: A cooperative buffer management for SSD
conference, June 2014

Wei, Qingsong; Chen, Cheng; Yang, Jun
2014 30th Symposium on Mass Storage Systems and Technologies (MSST)
DOI: 10.1109/MSST.2014.6855545

FASTer FTL for Enterprise-Class Flash Memory SSDs
conference, May 2010

Lim, Sang-Phil; Lee, Sang-Won; Moon, Bongki
2010 International Workshop on Storage Network Architecture and Parallel I/Os (SNAPI)
DOI: 10.1109/SNAPI.2010.9

Ozone (O3): An Out-of-Order Flash Memory Controller Architecture
journal, May 2011

Nam, Eyee Hyun; Kim, Bryan Suk Joon; Eom, Hyeonsang
IEEE Transactions on Computers, Vol. 60, Issue 5
DOI: 10.1109/TC.2010.209

A log buffer-based flash translation layer using fully-associative sector translation
journal, July 2007

Lee, Sang-Won; Park, Dong-Joo; Chung, Tae-Sun
ACM Transactions on Embedded Computing Systems, Vol. 6, Issue 3
DOI: 10.1145/1275986.1275990

On the role of burst buffers in leadership-class storage systems
conference, April 2012

Liu, Ning; Cope, Jason; Carns, Philip
2012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST)
DOI: 10.1109/MSST.2012.6232369

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization
conference, January 2009

Dirik, Cagdas; Jacob, Bruce
Proceedings of the 36th annual international symposium on Computer architecture - ISCA '09
DOI: 10.1145/1555754.1555790

Two-mode data distribution scheme for heterogeneous storage in data centers
conference, October 2015

Xie, Wei; Zhou, Jiang; Reyes, Mark
2015 IEEE International Conference on Big Data (Big Data)
DOI: 10.1109/BigData.2015.7363772

Elastic Consistent Hashing for Distributed Storage Systems
conference, May 2017

Xie, Wei; Chen, Yong
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
DOI: 10.1109/IPDPS.2017.88

Hystor: making the best use of solid state drives in high performance storage systems
conference, January 2011

Chen, Feng; Koufaty, David A.; Zhang, Xiaodong
Proceedings of the international conference on Supercomputing - ICS '11
DOI: 10.1145/1995896.1995902

ASA-FTL: An adaptive separation aware flash translation layer for solid state drives
journal, January 2017

Xie, Wei; Chen, Yong; Roth, Philip C.
Parallel Computing, Vol. 61
DOI: 10.1016/j.parco.2016.10.006

DFTL: a flash translation layer employing demand-based selective caching of page-level address mappings
conference, January 2009

Gupta, Aayush; Kim, Youngjae; Urgaonkar, Bhuvan
Proceeding of the 14th international conference on Architectural support for programming languages and operating systems - ASPLOS '09
DOI: 10.1145/1508244.1508271

Hot/cold clustering for page mapping in NAND flash memory
journal, November 2011

Shin, Ilhoon
IEEE Transactions on Consumer Electronics, Vol. 57, Issue 4
DOI: 10.1109/TCE.2011.6131147

Performance of greedy garbage collection in flash-based solid-state drives
journal, November 2010

Bux, Werner; Iliadis, Ilias
Performance Evaluation, Vol. 67, Issue 11
DOI: 10.1016/j.peva.2010.07.003

Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing
conference, February 2011

Chen, Feng; Lee, Rubao; Zhang, Xiaodong
2011 IEEE 17th International Symposium on High Performance Computer Architecture (HPCA)
DOI: 10.1109/HPCA.2011.5749735

Revealing applications' access pattern in collective I/O for cache management
conference, January 2014

Lu, Yin; Chen, Yong; Latham, Rob
Proceedings of the 28th ACM international conference on Supercomputing - ICS '14
DOI: 10.1145/2597652.2597686

Using data clustering to improve cleaning performance for flash memory
journal, March 1999

Chiang, Mei-Ling; Lee, Paul C. H.; Chang, Ruei-Chuan
Software: Practice and Experience, Vol. 29, Issue 3
DOI: 10.1002/(SICI)1097-024X(199903)29:3<267::AID-SPE233>3.0.CO;2-T

Exploiting Internal Parallelism of Flash-based SSDs
journal, January 2010

Seon-yeong Park,
IEEE Computer Architecture Letters, Vol. 9, Issue 1
DOI: 10.1109/L-CA.2010.3

Multi-Channel Architecture-Based FTL for Reliable and High-Performance SSD
journal, December 2014

Hsieh, Jen-Wei; Lin, Han-Yi; Yang, Dong-Lin
IEEE Transactions on Computers, Vol. 63, Issue 12
DOI: 10.1109/TC.2013.169

Cleaning policies in mobile computers using flash memory
journal, November 1999

Chiang, M. -L.; Chang, R. -C.
Journal of Systems and Software, Vol. 48, Issue 3
DOI: 10.1016/S0164-1212(99)00059-X

Write amplification analysis in flash-based solid state drives
conference, January 2009

Hu, Xiao-Yu; Eleftheriou, Evangelos; Haas, Robert
Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference on - SYSTOR '09
DOI: 10.1145/1534530.1534544

Parallel-DFTL: A Flash Translation Layer That Exploits Internal Parallelism in Solid State Drives
conference, August 2016

Xie, Wei; Chen, Yong; Roth, Philip C.
2016 IEEE International Conference on Networking, Architecture and Storage (NAS)
DOI: 10.1109/NAS.2016.7549413

Using data clustering to improve cleaning performance for flash memory
journal, March 1999

Chiang, Mei-Ling; Lee, Paul C. H.; Chang, Ruei-Chuan
Software: Practice and Experience, Vol. 29, Issue 3
DOI: 10.1002/(SICI)1097-024X(199903)29:3%3C267::AID-SPE233%3E3.0.CO;2-T

Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems
conference, November 2014

Dai, Dong; Chen, Yong; Kimpe, Dries
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
DOI: 10.1109/SC.2014.57

PUD-LRU: An Erase-Efficient Write Buffer Management Algorithm for Flash Memory SSD
conference, August 2010

Hu, Jian; Jiang, Hong; Tian, Lei
Simulation of Computer and Telecommunication Systems (MASCOTS), 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
DOI: 10.1109/MASCOTS.2010.16

Locality-driven high-level I/O aggregation for processing scientific datasets
conference, October 2013

Liu, Jialin; Crysler, Bradly; Lu, Yin
2013 IEEE International Conference on Big Data
DOI: 10.1109/BigData.2013.6691560

ADAPT: Efficient workload-sensitive flash management based on adaptation, prediction and aggregation
conference, April 2012

Wang, Chundong; Wong, Weng-Fai
2012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST)
DOI: 10.1109/MSST.2012.6232388

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization
journal, June 2009

Dirik, Cagdas; Jacob, Bruce
ACM SIGARCH Computer Architecture News, Vol. 37, Issue 3
DOI: 10.1145/1555815.1555790

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
journal, June 2013

Van Houdt, Benny
ACM SIGMETRICS Performance Evaluation Review, Vol. 41, Issue 1
DOI: 10.1145/2494232.2465543

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
journal, April 2014

Van Houdt, Benny
Queueing Systems, Vol. 77, Issue 2
DOI: 10.1007/s11134-014-9403-0

Similar Records in DOE PAGES and OSTI.GOV collections:

Active Flash: Performance-Energy Tradeoffs for Out-of-Core Processing on Non-Volatile Memory Devices

Conference Boboila, Simona ; Kim, Youngjae ; Vazhkudai, Sudharshan S ; ...

In this abstract, we study the performance and energy tradeoffs involved in migrating data analysis into the flash device, a process we refer to as Active Flash. The Active Flash paradigm is similar to 'active disks', which has received considerable attention. Active Flash allows us to move processing closer to data, thereby minimizing data movement costs and reducing power consumption. It enables true out-of-core computation. The conventional definition of out-of-core solvers refers to an approach to process data that is too large to fit in the main memory and, consequently, requires access to disk. However, in Active Flash, processing outsidemore »« less
A Temporal Locality-Aware Page-Mapped Flash Translation Layer

Journal Article Kim, Youngjae ; Gupta, Aayush ; Urgaonkar, Bhuvan - JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY

The poor performance of random writes has been a cause of major concern which needs to be addressed to better utilize the potential of flash in enterprise-scale environments. We examine one of the important causes of this poor performance: the design of the flash translation layer (FTL) which performs the virtual-to-physical address translations and hides the erase-before-write characteristics of flash. We propose a complete paradigm shift in the design of the core FTL engine from the existing techniques with our Demand-Based Flash Translation Layer (DFTL) which selectively caches page- level address mappings. Our experimental evaluation using FlashSim with realistic enterprise-scalemore »« less
https://doi.org/10.1007/s11390-013-1395-4
ASA-FTL: An adaptive separation aware flash translation layer for solid state drives

Journal Article Xie, Wei ; Chen, Yong ; Roth, Philip C. - Parallel Computing

Here, the flash-memory based Solid State Drive (SSD) presents a promising storage solution for increasingly critical data-intensive applications due to its low latency (high throughput), high bandwidth, and low power consumption. Within an SSD, its Flash Translation Layer (FTL) is responsible for exposing the SSD’s flash memory storage to the computer system as a simple block device. The FTL design is one of the dominant factors determining an SSD’s lifespan and performance. To reduce the garbage collection overhead and deliver better performance, we propose a new, low-cost, adaptive separation-aware flash translation layer (ASA-FTL) that combines sampling, data clustering and selectivemore »« less
Cited by 11
https://doi.org/10.1016/j.parco.2016.10.006

Full Text Available
Data Locality Enhancement of Dynamic Simulations for Exascale Computing (Final Report)

Technical Report Shen, Xipeng

The development of modern processors exhibits two trends that complicate the optimizations of modern software. The first is the increasing sensitivity of processors' throughput to irregularities in computation. With more processors produced through a massive integration of simple cores, future systems will increasingly favor regular data-level parallel computations, but deviate from the needs of applications with complex patterns. Some evidences are already shown on Graphic Processing Units (GPU): Irregular data accesses (e.g., indirect references A[D[i]]) and conditional branches are limiting many GPU applications' performance at a level an order of magnitude lower than the peak of GPU. The second hardwaremore »« less
https://doi.org/10.2172/1576175

Full Text Available
...And Eat it Too: High Read Performance in Write-Optimized HPC I/O Middleware File Formats

Conference Klasky, Scott A ; Lofstead, J. ; Bent, John ; ...

As HPC applications run on increasingly high process counts on larger and larger machines, both the frequency of checkpoints needed for fault tolerance and the resolution and size of Data Analysis Dumps are expected to increase proportionally. In order to maintain an acceptable ratio of time spent performing useful computation work to time spent performing I/O, write bandwidth to the underlying storage system must increase proportionally to this increase in the checkpoint and computation size. Unfortunately, popular scientific self-describing file formats such as netCDF and HDF5 are designed with a focus on portability and flexibility. Extra care and careful craftingmore »« less

Similar Records

Title: Exploiting Internal Parallelism for Address Translation in Solid-State Drives

Abstract

Citation Formats

Achieving page-mapping FTL performance at block-mapping FTL cost by hiding address translation conference, May 2010

Hot data identification for flash-based storage systems using multiple bloom filters conference, May 2011

FlashSim: A Simulator for NAND Flash-Based Solid-State Drives conference, September 2009

A mean field model for a class of garbage collection algorithms in flash-based solid state drives conference, January 2013

LazyFTL: a page-level flash translation layer optimized for NAND flash memory conference, January 2011

Efficient identification of hot data for flash memory storage systems journal, February 2006

Sprinkler: Maximizing resource utilization in many-chip solid state disks conference, February 2014

Analytic modeling of SSD write performance conference, January 2012

Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity conference, January 2011

A space-efficient flash translation layer for CompactFlash systems journal, May 2002

Hydra: A Block-Mapped Parallel Flash Memory Solid-State Disk Architecture journal, July 2010

CBM: A cooperative buffer management for SSD conference, June 2014

FASTer FTL for Enterprise-Class Flash Memory SSDs conference, May 2010

Ozone (O3): An Out-of-Order Flash Memory Controller Architecture journal, May 2011

A log buffer-based flash translation layer using fully-associative sector translation journal, July 2007

On the role of burst buffers in leadership-class storage systems conference, April 2012

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization conference, January 2009

Two-mode data distribution scheme for heterogeneous storage in data centers conference, October 2015

Elastic Consistent Hashing for Distributed Storage Systems conference, May 2017

Hystor: making the best use of solid state drives in high performance storage systems conference, January 2011

ASA-FTL: An adaptive separation aware flash translation layer for solid state drives journal, January 2017

DFTL: a flash translation layer employing demand-based selective caching of page-level address mappings conference, January 2009

Hot/cold clustering for page mapping in NAND flash memory journal, November 2011

Performance of greedy garbage collection in flash-based solid-state drives journal, November 2010

Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing conference, February 2011

Revealing applications' access pattern in collective I/O for cache management conference, January 2014

Using data clustering to improve cleaning performance for flash memory journal, March 1999

Exploiting Internal Parallelism of Flash-based SSDs journal, January 2010

Multi-Channel Architecture-Based FTL for Reliable and High-Performance SSD journal, December 2014

Cleaning policies in mobile computers using flash memory journal, November 1999

Write amplification analysis in flash-based solid state drives conference, January 2009

Parallel-DFTL: A Flash Translation Layer That Exploits Internal Parallelism in Solid State Drives conference, August 2016

Using data clustering to improve cleaning performance for flash memory journal, March 1999

Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems conference, November 2014

PUD-LRU: An Erase-Efficient Write Buffer Management Algorithm for Flash Memory SSD conference, August 2010

Locality-driven high-level I/O aggregation for processing scientific datasets conference, October 2013

ADAPT: Efficient workload-sensitive flash management based on adaptation, prediction and aggregation conference, April 2012

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization journal, June 2009

A mean field model for a class of garbage collection algorithms in flash-based solid state drives journal, June 2013

A mean field model for a class of garbage collection algorithms in flash-based solid state drives journal, April 2014

Achieving page-mapping FTL performance at block-mapping FTL cost by hiding address translation
conference, May 2010

Hot data identification for flash-based storage systems using multiple bloom filters
conference, May 2011

FlashSim: A Simulator for NAND Flash-Based Solid-State Drives
conference, September 2009

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
conference, January 2013

LazyFTL: a page-level flash translation layer optimized for NAND flash memory
conference, January 2011

Efficient identification of hot data for flash memory storage systems
journal, February 2006

Sprinkler: Maximizing resource utilization in many-chip solid state disks
conference, February 2014

Analytic modeling of SSD write performance
conference, January 2012

Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity
conference, January 2011

A space-efficient flash translation layer for CompactFlash systems
journal, May 2002

Hydra: A Block-Mapped Parallel Flash Memory Solid-State Disk Architecture
journal, July 2010

CBM: A cooperative buffer management for SSD
conference, June 2014

FASTer FTL for Enterprise-Class Flash Memory SSDs
conference, May 2010

Ozone (O3): An Out-of-Order Flash Memory Controller Architecture
journal, May 2011

A log buffer-based flash translation layer using fully-associative sector translation
journal, July 2007

On the role of burst buffers in leadership-class storage systems
conference, April 2012

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization
conference, January 2009

Two-mode data distribution scheme for heterogeneous storage in data centers
conference, October 2015

Elastic Consistent Hashing for Distributed Storage Systems
conference, May 2017

Hystor: making the best use of solid state drives in high performance storage systems
conference, January 2011

ASA-FTL: An adaptive separation aware flash translation layer for solid state drives
journal, January 2017

DFTL: a flash translation layer employing demand-based selective caching of page-level address mappings
conference, January 2009

Hot/cold clustering for page mapping in NAND flash memory
journal, November 2011

Performance of greedy garbage collection in flash-based solid-state drives
journal, November 2010

Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing
conference, February 2011

Revealing applications' access pattern in collective I/O for cache management
conference, January 2014

Using data clustering to improve cleaning performance for flash memory
journal, March 1999

Exploiting Internal Parallelism of Flash-based SSDs
journal, January 2010

Multi-Channel Architecture-Based FTL for Reliable and High-Performance SSD
journal, December 2014

Cleaning policies in mobile computers using flash memory
journal, November 1999

Write amplification analysis in flash-based solid state drives
conference, January 2009

Parallel-DFTL: A Flash Translation Layer That Exploits Internal Parallelism in Solid State Drives
conference, August 2016

Using data clustering to improve cleaning performance for flash memory
journal, March 1999

Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems
conference, November 2014

PUD-LRU: An Erase-Efficient Write Buffer Management Algorithm for Flash Memory SSD
conference, August 2010

Locality-driven high-level I/O aggregation for processing scientific datasets
conference, October 2013

ADAPT: Efficient workload-sensitive flash management based on adaptation, prediction and aggregation
conference, April 2012

The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device architecture, and system organization
journal, June 2009

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
journal, June 2013

A mean field model for a class of garbage collection algorithms in flash-based solid state drives
journal, April 2014