Append is Near: Log-based Data Management on ZNS SSDs
Appeared in Conference on Innovative Data Systems Research 2022 (CIDR '22).
Abstract
Log-based data management systems use storage as if it was an append-only medium in order to transform random writes into sequential writes, delivering a major advantage when logs were persisted on hard disks. Although solid-state drives (SSDs) offer improved random write capabilities, sequential writes continue to be advantageous due to locality and space efficiency. However, the inherent properties of flash-based SSDs induce significant disadvantages when utilizing a random write block interface, causing write amplification, uneven wear, log stacking, and garbage collection overheads. To eliminate these disadvantages, Zoned Namespace (ZNS) SSDs have recently been introduced. They offer increased capacity, reduced write amplification, and higher performance but require the host to participate in data placement through zones, which have sequential-write semantics and must be explicitly reset.
In ZNS, the Zone Append primitive allows the host to push down fine-grained data placement onto the device, supporting appends to a zone without knowing the location of the tail. Full zones become immutable, greatly simplifying disaggregated storage and operations like replication.
We propose another pushdown technique, Group Append, which allows appends of data that is smaller (or possibly larger) than a block, offloading data buffering to the controller. We explore how ZNS SSDs with Zone Append, Group Append, and computational storage can benefit four log-based data management areas: (i) log-based file systems, (ii) LSM trees such as RocksDB, (iii) database systems, and (iv) event logs/shared logs. We also propose research directions for all four log-based data management using ZNS SSDs.
Publication date:
January 2022
        Authors:
        
            
                Devashish Purandare
            
        
            
                Pete Wilcox
            
        
            
                Heiner Litz
            
        
            
                Shel Finkelstein
            
        
    
        Projects:
        
            Archival Storage
        
            Designing an Efficient Flash Translation
        
            Designing systems for QLC flash
        
            Shingled Disk
        
            Computational Storage
        
    
Available media
            
                Full paper text:
                
                    PDF
                
                
                    
Presentation:
                    
                    
                    video
                
            
        
Bibtex entry
@inproceedings{purandare-cidr22,
  author       = {Devashish Purandare and Pete Wilcox and Heiner Litz and Shel Finkelstein},
  title        = {Append is Near: Log-based Data Management on {ZNS} {SSDs}},
  booktitle    = {Conference on Innovative Data Systems Research 2022 (CIDR '22)},
  month        = jan,
  year         = {2022},
}
    
