Measurement and Analysis of Large-Scale Network File System Workloads
Appeared in Proceedings of the 2008 USENIX Technical Conference.
Abstract
In this paper we present the analysis of two large-scale network file system workloads. We measured CIFS traffic for two enterprise-class file servers deployed in the NetApp data center for a three month period. One file server was used by marketing, sales, and finance departments and the other by the engineering department. Together these systems represent over 22 TB of storage used by over 1500 employees, making this the first ever large-scale study of the CIFS protocol.
We analyzed how our network file system workloads compared to those of previous file system trace studies and took an in-depth look at access, usage, and sharing patterns. We found that our workloads were quite different from those previously studied; for example, our analysis found increased read-write file access patterns, decreased read-write ratios, more random file access, and longer file lifetimes. In addition, we found a number of interesting properties regarding file sharing, file re-use, and the access patterns of file types and users, showing that modern file system workload has changed in the past 5–10 years. This change in workload characteristics has implications on the future design of network file systems, which we describe in the paper.
Publication date:
June 2008
        Authors:
        
            
                Andrew Leung
            
        
            
                Shankar Pasupathy
            
        
            
                Garth Goodson
            
        
            
                Ethan L. Miller
            
        
    
        Projects:
        
            Scalable File System Indexing
        
            Tracing and Benchmarking
        
            Ultra-Large Scale Storage
        
    
Available media
            
                Full paper text:
                
                    PDF
                
                
                    
Presentation:
                    slides
                    
                    
                
            
        
Bibtex entry
@inproceedings{leung-usenix08,
  author       = {Andrew Leung and Shankar Pasupathy and Garth Goodson and Ethan L. Miller},
  title        = {Measurement and Analysis of Large-Scale Network File System Workloads},
  booktitle    = {Proceedings of the 2008 USENIX Technical Conference},
  month        = jun,
  year         = {2008},
}
    
