ACM Home Page
Please provide us with feedback. Feedback
Demystifying data deduplication
Full text PdfPdf (528 KB)
Source Middleware Conference archive
Proceedings of the ACM/IFIP/USENIX Middleware '08 Conference Companion table of contents
Leuven, Belgium
SESSION: Industrial track table of contents
Pages 12-17  
Year of Publication: 2008
ISBN:978-1-60558-369-3
Authors
Nagapramod Mandagere  University of Minnesota
Pin Zhou  IBM Almaden Research Center
Mark A Smith  IBM Almaden Research Center
Sandeep Uttamchandani  IBM Almaden Research Center
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 31,   Downloads (12 Months): 191,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1462735.1462739
What is a DOI?

ABSTRACT

Effectiveness and tradeoffs of deduplication technologies are not well understood -- vendors tout Deduplication as a "silver bullet" that can help any enterprise optimize its deployed storage capacity. This paper aims to provide a comprehensive taxonomy and experimental evaluation using real-world data. While the rate of change of data on a day-to-day basis has the greatest influence on the duplication in backup data, we investigate the duplication inherent in this data, independent of rate of change of data or backup schedule or backup algorithm used. Our experimental results show that between different deduplication techniques the space savings varies by about 30%, the CPU usage differs by almost 6 times and the time to reconstruct a deduplicated file can vary by more than 15 times.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
 
5
M. O. Rabin. Fingerprinting by random polynomials. In Center for Research in Computing Technology, Harvard University. Tech Report TRCSE-03-01, 2006, 1981.
 
6
L. You and C. Karamanolis. Evaluation of efficient archival storage techniques. In 21st IEEE/12th NASA Goddard Conference on Mass Storage systems and Technologies, 2004.
 
7
 
8


Collaborative Colleagues:
Nagapramod Mandagere: colleagues
Pin Zhou: colleagues
Mark A Smith: colleagues
Sandeep Uttamchandani: colleagues