|
ABSTRACT
Synchronous transactional logging is the central mechanism for ensuring data persistency and recoverability in database systems. Unfortunately, magnetic disks are ill-suited for the small sequential write pattern of synchronous logging. Alternative solutions (e.g., backup servers or sophisticated battery-backed write caches in high-end disk arrays) are either expensive or complicated. In this paper, we exploit flash devices for synchronous logging based on the observation that flash devices support small sequential writes well. Comparing a wide variety of flash devices, we find that USB flash drives are a good match for this task because of its unique characteristics: widely available USB ports, hot-plug capability useful for coping with flash wear, and low price so that multiple drives are affordable. We propose FlashLogging, a logging solution that exploits multiple (USB) flash drives for synchronous logging. We identify and address four challenges: (i) efficiently exploiting multiple flash drives for logging; (ii) coping with the large variance of write latencies because of device erasure operations; (iii) efficient recovery processing; and (iv) combining flash drives and disks for better logging and recovery performance. We implemented our solution within MySQL-InnoDB. Our real machine experiments running online transaction processing workloads (TPCC) show that FlashLogging achieves up to 5.7X improvements over magnetic-disk-based logging, and obtains up to 98.6% of the ideal performance. We further compare our design with one that uses Solid-State Drives (SSDs), and find that although SSDs improve logging performance, multiple USB flash drives can achieve comparable or better performance with much lower price.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Todd Anderson , Yuri Breitbart , Henry F. Korth , Avishai Wool, Replication, consistency, and practicality: are these mutually exclusive?, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.484-495, June 01-04, 1998, Seattle, Washington, United States
|
| |
2
|
|
| |
3
|
L. Bouganim, B. T. Jónsson, and P. Bonnet. uFLIP: Understanding ash io patterns. In CIDR, 2009.
|
 |
4
|
Yuri Breitbart , Raghavan Komondoor , Rajeev Rastogi , S. Seshadri , Avi Silberschatz, Update propagation protocols for replicated databates, Proceedings of the 1999 ACM SIGMOD international conference on Management of data, p.97-108, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
 |
5
|
Adrian M. Caulfield , Laura M. Grupp , Steven Swanson, Gordon: using flash memory to build fast, power-efficient clusters for data-intensive applications, Proceeding of the 14th international conference on Architectural support for programming languages and operating systems, March 07-11, 2009, Washington, DC, USA
|
| |
6
|
Chris Mellor. Faster usb 3.0 is coming. http://www.pcworld.com/article/137551/.
|
| |
7
|
|
 |
8
|
|
 |
9
|
|
 |
10
|
|
 |
11
|
Jim Gray , Pat Helland , Patrick O'Neil , Dennis Shasha, The dangers of replication and a solution, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.173-182, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
12
|
|
 |
13
|
Stavros Harizopoulos , Daniel J. Abadi , Samuel Madden , Michael Stonebraker, OLTP through the looking glass, and what we found there, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
[doi> 10.1145/1376616.1376713]
|
| |
14
|
|
| |
15
|
Intel X25-M 80GB SATA Solid State Drive, Intel Ups The Ante. http://hothardware.com/Articles/Intel-X25M-80GB-SATA-Solid-State-Drive-Intel-Ups-The-Ante/?page=2.
|
| |
16
|
Intel X25-M and X18-M Mainsteam SATA Solid State Drives. http://download.intel.com/design/ ash/nand/mainstream/mainstream-sata-ssd-product-brief.pdf.
|
| |
17
|
D. Kline and N. Pilaka. An overview of transactional logging in db2 universal database. http://www.ibm.com/developerworks/db2/library/techarticle/0301kline/0301kline.html.
|
| |
18
|
|
 |
19
|
|
 |
20
|
Sang-Won Lee , Bongki Moon , Chanik Park , Jae-Myung Kim , Sang-Woo Kim, A case for flash memory ssd in enterprise database applications, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
[doi> 10.1145/1376616.1376723]
|
 |
21
|
|
| |
22
|
Microsoft Corp. Sql server 7.0, sql server 2000, and sql server 2005 logging and data storage algorithms. http://support.microsoft.com/kb/230785.
|
| |
23
|
|
| |
24
|
|
 |
25
|
|
| |
26
|
Edmund B. Nightingale , Kaushik Veeraraghavan , Peter M. Chen , Jason Flinn, Rethink the sync, Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation, p.1-1, November 06-08, 2006, Seattle, WA
|
| |
27
|
Oracle Corp. Oracle database administrator's guide 10g release 1 (chapter 6. managing the redo log). Part Number B10739-01.
|
| |
28
|
OSDL DBT2. http://sourceforge.net/projects/osdldbt.
|
| |
29
|
V. Prabhakaran, T. L. Rodeheffer, and L. Zhou. Transactional ash. In OSDI, 2008.
|
 |
30
|
|
| |
31
|
Samsung Electronics. Onenand specification. http://www.samsung.com.
|
 |
32
|
|
| |
33
|
Michael Stonebraker , Samuel Madden , Daniel J. Abadi , Stavros Harizopoulos , Nabil Hachem , Pat Helland, The end of an architectural era: (it's time for a complete rewrite), Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
| |
34
|
Transaction Processing Performance Council. TPC-C benchmark. http://www.tpc.org/tpcc/.
|
| |
35
|
V. Vasudevan, J. Franklin, D. Andersen, A. Phanishayee, L. Tan, M. Kaminsky, and I. Moraru. Fawndamentally power-efficient clusters. In HotOS, 2009.
|
 |
36
|
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.2
Physical Design
Additional Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.4
Systems
Subjects:
Transaction processing
H.2.7
Database Administration
Subjects:
Logging and recovery
General Terms:
Algorithms,
Design,
Performance,
Reliability
Keywords:
flash devices,
flashlogging,
near-zero-delay archival disk,
online transaction processing,
outlier detection and hiding,
recovery processing,
synchronous logging,
unconventional array organization
|