ACM Home Page
Please provide us with feedback. Feedback
Optimization and bottleneck analysis of network block I/O in commodity storage systems
Full text PdfPdf (443 KB)
Source
International Conference on Supercomputing archive
Proceedings of the 21st annual international conference on Supercomputing table of contents
Seattle, Washington
SESSION: Runtime systems table of contents
Pages: 33 - 42  
Year of Publication: 2007
ISBN:978-1-59593-768-1
Authors
Manolis Marazakis  Institute of Computer Science (ICS), Heraklion, Greece
Vassilis Papaefstathiou  Institute of Computer Science (ICS), Heraklion, Greece
Angelos Bilas  Institute of Computer Science (ICS), Heraklion, Greece
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 76,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1274971.1274979
What is a DOI?

ABSTRACT

Building commodity networked storage systems is an important architectural trend; Commodity servers hosting a moderate number of consumer-grade disks and interconnected with a high-performance network are an attractive option for improving storage system scalability and cost-efficiency. However, such systems incur significant overheads and are not able to deliver to applications the available throughput. We examine in detail the sources of overheads in such systems, using a working prototype to quantify the overheads associated with various parts of the I/O protocol. We optimize our base protocol to deal with small requests by batching them at the network level and without any I/O-specific knowledge. We also redesign our protocol stack to allow for asynchronous event processing, in-line, during send-path request processing. These techniques improve performance for a 8-disk SATA RAID0 array from 200 to 290 MBytes/s (45% improvement). Using a ramdisk, peak performance improves from 320 to 474 MBytes/s (48% improvement), which is 72% of the maximum possible throughput in our experimental setup. We also analyze the remaining system bottlenecks, and find that although commodity storage systems have potential for building high-performance I/O subsystems, traditional network and I/O protocols are not fully capable of delivering this potential.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
An Infiniband Technology Overview. Infiniband Trade Association, http://www.infinibandta.org/ibta.
 
2
Chelsio Communications. http://www.chelsio.com.
 
3
Mellanox Technologies. http://www.mellanox.com.
 
4
Myri-10G Overview. http://www.myricom.com/Myri-10G/overview/.
 
5
Rocket I/O User Guide. Xilinx Inc, http://www.xilinx.com/bvdocs-/userguides/ug024.pdf.
 
6
 
7
 
8
L. Chung, J. Gray, B. Worthington, and R. Horst. Windows 2000 Disk I/O Performance. Technical Report MS-TR-2000-55, Microsoft Research, 2000.
 
9
D. Dalessandro, P. Wyckoff, and G. Montry. Initial Performance Evaluation of the NetEffect 10 Gigabit iWARP Adapter. In Proceedings of the RAIT Workshop (in conjunction with the IEEE International Conference on Cluster Computing), 2006.
10
 
11
 
12
B. Hausauer. iWARP: Reducing Ethernet Overhead in Data Center Designs. CommsDesign, Feb. 2004. http://www.commsdesign.com-/article/printableArticle.jhtml?articleID=51202855.
 
13
I/O Performance Inc. The xdd I/O Benchmark. http://www.ioperformance.com.
 
14
 
15
J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, and D. Panda. Microbenchmark Performance Comparison of High-Speed Cluster Interconnects. IEEE Micro, 24(1):42--51, 2004.
 
16
 
17
J. Liu, D. Panda, and M. Banikazem. Evaluating the Impact of RDMA on Storage I/O over InfiniBand. In Proceedings of the SAN Workshop (in conjunction with HPCA Conference), 2004.
 
18
M. Marazakis, V. Papaefstathiou, G. Kalokairinos, and A. Bilas. Experiences from Debugging a PCI-X-based RDMA-capable NIC. In Proceedings of the RAIT Workshop (in conjunction with IEEE International Conference on Cluster Computing, 2006.
19
 
20
D. Mayhew and V. Krishnan. PCI Express and Advanced Switching: Evolutionary Path to Building Next-Generation Interconnects. In Proceedings of the 11th IEEE Symposium on High Performance Interconnects, 2003.
 
21
Mindshare, Inc. and T. Shanley. PCI-X System Architecture. Addison-Wesley Professional, 2001.
22
 
23
 
24
 
25
J. Pinkerton. The case for rdma, 2002. RDMA Consortium, http://www.rdmaconsortium.org/home/The_Case_for_RDMA-02053.pdf.
 
26
 
27
 
28
J. Smith, and C. Traw. Operating System Support for End-to-end GBps Networking. IEEE Network, 7(2), 1993.
29
30

Collaborative Colleagues:
Manolis Marazakis: colleagues
Vassilis Papaefstathiou: colleagues
Angelos Bilas: colleagues