ACM Home Page
Please provide us with feedback. Feedback
Efficient remote block-level I/O over an RDMA-capable NIC
Full text PdfPdf (320 KB)
Source International Conference on Supercomputing archive
Proceedings of the 20th annual international conference on Supercomputing table of contents
Cairns, Queensland, Australia
SESSION: I/O--communication table of contents
Pages: 97 - 106  
Year of Publication: 2006
ISBN:1-59593-282-8
Authors
Manolis Marazakis  Institute of Computer Science (ICS), Heraklion, Greece GR
Konstantinos Xinidis  Institute of Computer Science (ICS), Heraklion, Greece GR
Vassilis Papaefstathiou  Institute of Computer Science (ICS), Heraklion, Greece GR
Angelos Bilas  Institute of Computer Science (ICS), Heraklion, Greece GR
Sponsors
SIGARCH: ACM Special Interest Group on Computer Architecture
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 49,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1183401.1183417
What is a DOI?

ABSTRACT

Modern storage systems are required to scale to large storage capacities and I/O throughput in a cost effective manner. For this reason, they are increasingly being built out of commodity components, mainly PCs equipped with large numbers of disks and interconnected of high-performance system area networks. A main issue in these efforts is to achieve high I/O throughput over commodity, low-cost system area networks and commodity operating systems.In this work, we examine in detail the performance of remote block-level storage I/O over commodity, RDMA-capable network interfaces and networks. We examine the support that is required from the network interface for achieving high throughput. We also examine in detail the overheads associated in kernel-level protocols for networked storage access. We find that base system performance is limited by (a) interrupt cost, (b) request size, and (c) protocol message size. We examine the impact of techniques to alleviate these factors and find that our techniques combined can improve throughput by up to 100% over a simpler unoptimized configuration. Our current prototype is able to achieve a throughput of about 200 MBytes/s over a network that is capable of delivering about 500 MBytes/s. We identify major limiting factors, mostly at the I/O target-side.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
An infiniband technology overview. Infiniband Trade Association, http://www.infinibandta.org/ibta.
 
2
Reiserfs. Namesys Inc, http://www.namesys.com.
 
3
Rocket i/o user guide. Xilinx Inc, http://www.xilinx.com/bvdocs/userguides/ug024.pdf.
 
4
5
6
 
7
8
 
9
 
10
 
11
J. Chase, A. Gallatin, and K. Yocum. End-system optimizations for high-speed tcp. IEEE Communications, 39(4):68--74, 2001. Special issue on TCP Performance in Future Networking Environments.
12
 
13
C. Dubnicki, A. Bilas, Y. Chen, S. Damianakis, and K. Li. VMMC-2: efficient support for reliable, connection-oriented communication. In Proc. of The 1997 IEEE Symposium on High Performance Interconnects (HOT Interconnects V). Stanford CA, USA., Aug. 1997
 
14
15
 
16
Giganet, Giganet cLAN family of products. http://www.emulex.com/products.html, 2001.
 
17
 
18
I/O Performance Inc. The xdd i/o benchmark. http://www.ioperformance.com.
 
19
G. Kalokairinos, V. Papaefstathiou, A. Ioannou, D. Simos, M. Papamichail, G. Mihelogiannakis, M. Marazakis, D. Pnevmatikatos, and M. Katevenis. Design and implementation of a multi-gigabit nic anda scalable buffered crossbar switch. Technical Report TR376-04-2006, FORTH-ICS, 2006.
 
20
J. Katcher. Postmark: A new file system benchmark. Technical Report TR3022, Network Applicance Inc., 1997.
 
21
22
 
23
J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, and D. Panda. Microbenchmark performance comparison of high-speed cluster interconnects. IEEE Micro, 24(1):42--51, 2004.
 
24
25
 
26
D. Mayhew and V. Krishnan. Pci express and advanced switching: Evolutionary path to building next-generation interconnects. In Proceedings of the 11th IEEE Symposium on High Performance Interconnects, 2003.
 
27
 
28
V. Olaru and W. Tichy. On the design and performance of remote disk drivers for clusters of pcs. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications. CSREA Press, 2004.
 
29
 
30
PCI-SIG. Pci express. http://www.pcisig.com.
 
31
 
32
 
33
J. Pinkerton. The case for rdma, 2002. RDMA Consortium, http://www.rdmaconsortium.org/home/The_Case_for_RDMA-02053.pdf.
 
34
L. Prylli and B. Tourancheau. BIP: a new protocol designed for high performance. In PC-NOW Workshop, held in parallel with IPPS/SPDP98, Orlando, USA, March 30 - April 3 1998.
 
35
 
36
G. Regnier, D. Minturn, G. McAlpine, V. Saletore, and A. Foong. Eta: Experience with an intel xeon processor as a packet processing engine. IEEE Micro, 24(1):24--31, 2004.
 
37
Y. Saito, S. Frolund, A. Veitch, A. Merchant, and S. Spence. FAB: Enterprise storage systems on a shoestring. In Proc. of the ASPLOS 2004, Oct. 2004.
 
38
A. S. I. SIG. Asi technical overview. http://www.asi-sig.org.
 
39
H. Tezuka, A. Hori, and Y. Ishikawa. PM: a high-performance communication library for multi-user parallel environments. Technical Report TR-96015, Real World Computing Partnership, 1996.
 
40
41
42


Collaborative Colleagues:
Manolis Marazakis: colleagues
Konstantinos Xinidis: colleagues
Vassilis Papaefstathiou: colleagues
Angelos Bilas: colleagues