ABSTRACT
In this paper we report on the performance of the remote procedure call (RPC) implementation for the Firefly multiprocessor and analyze the implementation to account precisely for all measured latency. From the analysis and measurements, we estimate how much faster RPC could be if certain improvements were made. The elapsed time for an intermachine call to a remote procedure that accepts no arguments and produces no results is 2.66 ms. The elapsed time for an RPC that has a single 1440-byte result (the maximum result that will fit in a single packet) is 6.35 ms. Maximum intermachine throughput of application program data using RPC is 4.65 Mbits/s, achieved with four threads making parallel RPCs that return the maximum-size result that fits in a single RPC result packet. CPU utilization at maximum throughput is about 1.2 CPU seconds per second on the calling machine and a little less on the server. These measurements are for RPCs from user space on one machine to user space on another, using the installed system and a 10 Mbit/s Ethernet. The RPC packet exchange protocol is built on IP/UDP, and the times include calculating and verifying UDP checksums. The Fireflies used in the tests had 5 MicroVAX II processors and a DEQNA Ethernet controller.
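The figures quoted above can be related to one another with a little arithmetic. The sketch below is illustrative only and is not taken from the paper's analysis. It assumes each thread issues calls back to back, so that one thread completes one maximum-size RPC every 6.35 ms. The function name `payload_rate_mbits` and the constant names are hypothetical.

```python
# Figures quoted in the abstract.
RESULT_BYTES = 1440         # maximum result that fits in a single packet
CALL_LATENCY_S = 6.35e-3    # elapsed time for one RPC returning that result
MEASURED_MAX_MBITS = 4.65   # measured maximum throughput, four threads
THREADS = 4
ETHERNET_MBITS = 10.0       # raw Ethernet bit rate


def payload_rate_mbits(result_bytes, latency_s, threads=1):
    """Payload rate in Mbit/s, assuming (idealized) that each thread
    completes exactly one call every latency_s seconds."""
    return threads * result_bytes * 8 / latency_s / 1e6


# One thread making back-to-back calls.
single = payload_rate_mbits(RESULT_BYTES, CALL_LATENCY_S)

# Four threads, if throughput scaled linearly (it does not in practice).
ideal_four = payload_rate_mbits(RESULT_BYTES, CALL_LATENCY_S, THREADS)

# How close the measured maximum comes to that linear ideal.
scaling = MEASURED_MAX_MBITS / ideal_four

# Calls per second implied by the measured maximum throughput.
calls_per_s = MEASURED_MAX_MBITS * 1e6 / (RESULT_BYTES * 8)

# Share of the raw Ethernet bit rate carrying application data.
wire_share = MEASURED_MAX_MBITS / ETHERNET_MBITS

print(f"single thread:      {single:.2f} Mbit/s")
print(f"ideal four threads: {ideal_four:.2f} Mbit/s")
print(f"measured / ideal:   {scaling:.0%}")
print(f"calls per second:   {calls_per_s:.0f}")
print(f"share of Ethernet:  {wire_share:.1%}")
```

Under these assumptions a single thread would move roughly 1.8 Mbit/s of application data, and the measured 4.65 Mbit/s is about two thirds of what four perfectly overlapped threads would achieve. This is consistent with the abstract's point that overlapping calls helps, but it says nothing about where the remaining time goes.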
CITED BY 40
Edward W. Felten, Richard D. Alpert, Angelos Bilas, Matthias A. Blumrich, Douglas W. Clark, Stefanos N. Damianakis, Cezary Dubnicki, Liviu Iftode, Kai Li, Early experience with message-passing on the SHRIMP multicomputer, ACM SIGARCH Computer Architecture News, v.24 n.2, p.296-307, May 1996
C. Consel, L. Hornof, R. Marlet, G. Muller, S. Thibault, E.-N. Volanschi, J. Lawall, J. Noyé, Tempo: specializing systems applications and beyond, ACM Computing Surveys (CSUR), v.30 n.3es, Sept. 1998
B. N. Bershad, S. Savage, P. Pardyak, E. G. Sirer, M. E. Fiuczynski, D. Becker, C. Chambers, S. Eggers, Extensibility safety and performance in the SPIN operating system, ACM SIGOPS Operating Systems Review, v.29 n.5, p.267-283, Dec. 3, 1995
Bryan Ford, Mike Hibler, Jay Lepreau, Roland McGrath, Patrick Tullmann, Interface and execution models in the Fluke kernel, Proceedings of the third symposium on Operating systems design and implementation, p.101-115, February 1999, New Orleans, Louisiana, United States
Michael Hicks, Suresh Jagannathan, Richard Kelsey, Jonathan T. Moore, Cristian Ungureanu, Transparent communication for distributed objects in Java, Proceedings of the ACM 1999 conference on Java Grande, p.160-170, June 12-14, 1999, San Francisco, California, United States
Masanobu Yuhara, Brian N. Bershad, Chris Maeda, J. Eliot B. Moss, Efficient packet demultiplexing for multiple endpoints and large messages, Proceedings of the USENIX Winter 1994 Technical Conference, p.13-13, January 17-21, 1994, San Francisco, California
REVIEW
Reviewer: Andrew S. Tanenbaum
Most computer science research papers in the systems area focus on
the design and sometimes the implementation of the system in question.
Rarely is the performance of the resulting system measured (assuming the
system is actually built), and m