| Design and implementation of FMPL, a fast message-passing library for remote memory operations |
| Full text |
Pdf
(190 KB)
|
| Source
|
Conference on High Performance Networking and Computing
archive
Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM)
table of contents
Denver, Colorado
Pages: 15 - 15
Year of Publication: 2001
ISBN:1-58113-293-X
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 8, Citation Count: 0
|
|
|
ABSTRACT
A fast message-passing library FMPL has been designed and developed to maximize communication performance by utilizing general architectural communication support such as remote memory operations, as well as to maximize total performance by eliminating dynamic communication overhead and overlapping communication and computation. FMPL provides a low-cost general-purpose point-to-point communication and collective communication such as broadcast, barrier synchronization and reduction. On a Hitachi SR8000, FMPL achieves an 8-byte latency of 12.8μsec., while MPI achieves 20μsec. FMPL is designed for building more highly functional message-passing libraries like BLACS as well as applications that need maximum performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Jack J. Dongarra , L. S. Blackford , J. Choi , A. Cleary , E. D'Azeuedo , J. Demmel , I. Dhillon , S. Hammarling , G. Henry , A. Petitet , K. Stanley , D. Walker , R. C. Whaley, ScaLAPACK user's guide, Society for Industrial and Applied Mathematics, Philadelphia, PA, 1997
|
 |
3
|
Taisuke Boku , Ken'ichi Itakura , Hiroshi Nakamura , Kisaburo Nakazawa, CP-PACS: a massively parallel processor for large scale scientific calculations, Proceedings of the 11th international conference on Supercomputing, p.108-115, July 07-11, 1997, Vienna, Austria
[doi> 10.1145/263580.263608]
|
| |
4
|
|
| |
5
|
|
 |
6
|
Yuetsu Kodama , Hirohumi Sakane , Mitsuhisa Sato , Hayato Yamana , Shuichi Sakai , Yoshinori Yamaguchi, The EM-X parallel computer: architecture and basic performance, Proceedings of the 22nd annual international symposium on Computer architecture, p.14-23, June 22-24, 1995, S. Margherita Ligure, Italy
|
| |
7
|
|
 |
8
|
|
| |
9
|
|
 |
10
|
Hiroshi Nakamura , Taisuke Boku , Hideo Wada , Hiromitsu Imori , Ikuo Nakata , Yasuhiro Inagami , Kisaburo Nakazawa , Yoshiyuki Yamashita, A scalar architecture for pseudo vector processing based on slide-windowed registers, Proceedings of the 7th international conference on Supercomputing, p.298-307, July 19-23, 1993, Tokyo, Japan
[doi> 10.1145/165939.165998]
|
| |
11
|
netlib. http://www.netlib.org/, http://phase.hpcc.gr.jp/mirrors/netlib/.
|
 |
12
|
|
 |
13
|
Scott Pakin , Mario Lauria , Andrew Chien, High performance messaging on workstations: Illinois fast messages (FM) for Myrinet, Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), p.55-es, December 04-08, 1995, San Diego, California, United States
[doi> 10.1145/224170.224360]
|
| |
14
|
D. Sitsky and K. Hayashi. Implementing MPI for the Fujitsu AP1000/AP1000+ using polling, interrupts and remote copying. In Proceedings of Joint Symposium on Parallel Processing 1996 (JSPP'96), pages 177-184, June 1996.
|
| |
15
|
|
 |
16
|
Shinji Sumimoto , Hiroshi Tezuka , Atsushi Hori , Hiroshi Harada , Toshiyuki Takahashi , Yutaka Ishikawa, The design and evaluation of high performance communication using a Gigabit Ethernet, Proceedings of the 13th international conference on Supercomputing, p.260-267, June 20-25, 1999, Rhodes, Greece
[doi> 10.1145/305138.305202]
|
| |
17
|
Toshiyuki Takahashi , Shinji Sumimoto , Atsushi Hori , Hiroshi Harada , Yutaka Ishikawa, PM2: a high performance communication middleware for heterogeneous network environments, Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), p.16-es, November 04-10, 2000, Dallas, Texas, United States
|
 |
18
|
|
| |
19
|
|
 |
20
|
Thorsten von Eicken , David E. Culler , Seth Copen Goldstein , Klaus Erik Schauser, Active messages: a mechanism for integrated communication and computation, Proceedings of the 19th annual international symposium on Computer architecture, p.256-266, May 19-21, 1992, Queensland, Australia
|
| |
21
|
F. C. Wong and D. E. Culler. Message Passing Interface Implementation on Active Messages.http://now.CS.Berkeley.EDU/Fastcomm/MPI/.
|
|