| Vector multiprocessors with arbitrated memory access |
| Full text |
Pdf
(906 KB)
|
| Source
|
International Symposium on Computer Architecture
archive
Proceedings of the 22nd annual international symposium on Computer architecture
table of contents
S. Margherita Ligure, Italy
Pages: 243 - 252
Year of Publication: 1995
ISBN:0-89791-698-0
Also published in ...
|
|
Authors
|
|
Montse Peiron
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, c/ Gran Capità s/n, Mòdul D6, 08071 - Barcelona, Spain
|
|
Mateo Valero
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, c/ Gran Capità s/n, Mòdul D6, 08071 - Barcelona, Spain
|
|
Eduard Ayguadé
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, c/ Gran Capità s/n, Mòdul D6, 08071 - Barcelona, Spain
|
|
Tomás Lang
|
Department of Electrical and Computer Engineering, University of California at Irvine
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 21, Citation Count: 4
|
|
|
ABSTRACT
The high latency of memory accesses is one of the factors that most contribute to reduce the performance of current vector supercomputers. The conflicts that can occur in the memory modules plus the collisions in the interconnection network in the case of multiprocessors make that the execution time of applications increases significantly. In this work we propose a memory access method that for both cases of vector uniprocessors and multiprocessors allows to perform stream accesses with the smallest possible latency in the majority of the cases. The basic idea is to arbitrate the memory access by defining the order in which the memory modules are visited. The stream elements are requested out of order. In addition, the access method also reduces the cost of the interconnection network.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
R M. Kogge, "The Architecture of Pipelined Computers", McGraw-Hill, New York, 1981.
|
| |
2
|
E Budnik and D. J. Kuck, "The Organization and Use of Parallel Memories", IEEE Trans. on Computers, vol. 20, no. 12, pp. 1566-1569, 1971.
|
| |
3
|
J. Frailong, W. Jalby and J. Lenfant, "XOR-schemes: A Flexible Data Organization in Parallel Memories", Int. Conf. on Parallel Processing, pp. 276-283, 1985.
|
| |
4
|
D.T. Harper III, "Address Transformations to Increase Memory Performance", Int. Conf. on Parallel Processing, pp. 237-241, 1989.
|
 |
5
|
|
 |
6
|
Mateo Valero , Tomás Lang , José M. Llabería , Montse Peiron , Eduard Ayguadé , Juan J. Navarra, Increasing the number of strides for conflict-free vector access, Proceedings of the 19th annual international symposium on Computer architecture, p.372-381, May 19-21, 1992, Queensland, Australia
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
D.A. Calahan and D.H. Bailey, "Measurement and Analysis of Memory Conflicts on Vector Multiprocessors", Performance Evaluation of Supercomputers, Elsevier Science Publishers, pp. 83- 106, 1988.
|
| |
12
|
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
D.A. Calahan, "Characterization of Memory Conflict Loading on the Cray-2", Int. Conf. on Parallel Processing, pp. 299-302, 1988.
|
 |
18
|
|
 |
19
|
|
| |
20
|
|
 |
21
|
|
CITED BY 4
|
|
|
|
|
Binu K. Mathew , Sally A. McKee , John B. Carter , Al Davis, Algorithmic foundations for a parallel vector access memory system, Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures, p.156-165, July 09-13, 2000, Bar Harbor, Maine, United States
|
|
|
|
|
|
|
|