| Synchronized access to streams in SIMD vector multiprocessors |
| Full text |
Pdf
(1.07 MB)
|
| Source
|
International Conference on Supercomputing
archive
Proceedings of the 8th international conference on Supercomputing
table of contents
Manchester, England
Pages: 23 - 32
Year of Publication: 1994
ISBN:0-89791-665-4
|
|
Authors
|
|
Montse Peiron
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Campus Nerd - Mòdul D6, cr. Gran Capità s/núm, 08071- Barcelona, SPAIN
|
|
Mateo Valero
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Campus Nerd - Mòdul D6, cr. Gran Capità s/núm, 08071- Barcelona, SPAIN
|
|
Eduard Ayguadé
|
Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Campus Nerd - Mòdul D6, cr. Gran Capità s/núm, 08071- Barcelona, SPAIN
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 12, Citation Count: 2
|
|
|
ABSTRACT
The synchronized and simultaneous access to several vectors that form a single stream is typical in SIMD vector multiprocessors as well as in MIMD superscalar multiprocessors with decoupled access. In this paper we propose a block-interleaved storage scheme and an out-of-order access mechanism that allows conflict-free access to streams with an arbitrary initial address and constant stride between elements. The memory system can have any degree of unmatchness and we consider the use of either a crossbar or a multistage interconnection network. A maximal number of conflict-free families including the most commonly used strides can be obtained. We describe the hardware for address calculation and control and show that their additional costs are minimal compared with the cost of the hardware for in-order access. Finally, we evaluate the applicability of this technique to real loops from some programs of the Perfect Club and SPEC suites.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
P. Budnik and D.J. Kuck, "The Organization and Use of Parallel Memories", IEEE Trans. on Computers, vol. 20, no. 12 pp. 1566-1569, 1971.
|
| |
2
|
J. Frailong, W. Jalby and J. Lenfant "XOR-schemes: A Flexible Data Organization Parallel Memories", Int. Conf. on Parallel Processing, pp. 276-283, 1985.
|
| |
3
|
|
 |
4
|
|
 |
5
|
Mateo Valero , Tomás Lang , José M. Llabería , Montse Peiron , Eduard Ayguadé , Juan J. Navarra, Increasing the number of strides for conflict-free vector access, Proceedings of the 19th annual international symposium on Computer architecture, p.372-381, May 19-21, 1992, Queensland, Australia
|
| |
6
|
|
 |
7
|
Mateo Valero , Tomás Lang , Eduard Ayguadé, Conflict-free access of vectors with power-of-two strides, Proceedings of the 6th international conference on Supercomputing, p.149-156, July 19-24, 1992, Washington, D. C., United States
[doi> 10.1145/143369.143403]
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
D.A. Calahan and D.H. Barley, "Measurement and Analysis of Memory Conflicts on Vector Multiprocessors", Performance Evaluation of Supercomputers, Elsevier Science Publishers, pp. 83-106, 1988
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
D.A. Calahan, "Characterization of Memory Conflict Loading on the Cray-2" Int. Conf. on Parallel Processing, pp. 299- 302,1988.
|
 |
17
|
|
 |
18
|
|
 |
19
|
|
| |
20
|
H. Tamura Y. Shinnkai and F. Isobe "The Supercomputer FACOM VP System", Fujisu Techical Journal, 1985.
|
| |
21
|
|
| |
22
|
M. Peiron, M. Valero, E. Ayguade and T. Lang, "Conflict- Free Acess Streams in Multiprocessor Systems" Research Report DAC RR-93/04, 1993.
|
| |
23
|
D.H. Lawrie "Access and Afignment of Data in an Array Processor", IEEE Trans. on Computers, vol. 24, no. 12, pp. 1145-1155, 1975.
|
| |
24
|
R. Espasa et al, "Quantitative Analysis of Vector Code", Research Report CEPBA/UPC, 1994.
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE Design Automation Conference on
Gwo-Dong Chen
, Daniel D. Gajski
|