|
ABSTRACT
In parallel matrix manipulation operations, some data patterns need to be accessed in one memory cycle without conflict. Investigating the frequently used data patterns, we propose a powerful skewing scheme which allows most frequently used data patterns of N elements, including rows, columns, diagonals, blocks with various shapes, points scattered over various blocks, and chessboards with various shapes, to be accessed in one memory cycle. We also propose simple methods to combine different skewing schemes into a single parallel storage system such that all the frequently used data patterns of N elements (the above patterns plus folded lines, two-pairs, and column-pairs) can be accessed in one memory cycle. The storage sytem uses N memory modules where N is any (even or odd) power of two. Address generation in the system need only exclusive-or operations and can be completed in constant time. The storage scheme allows different sized matrices to be processed efficiently on large scale systems by using the skewing scheme designed according to the size of the system, i.e. address generation mechanism is independent of the size of the matrices to be processed. Data alignment requirements (connecting each memory module to a proper processor) can be easily realized on a general-purpose interconnecting network, such as a hypercube.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Balakrishnan, R. Jain, and C. S. Raghavendra. On array storage for conflict-free memory access for parallel processors, in Proc. of lhe 1988 International Conference on Parallel Processing, volume I, pages 103-107, Aug. 1988.
|
| |
2
|
K. Batcher. The multidimensional access memory in STARAN. IEEE Trans. on Computers, C-26(2):174-177, February 1977.
|
| |
3
|
B. V. Boppana and C. S. Raghavendra. Efficient storage schemes for arbitrary size square matrices in parallel processors with shuffle-exchange networks, in Proc. of the 1991 International Conference on Parallel Processing, volume I, pages 365-368, Aug. 1991.
|
| |
4
|
P. Budnik and D. J. Kuck. The organization and use of parallel memories. IEEE Trans. on Computers, C-20(12):1566-1569, Dec. 1971.
|
| |
5
|
B. Chor, C. E. Leiserson, and R. L. Rivest. All a,pplication of number theory to the organizatiol~ of raster-graphics memory. In Proc. of lhe 23rd Annual IEEE Symposium on Foundations of Computer Science, pages 92-99, 1982.
|
| |
6
|
J. M. Frailong, W. Jalby, and J. Lenfant. XOR- Schemes: A flexible data organization in parallel memories. In Proc. of the 1985 lnternatzonal Conference on Parallel Processing, pages 276- 283, Aug. 1985.
|
| |
7
|
|
| |
8
|
D. Harper III. Address transformation to increase memory performance. In PTvc. of the 1989 h~lernational Conference on Parallel Processing, pages 237- 241, Aug. 1989.
|
 |
9
|
|
| |
10
|
|
| |
11
|
K. Kim and V. K. P. Kumar. Parallel memory systems for image processing. In Proc. of the 1989 Conference on Compuler vision and Patter~ Recognition (CVPR'89), pages 654-659, 1989.
|
 |
12
|
|
| |
13
|
D. H. Lawrie. Access and alignment of data in array processors. IEEE Trans. on Computers, C- 24(12):1145-1155, Dec. 1975.
|
| |
14
|
D. H. Lawrie and C. R. Vora. The prime memory system for array access. IEEE Trans. on Computers, C-31(5):435-442, May 1982.
|
| |
15
|
|
| |
16
|
|
| |
17
|
D. Lee and M. Aboelaze. An efficient method for distributing data in hypercube computers. In Proc. of the 5th D~strzbuted Memory Computing Conference, pages 858-866, April 1990.
|
| |
18
|
|
| |
19
|
A. Norton and E. Melton. A class of Boolean linear transformations for conflict-free power-of-two stride access, in Proc. of the 1987 International Conference on Parallel Processing, pages 247- 254, Aug. 1987.
|
| |
20
|
|
| |
21
|
C. S. Raghavendra and R. Boppana. On methods for fast and efficient parallel memory access. In Proc. of 1he 1990 international Conference on Parallel Processing, volume I, pages 76- 83, Aug. 1990.
|
| |
22
|
D. V. Voorhis and T. H. Morrin. Memory systems for image processing. IEEE Trans. on Computers, C-27(2):113-125. February 1978.
|
| |
23
|
|
|