| SIMD instruction cache |
| Full text |
Pdf
(1.06 MB)
|
| Source
|
ACM Symposium on Parallel Algorithms and Architectures
archive
Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures
table of contents
Cape May, New Jersey, United States
Pages: 67 - 75
Year of Publication: 1994
ISBN:0-89791-671-9
|
|
Author
|
|
Todd E. Rockoff
|
Discipline of Computer Science, Flinders University, GPO Box 2100, Adelaide 5001 SA, AUSTRALIA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 21, Citation Count: 0
|
|
|
ABSTRACT
SIMD instruction cache (or I-cache) is proposed to remedy a heretofore un-compensated instruction delivery rate limitation of SIMD computers. This paper introduces the concept of SIMD I-cache and sketches the I-cache design space. On the basis of throughput using chip area as a hardware cost constraint, detailed evaluations of simple I-cache variants for a diverse set of sample problems are presented. Simple I-cache variants occupy negligible area in chips while providing significant speedups, even for problems ordinarily thought to be inherently communication-bound. These results suggest that I-cached SIMD computers exhibit the highest throughput of any multiprocessors for scalable data-parallel problems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Timothy Bridges. The GPA machine: A generally partitionable MSIMD architecture. In The Third Sympoisum on the Frontiers of Massively Parallel Computation. IEEE, October 1990.
|
| |
3
|
|
| |
4
|
Dan Dobberpuhl, Richard Witek, et al. A 200MHz 64b dual-issue CMOS microprocessor. IEEE Journal of Solid-State Circuits, 27(11):1555-1567, November 1992.
|
| |
5
|
David C. Douglas, Brewster A. Kahle, and Alex Vasilevsky. The architecture of the CM-2 data processor. Technical Report HA88-1, Thinking Machines Corporation, February 1988.
|
| |
6
|
Allan L. Fisher, Peter T. Highnam, and Todd E. Rockoff. A four-processor building block for SIMD processor arrays. 1EEE Journal o.f Solid State Circuits, 25(2):369- 375, April 1990.
|
| |
7
|
Michael J. Flynn. Some computer organizations and their effectiveness. IEEE Transactions on Computers, C-21(9):948-960, September 1972.
|
| |
8
|
|
| |
9
|
R. Heaton, D. Blevins, and E. Davis. A bit-serial VLSI array processing chip for image processing. IEEE Journal o# Solid State Circuits, 25(2):364-368, April 1990.
|
 |
10
|
|
| |
11
|
|
 |
12
|
T. Kondo , T. Tsuchiya , T. Kitamura , Y. Sugiyama , T. Kimura, Pseudo MIMD array processor—AAP2, Proceedings of the 13th annual international symposium on Computer architecture, p.330-337, June 02-05, 1986, Tokyo, Japan
|
 |
13
|
|
| |
14
|
John R. Nickons. The design of the MaaPar MP-I: A cost effective massively parallel computer, in Proceedings of IEEE Compcon Spring 1990. IEEE, February 1990.
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
David Shu, Lap-Wai Chow, and J. G. Nash. A content addressable, bit-serial associative processor, in Robert W. Brodersen and Howard S. Moscovitz, editors, VLS1 Signal Processing, 11I, chapter 12, pages 120-128. IEEE Press, 1988.
|
| |
19
|
T. E. Williams, M. Horowitz, R. L. Alverson, and T. S. Yang. A self-timed chip for division, in Advanced Research into VLSI: The 1987 Stan}ord Conference, pages 75-95, 1987.
|
|