ACM Home Page
Please provide us with feedback. Feedback
SIMD instruction cache
Full text PdfPdf (1.06 MB)
Source ACM Symposium on Parallel Algorithms and Architectures archive
Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures table of contents
Cape May, New Jersey, United States
Pages: 67 - 75  
Year of Publication: 1994
ISBN:0-89791-671-9
Author
Todd E. Rockoff  Discipline of Computer Science, Flinders University, GPO Box 2100, Adelaide 5001 SA, AUSTRALIA
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGARCH: ACM Special Interest Group on Computer Architecture
European Comp Soc : European Computer Society
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 21,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/181014.181034
What is a DOI?

ABSTRACT

SIMD instruction cache (or I-cache) is proposed to remedy a heretofore un-compensated instruction delivery rate limitation of SIMD computers. This paper introduces the concept of SIMD I-cache and sketches the I-cache design space. On the basis of throughput using chip area as a hardware cost constraint, detailed evaluations of simple I-cache variants for a diverse set of sample problems are presented. Simple I-cache variants occupy negligible area in chips while providing significant speedups, even for problems ordinarily thought to be inherently communication-bound. These results suggest that I-cached SIMD computers exhibit the highest throughput of any multiprocessors for scalable data-parallel problems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Timothy Bridges. The GPA machine: A generally partitionable MSIMD architecture. In The Third Sympoisum on the Frontiers of Massively Parallel Computation. IEEE, October 1990.
 
3
 
4
Dan Dobberpuhl, Richard Witek, et al. A 200MHz 64b dual-issue CMOS microprocessor. IEEE Journal of Solid-State Circuits, 27(11):1555-1567, November 1992.
 
5
David C. Douglas, Brewster A. Kahle, and Alex Vasilevsky. The architecture of the CM-2 data processor. Technical Report HA88-1, Thinking Machines Corporation, February 1988.
 
6
Allan L. Fisher, Peter T. Highnam, and Todd E. Rockoff. A four-processor building block for SIMD processor arrays. 1EEE Journal o.f Solid State Circuits, 25(2):369- 375, April 1990.
 
7
Michael J. Flynn. Some computer organizations and their effectiveness. IEEE Transactions on Computers, C-21(9):948-960, September 1972.
 
8
 
9
R. Heaton, D. Blevins, and E. Davis. A bit-serial VLSI array processing chip for image processing. IEEE Journal o# Solid State Circuits, 25(2):364-368, April 1990.
10
 
11
12
13
 
14
John R. Nickons. The design of the MaaPar MP-I: A cost effective massively parallel computer, in Proceedings of IEEE Compcon Spring 1990. IEEE, February 1990.
 
15
 
16
 
17
 
18
David Shu, Lap-Wai Chow, and J. G. Nash. A content addressable, bit-serial associative processor, in Robert W. Brodersen and Howard S. Moscovitz, editors, VLS1 Signal Processing, 11I, chapter 12, pages 120-128. IEEE Press, 1988.
 
19
T. E. Williams, M. Horowitz, R. L. Alverson, and T. S. Yang. A self-timed chip for division, in Advanced Research into VLSI: The 1987 Stan}ord Conference, pages 75-95, 1987.