ACM Home Page
Please provide us with feedback. Feedback
Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor
Full text PdfPdf (1.48 MB)
Source International Symposium on Computer Architecture archive
Proceedings of the 23rd annual international symposium on Computer architecture table of contents
Philadelphia, Pennsylvania, United States
Pages: 191 - 202  
Year of Publication: 1996
ISBN:0-89791-786-3
Also published in ...
Authors
Dean M. Tullsen  Dept of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA
Susan J. Eggers  Dept of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA
Joel S. Emer  Digital Equipment Corporation, HLO2-3/J3, 77 Reed Road, Hudson, MA
Henry M. Levy  Dept of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA
Jack L. Lo  Dept of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA
Rebecca L. Stamm  Digital Equipment Corporation, HLO2-3/J3, 77 Reed Road, Hudson, MA
Sponsors
IEEE-CS\TCCA : TC on Computer Arhitecture
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 124,   Citation Count: 150
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/232973.232993
What is a DOI?

ABSTRACT

Simultaneous multithreading is a technique that permits multiple independent threads to issue multiple instructions each cycle. In previous work we demonstrated the performance potential of simultaneous multithreading, based on a somewhat idealized model. In this paper we show that the throughput gains from simultaneous multithreading can be achieved without extensive changes to a conventional wide-issue superscalar, either in hardware structures or sizes. We present an architecture for simultaneous multithreading that achieves three goals: (1) it minimizes the architectural impact on the conventional superscalar design, (2) it has minimal performance impact on a single thread executing alone, and (3) it achieves significant throughput gains when running multiple threads. Our simultaneous multithreading architecture achieves a throughput of 5.4 instructions per cycle, a 2.5-fold improvement over an unmodified superscalar with similar hardware resources. This speedup is enhanced by an advantage of multithreading previously unexploited in other architectures: the ability to favor for fetch and issue those threads most efficiently using the processor each cycle, thereby providing the "best" instructions to the processor.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
4
5
 
6
G.E. Daddis, Jr. and H.C. Tomg. The concurrent execution of multiple instruction streams on superscalar processors. In International Conference on Parallel Processing, pages I:76- 83, August 1991.
 
7
 
8
J. Edmondson and R Rubinfield. An overview of the 21164 AXP microprocessor. In Hot Chips VI, pages 1-8, August 1994.
 
9
 
10
 
11
 
12
B.K. Gunther. Superscalarperformance in a multithreaded microprocessor. PhD thesis, University of Tasmania, December 1993.
13
14
15
 
16
 
17
 
18
S. McFarling. Combining branch predictors. TechnicalReport TN-36, DEC-WRL, June 1993.
 
19
R.G. Prasadh and C.-L. Wu. A benchmark evaluation of a multi-threaded RISC processor architecture. In International Conference on Parallel Processing, pages I:84-91, August 1991.
 
20
Microprocessor Report, October 24 1994.
 
21
Microprocessor Report, November 14 1994.
 
22
E.G. Sirer. Measuring limits of fine-grained parallelism. Senior Independent Work, Princeton University, June 1993.
 
23
B.J. Smith. Architecture and applications ofthe HEP multiprocessor computer system. In SPIE Real Time Signal Processing /V, pages 241-248, 1981.
24
25
26
27
 
28
 
29
W. Yamamoto, M.J. Serrano, A.R. Talcott, R.C. Wood, and M. Nemirosky. Performance estimation of multistreamed, superscalar processors. In Twenty-Seventh Hawaii International Conference on System Sciences, pages I:195-204, January 1994.
30

CITED BY  150

Collaborative Colleagues:
Dean M. Tullsen: colleagues
Susan J. Eggers: colleagues
Joel S. Emer: colleagues
Henry M. Levy: colleagues
Jack L. Lo: colleagues
Rebecca L. Stamm: colleagues