|
ABSTRACT
This paper investigates the limitations on designing a processor which can sustain an execution rate of greater than one instruction per cycle on highly-optimized, non-scientific applications. We have used trace-driven simulations to determine that these applications contain enough instruction independence to sustain an instruction rate of about two instructions per cycle. In a straightforward implementation, cost considerations argue strongly against decoding more than two instructions in one cycle. Given this constraint, the efficiency in instruction fetching rather than the complexity of the execution hardware limits the concurrency attainable at the instruction level.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
Acos 86
|
|
| |
Aho 86
|
Alfred V. Aho , Ravi Sethi , Jeffrey D. Ullman, Compilers: principles, techniques, and tools, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1986
|
| |
Apol 88
|
Apollo Computer Inc. Marketing Brochure. The series 10000 Personal Supercomputer. Chelmsford. MA, 1988.
|
| |
Fost 72
|
C.C. Foster and E.M. Riseman, "Percolation of Code to Enhance Parallel Dispatching and Execution IEEE Transactions on. Computers, Vol. C-21 (December 1972), pp. 1411-1415.
|
| |
Henn 86
|
J.L. Hennessy, "RISC-Based Processors: Concepts and Prospects". New Froniiers in Corn p,ler Architecture Conference Proceedings (hlarch 1986), pp. 95- 103.
|
 |
Kell 75
|
|
| |
Kuck 72
|
D.J. Kuck. Y. Muraol,'a, and S. C. hen, "On the Number of Opera. Lions Simultaneously Executable in Fortran-like Programs and Their resulting Speedup". IEEE Transaciions on computers, Vol. C-21 (December 1972), pp. 1293--1310.
|
| |
Lee 84
|
J.K.F. Lee and A.J. Smith, "Branch Prediction Strategies and Branch Target Buffer Design" IEEE Computer (January 1984). pp. 6-22.
|
| |
Logr 72
|
L. Logrippo. "Renamillg in Program Schemas". Proceedings of the IEEE 13th Annual Symposium on switching and Automata Theory. (October 1972), Pp. 67- 70.
|
 |
McFa 86
|
|
| |
MIPS 86
|
MIPS Computer Systems, Inc., MIPS Language Programmers Guide (1986).
|
| |
Nico 84
|
A. Nicolau and J.A. Fisher, "Mea.suring tile Parallelism Availa.ble for Very Long Instruction Word Architectures". IEEE Transactions on Computers. vol. C-33 (November 1984), pp. 968-976.
|
| |
Rise 72
|
E.hl. Riselnan and C.C. Foster, "The Inhibition of Potential Parallelism by Conditional Pumps". IEEE Transactions on Computers, Vol. C-21 (December 1972), pp. 1405-1411.
|
| |
Slav 88
|
G.A. Slaxenburg, Phillips Research Laboratories Sunnyvale, Signetics Corporation, Sunnyvale, CA. Personal Correspondence, 12 May 1988.
|
 |
Smit 87
|
J. E. Smith , G. E. Dermer , B. D. Vanderwarn , S. D. Klinger , C. M. Rozewski, The ZS-1 central processor, Proceedings of the second international conference on Architectual support for programming languages and operating systems, p.199-204, October 1987, Palo Alto, California, United States
|
 |
Sohi 87
|
G. S. Sohi , S. Vajapeyam, Instruction issue logic for high-performance, interruptable pipelined processors, Proceedings of the 14th annual international symposium on Computer architecture, p.27-34, June 02-05, 1987, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/30350.30354]
|
| |
Tjad 70
|
G.S. Tjaden and M.J. Flynn, "Detection and Parallel Execution of Independent. Instructions". IEEE Transacactions on Computers, Vol. C-19 (October 1970), pp. 889-895.
|
| |
Toma 67
|
R.M. Tomasulo, "An Efficient Algorithm for Exploiting Multiple Arithmetic Units'. IBM Journal, Vol. 11 (January 1967), pp. 25-33.
|
| |
Weis 84
|
S. Weiss and J.E. Smith, "Instrtlction Issue Logic in Pipelined Supercomputers". IEEE Transaction"s on Computer's, Vol. C-33 (November 1984), pp. 1013-1022.
|
CITED BY 51
|
|
Pohua P. Chang , William Y. Chen , Scott A. Mahlke , Wen-mei W. Hwu, Comparing static and dynamic code scheduling for multiple-instruction-issue processors, Proceedings of the 24th annual international symposium on Microarchitecture, p.25-33, September 1991, Albuquerque, New Mexico, Puerto Rico
|
|
|
|
|
|
|
|
|
Dean M. Tullsen , Susan J. Eggers , Joel S. Emer , Henry M. Levy , Jack L. Lo , Rebecca L. Stamm, Exploiting choice: instruction fetch and issue on an implementable simultaneous multithreading processor, ACM SIGARCH Computer Architecture News, v.24 n.2, p.191-202, May 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Mayan Moudgill , Keshav Pingali , Stamatis Vassiliadis, Register renaming and dynamic speculation: an alternative approach, Proceedings of the 26th annual international symposium on Microarchitecture, p.202-213, December 01-03, 1993, Austin, Texas, United States
|
|
|
|
|
|
Michael Butler , Tse-Yu Yeh , Yale Patt , Mitch Alsup , Hunter Scales , Michael Shebanow, Single instruction stream parallelism is greater than two, ACM SIGARCH Computer Architecture News, v.19 n.3, p.276-286, May 1991
|
|
|
Alessandra Costa , Alessandro De Gloria , Paolo Faraboschi , Mauro Olivieri, An analysis of dynamic scheduling techniques for symbolic applications, Proceedings of the 26th annual international symposium on Microarchitecture, p.185-191, December 01-03, 1993, Austin, Texas, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Pohua P. Chang , Scott A. Mahlke , William Y. Chen , Nancy J. Warter , Wen-mei W. Hwu, IMPACT: an architectural framework for multiple-instruction-issue processors, 25 years of the international symposia on Computer architecture (selected papers), p.408-417, June 27-July 02, 1998, Barcelona, Spain
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Roger A. Bringmann , Scott A. Mahlke , Richard E. Hank , John C. Gyllenhaal , Wen-mei W. Hwu, Speculative execution exception recovery using write-back suppression, Proceedings of the 26th annual international symposium on Microarchitecture, p.214-223, December 01-03, 1993, Austin, Texas, United States
|
|
|
Scott A. Mahlke , Richard E. Hank , Roger A. Bringmann , John C. Gyllenhaal , David M. Gallagher , Wen-mei W. Hwu, Characterizing the impact of predicated execution on branch prediction, Proceedings of the 27th annual international symposium on Microarchitecture, p.217-227, November 30-December 02, 1994, San Jose, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Pradeep K. Dubey , Kevin O'Brien , Kathryn M. O'Brien , Charles Barton, Single-program speculative multithreading (SPSM) architecture: compiler-assisted fine-grained multithreading, Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, p.109-121, June 27-29, 1995, Limassol, Cyprus
|
|
|
Kevin Skadron , Pritpal S. Ahuja , Margaret Martonosi , Douglas W. Clark, Branch Prediction, Instruction-Window Size, and Cache Size: Performance Trade-Offs and Simulation Techniques, IEEE Transactions on Computers, v.48 n.11, p.1260-1281, November 1999
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|