| Performance benefits of large execution atomic units in dynamically scheduled machines |
| Full text |
Pdf
(672 KB)
|
| Source
|
International Conference on Supercomputing
archive
Proceedings of the 3rd international conference on Supercomputing
table of contents
Crete, Greece
Pages: 427 - 432
Year of Publication: 1989
ISBN:0-89791-309-4
|
|
Authors
|
|
Stephen W. Melvin
|
Computer Science Division, University of California, Berkeley, CA
|
|
Yale N. Patt
|
Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 22, Citation Count: 8
|
|
|
ABSTRACT
In this paper we identify three types of atomic units, or indivisible units of work: architectural atomic units (defined by architecture level interrupts and exceptions), compiler atomic units (defined by compiler code generation) and execution atomic units (defined by run-time interruptibility). We discuss trade-offs for these units and show that size has different performance implications depending on the atomic unit. We simulate a number of different implementations of the VAX architecture, focusing on different execution atomic unit sizes. We show that significant performance benefits can be achieved by having large execution atomic units in dynamically scheduled machines.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. A. Fisher, "Trace Scheduling' ATe, l,,t,que for Global Microcode Compaction," {EEl"_.' Transactions on Computers, vo{, 0-30, rio. 7, July 1981.
|
| |
2
|
|
 |
3
|
|
| |
4
|
S. W. Melvin , M. C. Shebanow , Y. N. Patt, Hardware support for large atomic units in dynamically scheduled machines, Proceedings of the 21st annual workshop on Microprogramming and microarchitecture, p.60-63, November 28-December 02, 1988, San Diego, California, United States
|
 |
5
|
Y. N. Patt , W. M. Hwu , M. Shebanow, HPS, a new microarchitecture: rationale and introduction, Proceedings of the 18th annual workshop on Microprogramming, p.103-108, December 03-06, 1985, Pacific Grove, California, United States
|
| |
6
|
Yale N. P~tt, Michael C. Shebanow. Wen-reel Hwu and Stephen W. Melvin, "A C Conlpiler for HPS 1, A Highly ParMle{ Execution Engine," Proceedings, 19tb Hawaii fnternational Conferemce on System Sciences, Honolulu, HI..january 1986.
|
| |
7
|
George R~ai-, "The 801 Minicomputer," IBM JournM of Research and Development. Vol. 27, No. 3, May 1983, pp. 2aZ-'246.
|
| |
8
|
R. M. Tomas.ulo, "An Efficient Algorithm fo~ Exploiting Multiple Arithmetic Units," fBM Journal of Research and Development, Vol. 11, 1967, pp. 25-33.
|
CITED BY 8
|
|
Jared Stark , Paul Racunas , Yale N. Patt, Reducing the performance impact of instruction cache misses by writing instructions into the reservation stations out-of-order, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.34-43, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
|
|
Daniel Holmes Friendly , Sanjay Jeram Patel , Yale N. Patt, Alternative fetch and issue policies for the trace cache fetch mechanism, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.24-33, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
|
|
|
|
|
|
|
|
Karthikeyan Sankaralingam , Ramadass Nagarajan , Haiming Liu , Changkyu Kim , Jaehyuk Huh , Nitya Ranganathan , Doug Burger , Stephen W. Keckler , Robert G. McDonald , Charles R. Moore, TRIPS: A polymorphous architecture for exploiting ILP, TLP, and DLP, ACM Transactions on Architecture and Code Optimization (TACO), v.1 n.1, p.62-93, March 2004
|
|
|
|
|
|
Roni Rosner , Micha Moffie , Yiannakis Sazeides , Ronny Ronen, Selecting long atomic traces for high coverage, Proceedings of the 17th annual international conference on Supercomputing, June 23-26, 2003, San Francisco, CA, USA
|
|
|
Yale N. Patt , Sanjay J. Patel , Marius Evers , Daniel H. Friendly , Jared Stark, One Billion Transistors, One Uniprocessor, One Chip, Computer, v.30 n.9, p.51-57, September 1997
|
|