| Value prediction in VLIW machines |
| Full text |
Pdf
(226 KB)
|
| Source
|
International Symposium on Computer Architecture
archive
Proceedings of the 26th annual international symposium on Computer architecture
table of contents
Atlanta, Georgia, United States
Pages: 258 - 269
Year of Publication: 1999
ISBN:0-7695-0170-2
Also published in ...
|
|
Authors
|
|
Tarun Nakra
|
Department of Computer Science, University of Pittsburgh
|
|
Rajiv Gupta
|
Department of Computer Science, University of Pittsburgh
|
|
Mary Lou Soffa
|
Department of Computer Science, University of Pittsburgh
|
|
| Sponsors |
|
| Publisher |
IEEE Computer Society
Washington, DC, USA
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 26, Citation Count: 5
|
|
|
ABSTRACT
The performance of VLIW architectures is dependent on the capability of the compiler to detect and exploit instruction-level parallelism during instruction scheduling. To exploit the detected parallelism, instructions are reordered to reduce the length of the code schedule and minimize the cycle count for execution. Code reordering is limited by the dependencies among instructions arising from both control flow and data flow. In this paper, we present the design of a VLIW architecture that uses value prediction to remove data dependencies and improve the instruction schedule. Our architecture consists of two execution engines, one for executing the original VLIW code, and the other for executing compensation code after a misprediction. Any code executed due to mispredictions is executed in parallel with the VLIW instructions. The instruction set and hardware of a traditional VLIW machine are modified accordingly to support this type of concurrent execution. The efficacy of the proposed architecture is demonstrated by implementing the prediction model in the Trimaran compiler infrastructure and studying the speedups that result due to the parallel execution of compensation code.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
The trimaran compiler research infrastructure. Tutorial Notes, November 1997.
|
| |
2
|
Brad Calder , Peter Feller , Alan Eustace, Value profiling, Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, p.259-269, December 01-03, 1997, Research Triangle Park, North Carolina, United States
|
| |
3
|
J. Fisher. Trace scheduling: a technique for global microcode compaction. IEEE Tranactions on Computers, July t981.
|
 |
4
|
Chao-Ying Fu , Matthew D. Jennings , Sergei Y. Larin , Thomas M. Conte, Value speculation scheduling for high performance processors, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, p.262-271, October 02-07, 1998, San Jose, California, United States
|
| |
5
|
F. Gabbay and A. Mendelson. Speculative execution based on value prediction. EE Department TR# t080, Technion- Israel Institute of Technology, November t996.
|
| |
6
|
|
| |
7
|
Wen-Mei W. Hwu , Scott A. Mahlke , William Y. Chen , Pohua P. Chang , Nancy J. Warter , Roger A. Bringmann , Roland G. Ouellette , Richard E. Hank , Tokuzo Kiyohara , Grant E. Haab , John G. Holm , Daniel M. Lavery, The superblock: an effective technique for VLIW and superscalar compilation, The Journal of Supercomputing, v.7 n.1-2, p.229-248, May 1993
[doi> 10.1007/BF01205185]
|
| |
8
|
V. Kathail, M. Schlansker, and B. R. Rau. Hpl playdoh architecture specification: Version 1.0. HewIett- Packard Laboratories Technical Report HPL-93-80, Computer Systems Laboratory, February 1994.
|
| |
9
|
|
 |
10
|
Mikko H. Lipasti , Christopher B. Wilkerson , John Paul Shen, Value locality and load value prediction, Proceedings of the seventh international conference on Architectural support for programming languages and operating systems, p.138-147, October 01-04, 1996, Cambridge, Massachusetts, United States
|
 |
11
|
Scott A. Mahlke , David C. Lin , William Y. Chen , Richard E. Hank , Roger A. Bringmann, Effective compiler support for predicated execution using the hyperblock, Proceedings of the 25th annual international symposium on Microarchitecture, p.45-54, December 01-04, 1992, Portland, Oregon, United States
|
| |
12
|
|
| |
13
|
|
| |
14
|
M. Schlansker, B. R. Ran, S. Mahlke, V. Kathail, R.Johnson, S.Anik, and S. G. Abraham. Achieving high levels of instruction-level parallelism with reduced hardware complexity. Hewtett-Packard Laboratories Technical Report HPL-960120, Hewlett-Packard Laboratories, February 1997.
|
| |
15
|
|
|