ACM Home Page
Please provide us with feedback. Feedback
Speculation techniques for improving load related instruction scheduling
Full text PdfPdf (164 KB)
Source International Symposium on Computer Architecture archive
Proceedings of the 26th annual international symposium on Computer architecture table of contents
Atlanta, Georgia, United States
Pages: 42 - 53  
Year of Publication: 1999
ISBN:0-7695-0170-2
Also published in ...
Authors
Adi Yoaz  Intel Corporation, Intel Israel (74) Ltd., BMD Architecture Dept., MS: IDC-3C, P.O. Box 1659, Haifa 31015, Israel
Mattan Erez  Intel Corporation, Intel Israel (74) Ltd., BMD Architecture Dept., MS: IDC-3C, P.O. Box 1659, Haifa 31015, Israel
Ronny Ronen  Intel Corporation, Intel Israel (74) Ltd., BMD Architecture Dept., MS: IDC-3C, P.O. Box 1659, Haifa 31015, Israel
Stephan Jourdan  Intel Corporation, Intel Israel (74) Ltd., BMD Architecture Dept., MS: IDC-3C, P.O. Box 1659, Haifa 31015, Israel
Sponsors
IEEE-CS\TCCA : TC on Computer Arhitecture
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 50,   Citation Count: 32
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/300979.300983
What is a DOI?

ABSTRACT

State of the art microprocessors achieve high performance by executing multiple instructions per cycle. In an out-of-order engine, the instruction scheduler is responsible for dispatching instructions to execution units based on dependencies, latencies, and resource availability. Most existing instruction schedulers are doing a less than optimal job of scheduling memory accesses and instructions dependent on them, for the following reasons:• Memory dependencies cannot be resolved prior to execution, so loads are not advanced ahead of preceding stores.• The dynamic latencies of load instructions are unknown, so scheduling dependent instructions is based on either optimistic load-use delay (may cause re-scheduling and re-execution) or pessimistic delay (creating unnecessary delays).• Memory pipelines are more expensive than other execution units, and as such, are a scarce resource. Currently, an increase in the memory execution bandwidth is usually achieved through multi-banked caches where bank conflicts limit efficiency.In this paper we present three techniques to address these scheduler limitations. One is to improve the scheduling of load instructions by using a simple memory disambiguation mechanism. The second is to improve the scheduling of load dependent instructions by employing a Data Cache Hit-Miss Predictor to predict the dynamic load latencies. And the third is to improve the efficiency of load scheduling in a multi-banked cache through Cache-Bank Prediction.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
Aust97
Beke99
Chry98
 
Digi97
Digital Equipment Corporation, Maynard MA- "21164 Alpha Microprocessor Hardware Reference Manual"- Digital Equipment Corporation, 1997.
 
Fran96
Gall94
 
Hess95
J. Hesson, J. LeBlanc and S. Ciavaglia- "'Apparatus to Dynamically Control the Out-Of-Order Execution of Load-Store Instructions"- US. Patent 5,615,350 Filed Dec. 1995, Issued Mar. 1997.
Huan94
 
Hunt95
 
Inte96
Intel Corporation- "Pentium~ Pro Family Developers Manual "- lnteI Corporation, 1996
 
Jour98
S. Jourdan, R. Ronen, M. Bekerman, B. Shomar, and A. Yoaz- "A Novel Renaming Scheme to Exploit Value Temporal Locality through Physical Register Reuse and Unification"- MICRO-31, Dec. 1998.
 
Kess98
R.E. Kessler- "The Alpha 21264 Microprocessor: Out-of-Order Execution at 600 MHz"- HOT-CHIPS 10, Aug. 1998,
 
Mcfa93
S. McFarling- "'Combining Branch Predictors"- WRL Technical Note TN-36, June 1993.
Mich97
Mosh97
 
Mosh97b
 
Mowr97
 
Nico89
 
Patt95
 
Pinte96
 
Rive97
Simo95
Tuls95
 
Weis94
Wils96
 
Yeh97
Yeh and Patt- ''Two-Level Adaptive Training Branch Prediction" - ISCA-24, June 1997.

CITED BY  32

Collaborative Colleagues:
Adi Yoaz: colleagues
Mattan Erez: colleagues
Ronny Ronen: colleagues
Stephan Jourdan: colleagues