ACM Home Page
Please provide us with feedback. Feedback
Recycling waste: exploiting wrong-path execution to improve branch prediction
Full text PdfPdf (311 KB)
Source International Conference on Supercomputing archive
Proceedings of the 17th annual international conference on Supercomputing table of contents
San Francisco, CA, USA
SESSION: Processor microarchitecture I table of contents
Pages: 12 - 21  
Year of Publication: 2003
ISBN:1-58113-733-8
Authors
Haitham Akkary  Microprocessor Research Lab, Intel Corporation
Srikanth T. Srinivasan  Microprocessor Research Lab, Intel Corporation
Konrad Lai  Microprocessor Research Lab, Intel Corporation
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 25,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/782814.782819
What is a DOI?

ABSTRACT

Despite continuous improvement in branch prediction algorithms, branch misprediction remains a major limitation on microprocessor performance. As pipelines are widened or stretched deeper, branch prediction will become even more crucial. This paper taps into a currently wasted resource, wrong-path execution, to help improve branch prediction. Due to control independence, often the outcomes of branches that are executed along the wrong-path match the outcomes on the correct-path. Current branch prediction methods rely on correlation between branches on the correct path, therefore leaving potentially useful wrong-path branch information unexploited. We present in this paper a new, very simple, and very effective method that extends branch prediction to allow the recycling of wrong-path branch outcomes at the fetch stage. Simulations of deeply pipelined processors using a selected set of SpecInt 2000 and other benchmarks, with more than 5 branch mispredictions per thousand micro-operations, show that branch misprediction rate can be reduced by up to 30%. Depending on the pipeline depth, the corresponding average performance improvement varies from 5% to 20%.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
2
3
 
4
R. S. Chappell, J. Stark, S. Kim, S. Reinhardt, and Y. N. Patt. Simultaneous Subordinate Microthreading (SSMT). In Proceedings of the 28th Annual International Symposium on Computer Architecture, 2001.
 
5
6
 
7
8
9
 
10
G. Hinton, D. Sager, M. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Roussel. The Microarchitecture of the Pentium® 4 Processor. In Intel Technical Journal, Q1 2001 Issue.
 
11
12
 
13
S. McFarling. Combining Branch Predictors. Technical Report TN-36, Digital Western Research Laboratory, June 1993.
 
14
O. Mutlu, J. Stark, C. Wilkerson, and Y. N. Patt. Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-Order Processors. In Proceedings of the 8th International Symposium on High Performance Computer Architecture, 2002.
15
16
17
18
19
20


Collaborative Colleagues:
Haitham Akkary: colleagues
Srikanth T. Srinivasan: colleagues
Konrad Lai: colleagues