ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Decoupling local variable accesses in a wide-issue superscalar processor
Full text PdfPdf (671 KB)
Source International Symposium on Computer Architecture archive
Proceedings of the 26th annual international symposium on Computer architecture table of contents
Atlanta, Georgia, United States
Pages: 100 - 110  
Year of Publication: 1999
ISBN:0-7695-0170-2
Also published in ...
Authors
Sangyeun Cho  MCU Team, System LSI Div., Samsung Electronics Co., Yongin-City, Korea
Pen-Chung Yew  Dept. of Comp. Sci. and Eng., University of Minnesota, Minneapolis, MN
Gyungho Lee  Division of Engineering, Univ. of Texas at San Antonio, San Antonio, TX
Sponsors
IEEE-CS\TCCA : TC on Computer Arhitecture
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
IEEE Computer Society  Washington, DC, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 29,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/300979.300988
What is a DOI?

ABSTRACT

Providing adequate data bandwidth is extremely important for a wide-issue superscalar processor to achieve its full performance potential. Adding a large number of ports to a data cache, however, becomes increasingly inefficient and can add to the hardware complexity significantly. This paper takes an alternative or complementary approach for providing more data bandwidth, called the data-decoupled architecture. The approach, with support from the compiler and/or hardware, partitions the memory stream into two independent streams early in the processor pipeline, and feeds each stream to a separate memory access queue and cache. Under this model, the paper studies the potential of decoupling memory accesses to program's local variables that are allocated on the run-time stack. Using a set of integer and floating-point programs from the SPEC95 benchmark suite, it is shown that local variable accesses constitute a large portion of all the memory references, while their reference space is very small, averaging around 7 words per (static) procedure. To service local variable accesses quickly, two optimizations, fast data forwarding and access combining, are proposed and studied. Some of the important design parameters, such as the cache size, the number of cache ports, and the degree of access combining, are studied based on simulations. The potential performance of the proposed scheme is measured using various configurations, and it is concluded that the scheme can become a viable alternative to building a single multi-ported data cache.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
D. Burger and T. M. Austin, "The SimpleScalar Tool Set, Version 2.0~" Computer Sciences Department Technical Report, No. 1342, Univ, of Wisconsin, June 1997.
4
 
5
S Cho, P.-C. Yew, and G. Lee. "Decoupling Local Variable Accesses in a Wide-Issue Superscalar Processor," Technical Repoa #98-020, Dept, of Computer Sci. and Eng., Univ, of Minnesota, May 1998.
 
6
S. Cho, P.-C. Yew, and G. Lee. "'Access Region Locality for High- Bandwidth Processor Memory System Design," Technical Report #99-004, Dept. of Computer Sci. and Eng., Univ. of Minnesota, Feb. 1999.
7
8
 
9
 
10
EGCS Projecl. ht tp : / / egcs. cygnus, com.
11
 
12
M.J. Flynn and L. W. Hoevel. "Execution Architecture: The DEL- tran Experiment," IEEE Trans. on Computers, C-32(2): 156-175, Feb. 1983.
 
13
L. Gwennap. "'Digital 21264 Sets New Standard," Microprocessor Report, Volume 10, Issue 14, Oct. 1996.
 
14
 
15
16
 
17
18
 
19
 
20
D.A. Patterson and C. H. Sequin. "A VLSI RISC," IEEE Computer, pp. 8 - 2I, Sept. 1982.
 
21
 
22
 
23
 
24
25
26
 
27
The Standard Performance Evaluation Corporation, http: //www. specbench, org.
 
28
Y. Tamir and C. H. Sequin. "'Strategies for Managing the Register File in RISC," IEEE Trans. on Computers, C-32(11): 977 - 989, Nov. 1983.
 
29
 
30
31
32
 
33
34

CITED BY  16

Collaborative Colleagues:
Sangyeun Cho: colleagues
Pen-Chung Yew: colleagues
Gyungho Lee: colleagues