| Dynamic addressing memory arrays with physical locality |
| Full text |
Publisher Site
,
Pdf
(907 KB)
|
| Source
|
International Symposium on Microarchitecture
archive
Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
table of contents
Istanbul, Turkey
SESSION: Register file and memory system design
table of contents
Pages: 161 - 170
Year of Publication: 2002
ISBN ~ ISSN:1072-4451 , 0-7695-1859-1
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
IEEE Computer Society Press
Los Alamitos, CA, USA
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 9, Citation Count: 1
|
|
|
ABSTRACT
As pipeline width and depth grow to improve performance, memory arrays in microprocessors are growing in entries and ports. Arrays will increase in physical size, which prolongs the access time due to wiring delay. In order to boost clock frequency, these memory arrays must take multiple cycles to complete an access. This delays the scheduling of dependent instructions and affects overall performance. This paper proposes a different circuit organization to enable fast and slow accesses solely dependent on physical locality. Since the access time depends on a fixed physical location, it is pre-determined to scheduling dependent instructions. Furthermore, this paper presents a mechanism to re-configure the address decoding of the physical register file to increase the occurrence of fast access. Detailed circuit simulation using this proposed method determines the access cycle time. Reduction in average access cycle time for the register file and the first level data cache recovers 73% of the IPC degradation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Bohr, "Interconnect Scaling -- The Real Limiter to High Performance ULSI," 1995 International Electron Devices Meeting, pp. 241--144.
|
| |
2
|
R. Ho, K. Mai, M. Horowitz, "The Future of Wires," The Proceedings of the IEEE, Vol. 89, No. 4, April 2001.
|
 |
3
|
Vikas Agarwal , M. S. Hrishikesh , Stephen W. Keckler , Doug Burger, Clock rate versus IPC: the end of the road for conventional microarchitectures, Proceedings of the 27th annual international symposium on Computer architecture, p.248-259, June 2000, Vancouver, British Columbia, Canada
|
 |
4
|
|
 |
5
|
M. S. Hrishikesh , Doug Burger , Norman P. Jouppi , Stephen W. Keckler , Keith I. Farkas , Premkishore Shivakumar, The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays, Proceedings of the 29th annual international symposium on Computer architecture, p.14, May 25-29, 2002, Anchorage, Alaska
|
 |
6
|
|
| |
7
|
|
 |
8
|
Javier Zalamea , Josep Llosa , Eduard Ayguadé , Mateo Valero, Two-level hierarchical register file organization for VLIW processors, Proceedings of the 33rd annual ACM/IEEE international symposium on Microarchitecture, p.137-146, December 2000, Monterey, California, United States
[doi> 10.1145/360128.360143]
|
 |
9
|
|
| |
10
|
|
| |
11
|
Doug Burger, Todd M. Austin, etc., "SimpleScalar toolset 3.0b," http://www.simplescalar.com.
|
| |
12
|
SPEC 2000 benchmarks suites, http://www.spec.org.
|
|