|
ABSTRACT
One critical aspect in designing set-associative cache at high clock rate is deriving timely results from directory lookup. In this paper we investigate the possibility of accurately approximating the results of conventional directory search with faster matches of few partial address bits. Such fast and accurate approximations may be utilized to optimize cache access timing, particularly in a customized design environment. Through analytic and simulation studies we examine the trade-offs of various design choices. We also discuss few other applications of partial address matching to computer designs.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
C.J. Conti, "Concepts for Buffer Storage," IEEE Comp. Group News, 2, 8, (March 1969), pp. 9-13.
|
| |
2
|
A. J. Smith, "A Comparative Study of Set Associative Memory Mapping Algorithms and Their Use for Cache and Main Memory," IEEE Trans. on Software Engineering, SE-4, 2 (March 1978).
|
 |
3
|
|
| |
4
|
|
 |
5
|
S. Prybylski , M. Horowitz , J. Hennessy, Performance tradeoffs in cache design, Proceedings of the 15th Annual International Symposium on Computer architecture, p.290-298, May 30-June 02, 1988, Honolulu, Hawaii, United States
|
| |
6
|
T. Wada, S. Rajan and S. A. Pryzybylski, "An Analytic Access Time Model for On-Chip Cache Memories," IEEE Journal of Sohd-State Czrcu~ts, Vol. 27, No. 8 (August 1992) pp. 1147-1156.
|
| |
7
|
J. J. Losq, G. S. Rao and H. E. Sachar, "Decode History Table for Conditional Branch instructions,'' U.S. Patent No. 4,477,872 (October 16, 1984).
|
 |
8
|
|
| |
9
|
L. Liu, "Partial Address Directory for Cache Access,'' IBM Research Report, RC18803 (February 1993). Also published in IEEE Trans. on VLSI Systems, Vol. 2, No. 2 (June 1994) pp. 226-240.
|
| |
10
|
J. M. Lee and A. Weinberger, "A Solution to the Synonym Problem," IBM Technical D~sclosure Bulletin, 22, 8A (January 1980), pp. 3331-3333.
|
| |
11
|
"3033 Processor Complex, Theory of Operation/Diagrams Manual (Vol. 4), Processor Storage Control Function (15SCF),'' IBM, Poughkeepsie, New York (1978).
|
 |
12
|
|
 |
13
|
J. H. Chang , H. Chao , K. So, Cache design of a sub-micron CMOS system/370, Proceedings of the 14th annual international symposium on Computer architecture, p.208-213, June 02-05, 1987, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/30350.30374]
|
| |
14
|
K. Hun, A. Hunt, L. Liu, J. Peir, D. Pruett and J. Temple, "Early Resolution of Address Translation in Cache Design," Proc. 1990 IEEE International Conf. on Computer Design (September 1990)
|
| |
15
|
S. G. Tucker, "The IBM 3090 Systems: An Overview," IBM Systems Journal, 25, 6 (January 1986).
|
 |
16
|
|
CITED BY 10
|
|
|
|
|
|
|
|
|
|
|
Jih-Kwon Peir , Shih-Chang Lai , Shih-Lien Lu , Jared Stark , Konrad Lai, Bloom filtering cache misses for accurate data speculation and prefetching, Proceedings of the 16th international conference on Supercomputing, June 22-26, 2002, New York, New York, USA
|
|
|
|
|
|
Chuanjun Zhang , Frank Vahid , Jun Yang , Walid Najjar, A way-halting cache for low-energy high-performance systems, Proceedings of the 2004 international symposium on Low power electronics and design, August 09-11, 2004, Newport Beach, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|