ACM Home Page
Please provide us with feedback. Feedback
Optimal prefetching via data compression
Full text PdfPdf (565 KB)
Source Journal of the ACM (JACM) archive
Volume 43 ,  Issue 5  (September 1996) table of contents
Pages: 771 - 793  
Year of Publication: 1996
ISSN:0004-5411
Authors
Jeffrey Scott Vitter  Duke Univ., Durham, NC
P. Krishnan  Bell Labs, Holmdel, NJ
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 53,   Citation Count: 28
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/234752.234753
What is a DOI?

ABSTRACT

Caching and prefetching are important mechanisms for speeding up access time to data on secondary storage. Recent work in competitive online algorithms has uncovered several promising new algorithms for caching. In this paper, we apply a form of the competitive philosophy for the first time to the problem of prefetching to develop an optimal universal prefetcher in terms of fault rate, with particular applications to large-scale databases and hypertext systems. Our prediction algorithms with particular applications to large-scale databases and hypertext systems. Our prediction algorithms for prefetching are novel in that they are based on data compression techniques that are both theoretically optimal and good in practice. Intuitively, in order to compress data effectively, you have to be able to predict future data well, and thus good data compressors should be able to predict well for purposes of prefetching. We show for powerful models such as Markov sources and mthe order Markov sources that the page fault rate incurred by our prefetching algorithms are optimal in the limit for almost all sequences of page requests.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
AMIT, Y., AND MILLER, M. 1990. Large deviations for coding Markov chains and Gibbs random ~ field~s. Tech. Rep. Washington Univ.
 
3
 
4
BLACKWELL, D. 1956. An analog to the minimax theorem for vector payoffs. Pac. J. Math. 6, 1-8.
 
5
6
7
8
 
9
10
 
11
COVER, r. M., AND SHENHAR, A. 1977. Compound Bayes predictors with apparent Markov ~ structure. IEEE Trans. Syst. Man Cyb. SMC-7 (June), 421-424.
12
 
13
FEDER, M., MERHAV, N., AND GUTMAN, M. 1992. Universal prediction of individual sequences. ~ IEEE Trans. Inf. Theory IT-38 (July), 1258-1270.
 
14
 
15
 
16
HANNAN, J.F. 1957. Approximation to Bayes risk in repeated plays. In Contributions to the Theory ~ of Games, Vol. 3, Annals of Mathematical Studies. Princeton, N.J., 97-139.
 
17
 
18
 
19
KARLIN, A. R., PHILLIPS, S. J., AND RAGHAVAN, P. 1992. Markov paging. In Proceedings of the 33rd ~ Annual IEEE Conference on Foundations of Computer Science (Oct.). IEEE, New York, pp. ~ 208 -217.
 
20
KARLIN, S., AND TAYLOR, H.M. 1975. A First Course in Stochastic Processes, 2nd ed., Academic ~Press, New York.
 
21
KEARNS, M. J., AND SCHAPIRE, R.E. 1990. Efficient distribution-free learning of probabilistic concepts. In Proceedings of the 31st Annual IEEE Symposium on Foundations of Computer Science (Oct.). IEEE, New York, pp. 382-391.
 
22
 
23
LAIRD, P. 1992. Discrete sequence prediction and its applications. AI Research Branch, NASA ~ Ames Research Center, Moffet Field, Calif.
 
24
LANGDON, G. G. 1983. A note on the Ziv-Lempel model for compressing individual sequences. ~ IEEE Trans. Inf. Theory 29 (Mar.), 284-287.
 
25
LANGDON, G. G. 1984. An introduction to arithmetic coding. IBM J. Res. Develop. 28 (Mar.), ~ 135-149.
 
26
LEMPEL, A., AND ZIv, J. 1976. On the complexity of finite sequences. IEEE Trans. Inf. Theory ~ IT-22, i (Jan.), 75-81.
 
27
McGEOCH, L. A., AND SLEATOR, D.D. 1989. A strongly competitive randomized paging algorithm. ~ CS-89-122. Carnegie-Mellon Univ., Pittsburgh, Pa.
28
 
29
30
31
 
32
SHWARTZ, A., AND WEISS, A. 1995. Large Deviations for Performance Analysis. Chapman & Hall, ~ New York.
33
 
34
TRIVEDI, K.S. 1979. An analysis of prepaging. Computing 22, 191-210.
 
35
VAPNIIC, V. 1982. Estimation of Dependencies Based on Empirical Data. Springer-Verlag, New ~ York.
 
36
VITTER, J. S., CUREWITZ, K., AND KRISHNAN, P. 1996. Online background predictors and prefetch- ~ ers. Duke Univ., United States Patent No. 5,485,609.
37
 
38
ZIv, J., AND LEMPEL, A. 1978. Compression of individual sequences via variable-rate coding. IEEE ~ Trans. Inf. Theory 24 (Sept.), 530-536.

CITED BY  28


REVIEW

"R. Nigel Horspool : Reviewer"

The work reported here is based on the observation that lossless data compression methods implicitly or explicitly predict incoming source symbols and that similar predictions would be highly applicable to the problem of prefetching the data n  more...

Collaborative Colleagues:
Jeffrey Scott Vitter: colleagues
P. Krishnan: colleagues