| Mining web logs for prediction models in WWW caching and prefetching |
| Full text |
Pdf
(414 KB)
|
| Source
|
International Conference on Knowledge Discovery and Data Mining
archive
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
table of contents
San Francisco, California
Pages: 473 - 478
Year of Publication: 2001
ISBN:1-58113-391-X
|
|
Authors
|
|
Qiang Yang
|
Simon Fraser University, Burnaby, BC, Canada
|
|
Haining Henry Zhang
|
IBM E-business Innovation Center, Vancouver, Burnaby, BC, Canada
|
|
Tianyi Li
|
Simon Fraser University, Burnaby, BC, Canada
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 21, Downloads (12 Months): 116, Citation Count: 25
|
|
|
ABSTRACT
Web caching and prefetching are well known strategies for improving the performance of Internet systems. When combined with web log mining, these strategies can decide to cache and prefetch web documents with higher accuracy. In this paper, we present an application of web log mining to obtain web-document access patterns and use these patterns to extend the well-known GDSF caching policies and prefetching policies. Using real web logs, we show that this application of data mining can achieve dramatic improvement to web-access performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Arlitt, R. Friedrich L. Cherkasova, J. DiUey, and T. Jin. Evaluating content management techniques for web proxy caches. In HP Technical report, Palo Alto, Apr. 1999.
|
| |
2
|
|
| |
3
|
|
| |
4
|
P. Cao and S. Irani. Cost-aware www proxy caching algorithms. In USENIX Symposium on Internet Technologies and Systems, Monterey, CA, Dec. 1997.
|
| |
5
|
Pitknw J. and Pirolli P. Mining longest repeating subsequences to predict www surfing. In Proceedings of the 1999 USENIX Annual Technical Conference, 1999.
|
| |
6
|
T.M. Kroeger and D. D. E. Long. Predicting future filesystem actions from prior events. In USENIX 96, San Diego, Calif., Jan. 1996.
|
| |
7
|
K. Chinen and S. Yamaguchi. An Interactive Prefetching Proxy Server for Improvement of WWW Latency. In Proceedings of the Seventh Annual Conference of the Internet Society (INEt'97), Kuala Lumpur, June 1997.
|
| |
8
|
|
| |
9
|
L. Cherkasova. Improving www proxies performance with greedy-dual-size-frequency caching policy. In HP Technical Report, Palo Alto, November 1998.
|
 |
10
|
Pei Cao , Edward W. Felten , Anna R. Karlin , Kai Li, A study of integrated prefetching and caching strategies, Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, p.188-197, May 15-19, 1995, Ottawa, Ontario, Canada
|
| |
11
|
K. Chinen and S. Yamaguchi. An interactive prefetching proxy server for improvement of www latency. In Proceedings of the Seventh Annual Conference of the Internet Society (INET '97), Kuala Lumpur, Malaysia, June 1997.
|
| |
12
|
D. Duchamp. Prefetching hyperlinks. In Proceedings of the Second USENIX Symposium on Internet Technologies and Systems (USITS '99), Boulder, CO, October 1999.
|
| |
13
|
|
| |
14
|
V. Padmanabhan and J. Mogul. Using predictive prefetching to improve world of the Seventeenth International Conference on very Large Database, pages 255-264, September 1991.
|
| |
15
|
|
| |
16
|
M. Arlitt, R. Friedrich, L. Cherkasova, J. Dilley, and T. Jin. Evaluating content management techniques for web proxy caches. In IIP Technical report, Palo Alto, Apr. 1999.
|
CITED BY 25
|
|
|
|
|
|
|
|
Gui-Rong Xue , Hua-Jun Zeng , Zheng Chen , Wei-Ying Ma , Hong-Jiang Zhang , Chao-Jun Lu, Implicit link analysis for small web search, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, July 28-August 01, 2003, Toronto, Canada
|
|
|
|
|
|
Mark Bilezikjian , John C. Tang , James Bo Begole , Nicole Yankelovich, Exploring web browser history comparisons, CHI '02 extended abstracts on Human factors in computing systems, April 20-25, 2002, Minneapolis, Minnesota, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Nicolas Poggi , Toni Moreno , Josep Lluis Berral , Ricard Gavaldí , Jordi Torres, Self-adaptive utility-based web session management, Computer Networks: The International Journal of Computer and Telecommunications Networking, v.53 n.10, p.1712-1721, July, 2009
|
|
|
|
|
|
|
|
|
|
|