|
ABSTRACT
Predicting the next request of a user as she visits Web pages has gained importance as Web-based activity increases. Markov models and their variations, or models based on sequence mining have been found well suited for this problem. However, higher order Markov models are extremely complicated due to their large number of states whereas lower order Markov models do not capture the entire behavior of a user in a session. The models that are based on sequential pattern mining only consider the frequent sequences in the data set, making it difficult to predict the next request following a page that is not in the sequential pattern. Furthermore, it is hard to find models for mining two different kinds of information of a user session. We propose a new model that considers both the order information of pages in a session and the time spent on them. We cluster user sessions based on their pair-wise similarity and represent the resulting clusters by a click-stream tree. The new user session is then assigned to a cluster based on a similarity measure. The click-stream tree of that cluster is used to generate the recommendation set. The model can be used as part of a cache prefetching system as well as a recommendation model.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
A. Banerjee and J. Ghosh. Clickstream clustering using weighted longest common subsequences. In Proceedings of the Wokshop on Web Mining, SIAM Conference on Data Mining, pages 33--40, 2001. Chicago, IL.
|
| |
3
|
K. Cahrter, J. Schaeffer, and D. Szafron. Sequence alignment using fastlsa. In Proceedings of the International Conference on Mathematics and Engineering Techniques in Medicine and Biological Sciences (METMBS'2000), pages 239--245, 2000.
|
| |
4
|
Cluto. http://www-users.cs.umn.edu/ karypis/cluto/index.html.
|
| |
5
|
Dan Cosley, Steve Lawrence, and David M. Pennock. REFEREE: An open framework for practical testing of recommender systems using researchindex. In Proceedings of 28th International Conference on Very Large Databases, VLDB 2002, Hong Kong, August 20--23 2002.
|
| |
6
|
Chris Ding, Xiaofeng He, Hongyuan Zha, Minh Gu, and Horst Simon. Spectral min-max cut for graph partitioning and data clustering. 2001. Technical Report TR-2001-XX, Lawrence Berkeley National Laboratory, University of CaliforniaBerkeley, CA.
|
 |
7
|
|
| |
8
|
ClarkNet WWW Server Log. http://ita.ee.lbl.gov/html/contrib/ClarkNet-HTTP.html.
|
| |
9
|
NASA Kennedy Space Center Log. http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html.
|
| |
10
|
B. Mobasher, H. Dai, T. Luo, and M. Nakagawa. Discovery of aggregate usage profiles for web personalization. In Proceedings of the Web Mining for E-Commerce Workshop (WebKDD'2000), 2000.
|
 |
11
|
Bamshad Mobasher , Honghua Dai , Tao Luo , Miki Nakagawa, Effective personalization based on association rule discovery from web usage data, Proceedings of the 3rd international workshop on Web information and data management, November 09-01, 2001, Atlanta, Georgia, USA
[doi> 10.1145/502932.502935]
|
| |
12
|
|
| |
13
|
|
 |
14
|
|
CITED BY 12
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ding Zhou , Levent Bolelli , Jia Li , C. Lee Giles , Hongyuan Zha, Learning user clicks in web search, Proceedings of the 20th international joint conference on Artifical intelligence, p.1162-1167, January 06-12, 2007, Hyderabad, India
|
|