| BrowseRank: letting web users vote for page importance |
| Full text |
Pdf
(381 KB)
|
Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Singapore, Singapore
SESSION: Analysis of social networks
table of contents
Pages 451-458
Year of Publication: 2008
ISBN:978-1-60558-164-4
|
|
Authors
|
|
Yuting Liu
|
Beijing Jiaotong University, Beijing, China
|
|
Bin Gao
|
Microsoft Research Asia, Beijing, China
|
|
Tie-Yan Liu
|
Microsoft Research Asia, Beijing, China
|
|
Ying Zhang
|
Nankai University, Tianjin, China
|
|
Zhiming Ma
|
Chinese Academy of Science, Beijing, China
|
|
Shuyuan He
|
Peking University, Beijing, China
|
|
Hang Li
|
Microsoft Research Asia, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 55, Downloads (12 Months): 679, Citation Count: 5
|
|
|
ABSTRACT
This paper proposes a new method for computing page importance, referred to as BrowseRank. The conventional approach to compute page importance is to exploit the link graph of the web and to build a model based on that graph. For instance, PageRank is such an algorithm, which employs a discrete-time Markov process as the model. Unfortunately, the link graph might be incomplete and inaccurate with respect to data for determining page importance, because links can be easily added and deleted by web content creators. In this paper, we propose computing page importance by using a 'user browsing graph' created from user behavior data. In this graph, vertices represent pages and directed edges represent transitions between pages in the users' web browsing history. Furthermore, the lengths of staying time spent on the pages by users are also included. The user browsing graph is more reliable than the link graph for inferring page importance. This paper further proposes using the continuous-time Markov process on the user browsing graph as a model and computing the stationary probability distribution of the process as page importance. An efficient algorithm for this computation has also been devised. In this way, we can leverage hundreds of millions of users' implicit voting on page importance. Experimental results show that BrowseRank indeed outperforms the baseline methods such as PageRank and TrustRank in several tasks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
|
| |
7
|
Z. Gyongyi and H. Garcia-Molina. Web spam taxonomy, 2005.
|
| |
8
|
|
| |
9
|
T. Haveliwala. Efficient computation of pageRank. Technical Report 1999-31, 1999.
|
| |
10
|
T. Haveliwala and S. Kamvar. The second eigenvalue of the google matrix, 2003.
|
| |
11
|
T. Haveliwala, S. Kamvar, and G. Jeh. An analytical comparison of approaches to personalizing pagerank, 2003.
|
 |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
|
| |
16
|
A. N. Langville and C. D. Meyer. Deeper inside pagerank. Internet Mathematics, 1(3):335--400, 2004.
|
 |
17
|
|
| |
18
|
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
|
| |
19
|
J. A. Rice. Mathematical Statistics and Data Analysis (2nd ed.). Duxbery Press, 1995.
|
| |
20
|
M. Richardson and P. Domingos. The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank. In Advances in Neural Information Processing Systems 14. MIT Press, 2002.
|
| |
21
|
S. E. Robertson. Overview of okapi projects. Journal of Documentatioin, 53(1):3--7, 1997.
|
| |
22
|
W. J. Stewart. Introduction to the Numerical Solution of Markov Chains. Princeton University Press, Princeton, N,J., 1994.
|
| |
23
|
Z. K. Wang and X. Q. Yang. Birth and Death Processes and Markov Chains. Springer-Verlag, New York, 1992.
|
 |
24
|
|
CITED BY 5
|
|
Jianfeng Gao , Wei Yuan , Xiao Li , Kefeng Deng , Jian-Yun Nie, Smoothing clickthrough data for web search ranking, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|
|
|
|
|
Mark Meiss , John Duncan , Bruno Gonçalves , José J. Ramasco , Filippo Menczer, What's in a session: tracking individual behavior on the web, Proceedings of the 20th ACM conference on Hypertext and hypermedia, June 29-July 01, 2009, Torino, Italy
|
|
|
|
|
|
|
|