ACM Home Page
Please provide us with feedback. Feedback
Local approximation of pagerank and reverse pagerank
Full text PdfPdf (314 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
SESSION: KM: link and graph mining table of contents
Pages 279-288  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Ziv Bar-Yossef  Technion and Google Haifa Engineering Center, Haifa, Israel
Li-Tal Mashiach  Technion, Haifa, Israel
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 26,   Downloads (12 Months): 205,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458122
What is a DOI?

ABSTRACT

We consider the problem of approximating the PageRank of a target node using only local information provided by a link server. This problem was originally studied by Chen, Gan, and Suel (CIKM 2004), who presented an algorithm for tackling it. We prove that local approximation of PageRank, even to within modest approximation factors, is infeasible in the worst-case, as it requires probing the link server for Ω(n) nodes, where n is the size of the graph. The difficulty emanates from nodes of high in-degree and/or from slow convergence of the PageRank random walk.

We show that when the graph has bounded in-degree and admits fast PageRank convergence, then local PageRank approximation can be done using a small number of queries. Unfortunately, natural graphs, such as the web graph, are abundant with high in-degree nodes, making this algorithm (or any other local approximation algorithm) too costly. On the other hand, reverse natural graphs tend to have low in-degree while maintaining fast PageRank convergence. It follows that calculating Reverse PageRank locally is frequently more feasible than computing PageRank locally.

We demonstrate that Reverse PageRank is useful for several applications, including computation of hub scores for web pages, finding influencers in social networks, obtaining good seeds for crawling, and measurement of semantic relatedness between concepts in a taxonomy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
A. L. Barabasi and R. Albert. Emergence of scaling in random networks. Science, 286(5439):509--512, 1999.
 
3
P. Berkhin. A survey on PageRank computing. Internet Mathematics, 2(1):73--120, 2005.
 
4
 
5
 
6
 
7
 
8
9
 
10
11
12
 
13
14
 
15
D. Fogaras. Where to start browsing the Web? In IICS, pages 65--79, 2003.
 
16
E. Gabrilovich and S. Markovitch. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In Proc. 20th IJCAI, pages 250--257, 2007.
 
17
 
18
 
19
T. H. Haveliwala and S. D. Kamvar. The second eigenvalue of the Google matrix. Technical report, Stanford University, 2003.
 
20
A. Java, P. Kolari, T. Finin, and T. Oates. Modeling the spread of influence on the Blogosphere. Technical report, University of Maryland, Baltimore County, 2006.
21
 
22
S. Kamvar, H. Haveliwala, and G. Golub. Adaptive methods for the computation of PageRank. Linear Algebra and its Applications, 386:51--65, 2004.
 
23
C. Kohlschütter, P. A. Chirita, and W. Nejdl. Efficient parallel computation of PageRank. In Proc. 28th ECIR, pages 241--252, 2006.
 
24
G. Kollias and E. Gallopoulos. Asynchronous computation of PageRank computation in an interactive multithreading environment. In Web Information Retrieval and Linear Algebra Algorithms, 2007.
 
25
 
26
 
27
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the Web. Technical report, Stanford Digital Library Technologies Project, 1998.
 
28
 
29
R. Rada, H. Mili, E. Bicknell, and M. Blettner. Development and application of a metric on semantic nets. IEEE Trans. on Systems, Man and Cybernetics, 19(1):17--30, 1989.
30
 
31
 
32
 
33
M. Strube and S. P. Ponzetto. WikiRelate! computing semantic relatedness using Wikipedia. In AAAI 2006, pages 1419--1424, 2006.
 
34
 
35
Y. Wu. Subgraphrank: PageRank approximation for a subgraph or in a decentralized system. VLDB PhD workshop, 2007.

Collaborative Colleagues:
Ziv Bar-Yossef: colleagues
Li-Tal Mashiach: colleagues