ACM Home Page
Please provide us with feedback. Feedback
Using ODP metadata to personalize search
Full text PdfPdf (310 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Salvador, Brazil
SESSION: Web search 1 table of contents
Pages: 178 - 185  
Year of Publication: 2005
ISBN:1-59593-034-5
Authors
Paul Alexandru Chirita  L3S and University of Hannover, Hannover, Germany
Wolfgang Nejdl  L3S and University of Hannover, Hannover, Germany
Raluca Paiu  L3S and University of Hannover, Hannover, Germany
Christian Kohlschütter  L3S and University of Hannover, Hannover, Germany
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 131,   Citation Count: 27
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1076034.1076067
What is a DOI?

ABSTRACT

The Open Directory Project is clearly one of the largest collaborative efforts to manually annotate web pages. This effort involves over 65,000 editors and resulted in metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is just about 0.05 percent of the Web pages indexed by Google, is this effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high quality personalized web search. First, we address this by introducing an additional criterion for web page ranking, namely the distance between a user profile defined using ODP topics and the sets of ODP topics covered by each URL returned in regular web search. We empirically show that this enhancement yields better results than current web search using Google. Then, in the second part of the paper, we investigate the boundaries of biasing PageRank on subtopics of the ODP in order to automatically extend these metadata to the whole web.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Bortz. Statistics for Social Scientists. Springer Verlag, 1993.
 
2
S. Brin, R. Motwani, L. Page, and T. Winograd. What can you do with a web in your pocket? Data Engineering Bulletin, 21(2):37--47, 1998.
 
3
 
4
P.-A. Chirita, D. Olmedilla, and W. Nejdl. Pros: A personalized ranking platform for web search. In Proceedings of the International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, Aug 2004.
5
6
 
7
M. Ester, H.-P. Kriegel, and M. Schubert. Accurate and efficient crawling for relevant websites. In Proceedings of the 30th International VLDB Conference, 2004.
 
8
Google search api. http://api.google.com.
 
9
Google search engine. http://www.google.com.
 
10
Z. Gý'ongyi, H. Garcia-Molina, and J. Pendersen. Combating web spam with trustrank. In Proceedings of the 30th International VLDB Conference, 2004.
11
12
13
 
14
O. Kolesnikov, W. Lee, and R. Lipton. Filtering spam using search engines, 2003.
 
15
 
16
Y. Li, Z. A. Bandar, and D. McLean. An approach for measuring semantic similarity between words using multiple information sources. IEEE Transactions on Knowledge and Data Engineering, 15(4):871--882, 2003.
17
18
19
 
20
Open directory project. http://dmoz.org/.
 
21
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University, 1998.
 
22
Stanford webbase project. http://webbase.stanford.edu.
 
23
 
24
M. Williamson. Using dmoz open directory project lists with novell bordermanager, 2003.
 
25
J. B. Winer. Statistical principles in experimental design. McGraw Hill, 1962.

CITED BY  27

Collaborative Colleagues:
Paul Alexandru Chirita: colleagues
Wolfgang Nejdl: colleagues
Raluca Paiu: colleagues
Christian Kohlschütter: colleagues