ACM Home Page
Please provide us with feedback. Feedback
Automatic document prior feature selection for web retrieval
Full text PdfPdf (239 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
POSTER SESSION: Posters group 2: blog, tagging, opinion analysis and web IR table of contents
Pages 761-762  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Jie Peng  University of Glasgow, Glasgow, United Kngdm
Craig Macdonald  University of Glasgow, Glasgow, United Kngdm
Iadh Ounis  University of Glasgow, Glasgow, United Kngdm
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 19,   Downloads (12 Months): 162,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390490
What is a DOI?

ABSTRACT

Document prior features, such as Pagerank and URL depth, can improve the retrieval effectiveness of Web Information Retrieval (IR) systems. However, not all queries equally benefit from the application of a document prior feature. This paper aims to investigate whether the retrieval performance can be further enhanced by selecting the best document prior feature on a per-query basis. We present a novel method for selecting the best document prior feature on a per-query basis. We evaluate our technique on the TREC .GOV Web test collection and its associated TREC 2003 Web search tasks. Our experiments demonstrate the effectiveness and robustness of our proposed selection method.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Kamps, G. Mishne, M. de Rijke. Language Models for Searching in Web Corpora. In Proceedings of TREC 2004.
2
 
3
S. Kullback. Information Theory and Statistics. Jown Wiley & Sons, New York, USA, 1959.
 
4
J. Lin. Divergence Measures Based on the Shannon Entropy. IEEE Transactions on Information Theory, 37:145--151, 1991.
 
5
C. Macdonald, V. Plachouras, B. He, C. Lioma, I. Ounis. University of Glasgow at WebCLEF 2005: Experiments in per-field normalisation and language specific stemming. In Proceedings of CLEF 2005.
6
 
7
V. Plachouras. Selective Web Information Retrieval. Phd Thesis. Univ of Glasgow, 2006.

Collaborative Colleagues:
Jie Peng: colleagues
Craig Macdonald: colleagues
Iadh Ounis: colleagues