ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Using parsimonious language models on web data
Full text PdfPdf (135 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
POSTER SESSION: Posters group 2: blog, tagging, opinion analysis and web IR table of contents
Pages: 763-764  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Rianne Kaptein  University of Amsterdam, Amsterdam, Netherlands
Rongmei LI  University of Twente, Enschede, Netherlands
Djoerd Hiemstra  University of Twente, Enschede, Netherlands
Jaap Kamps  University of Amsterdam, Amsterdam, Netherlands
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 73,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390491
What is a DOI?

ABSTRACT

In this paper we explore the use of parsimonious language models for web retrieval. These models are smaller thus more efficient than the standard language models and are therefore well suited for large-scale web retrieval. We have conducted experiments on four TREC topic sets, and found that the parsimonious language model results in improvement of retrieval effectiveness over the standard language model for all data-sets and measures. In all cases the improvement is significant, and more substantial than in earlier experiments on newspaper/newswire data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
J. Kamps. Effective smoothing for a terabyte of text. In The Fourteenth Text REtrieval Conference (TREC 2005). National Institute of Standards and Technology. NIST Special Publication, 2006.
 
3
K. Sparck-Jones, S. Robertson, D. Hiemstra, and H. Zaragoza. Language modelling and relevance. In W. Croft and J. Lafferty, editors, Language Modeling for Information Retrieval, pages 57--71. Kluwer Academic Publishers, 2003.
 
4
TREC. Text REtrieval Conference, 2008. http://trec.nist.gov/.

Collaborative Colleagues:
Rianne Kaptein: colleagues
Rongmei LI: colleagues
Djoerd Hiemstra: colleagues
Jaap Kamps: colleagues