ACM Home Page
Please provide us with feedback. Feedback
Relevance based language models
Full text PdfPdf (166 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
New Orleans, Louisiana, United States
Pages: 120 - 127  
Year of Publication: 2001
ISBN:1-58113-331-6
Authors
Victor Lavrenko  Univ. of Massachusetts, Amherst
W. Bruce Croft  Univ. of Massachusetts, Amherst
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 35,   Downloads (12 Months): 282,   Citation Count: 155
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/383952.383972
What is a DOI?

ABSTRACT

We explore the relation between classical probabilistic models of information retrieval and the emerging language modeling approaches. It has long been recognized that the primary obstacle to effective performance of classical models is the need to estimate arelevance model: probabilities of words in the relevant class. We propose a novel technique for estimating these probabilities using the query alone. We demonstrate that our technique can produce highly accurate relevance models, addressing important notions of synonymy and polysemy. Our experiments show relevance models outperforming baseline language modeling systems on TREC retrieval and TDT tracking tasks. The main contribution of this work is an effective formal method for estimating a relevance model with no training data.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. Allan, J. Callan, F. Feng, and D. Malin. INQUERY and TREC-8. In D. Harman, editor, Proceedings of the Eighth Text REtrieval Conference (TREC-8), 1999.
2
 
3
4
5
6
 
7
 
8
 
9
C. Cieri, D.Graff, M.Liberman, N.Martey, and S.Strassel. The TDT-2 text and speech corpus. In Proceedings of the DARPA Broadcast News Workshop, pp 57-60, 1999.
 
10
D. Hiemstra. Using language models for information retrieval. In PhD Thesis, University of Twente, 2001.
 
11
D. Hiemstra and A. de Vries. Relating the new language models of information retrieval to the traditional retrieval models. In CTIT Technical Report TR-CTIT-00-09, 2000.
 
12
H. Jin, R. Schwartz, S. Sista, and F. Walls. Topic tracking for radio, TV broadcast and newswire. In Proceedings of DARPA Broadcast News Workshop, pp 199-204, 1999.
 
13
A. Martin, G. Doddington, T. Kamm, and M. Ordowski. The DET curve in assessment of detection task performance. In EuroSpeech, pages 1895-1898, 1997.
14
 
15
16
 
17
S. Robertson and K. S. Jones. Relevance weighting of search terms. In Journal of the American Society for Information Science, vol.27, 1977.
 
18
 
19
 
20
S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford. OKAPI at TREC-3. In D. Harman, editor, Proceedings of the 3rd Text REtrieval Conference (TREC-3), 1996.
21
 
22
H. Turtle and W. B. Croft. Efficient probabilistic inference for text retrieval. In Proceedings of RIAO 3, pages 644-651, 1991.
 
23
C. J. van Rijsbergen. A theoretical basis for the use of co-occurrence data in information retrieval. Journal of Documentation, 33:106-119, 1977.
24
 
25
J. Yamron, I. Carp, L. Gillick, S.Lowe, and P. van Mulbregt. Topic tracking in a news stream. In Proceedings of DARPA Broadcast News Workshop, pp 133-136, 1999.
 
26
J. Yamron, S. Knecht, and P. van Mulbregt. Dragon's tracking and detection systems for the TDT2000 evaluation. In Proceedings of Topic Detection and Tracking Workshop, pp 75-80, 2000.

CITED BY  155

Collaborative Colleagues:
Victor Lavrenko: colleagues
W. Bruce Croft: colleagues