ACM Home Page
Please provide us with feedback. Feedback
Cluster-based retrieval using language models
Full text PdfPdf (248 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Sheffield, United Kingdom
SESSION: Language models table of contents
Pages: 186 - 193  
Year of Publication: 2004
ISBN:1-58113-881-4
Authors
Xiaoyong Liu  University of Massachusetts, Amherst, MA
W. Bruce Croft  University of Massachusetts, Amherst, MA
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 186,   Citation Count: 56
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1008992.1009026
What is a DOI?

ABSTRACT

Previous research on cluster-based retrieval has been inconclusive as to whether it does bring improved retrieval effectiveness over document-based retrieval. Recent developments in the language modeling approach to IR have motivated us to re-examine this problem within this new retrieval framework. We propose two new models for cluster-based retrieval and evaluate them on several TREC collections. We show that cluster-based retrieval can perform consistently across collections of realistic size, and significant improvements over document-based retrieval can be obtained in a fully automatic manner and without relevance information provided by human.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Allan, J., Carbonell, J., Doddington, G., Yamron, J., and Yang, Y. (1998). Topic detection and tracking pilot study: Final report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, pp. 194--218.
 
2
Croft, W. B. (1980). A model of cluster searching based on classification. Information Systems, Vol. 5, pp. 189--195.
 
3
 
4
 
5
Evans, D.A.; Huettner, A.; Tong, X.; Jansen, P.; & Bennett, J. (1999). Effectiveness of clustering in ad-hoc retrieval. In TREC-7 proceedings, pp. 90--95.
 
6
Griffiths, A., Luckhurst, H.C., and Willett, P. (1986). Using interdocument similarity information in document retrieval systems. Journal of the American Society for Information Science, 37, pp. 3--11.
7
 
8
Jardine, N. and van Rijsbergen, C.J. (1971). The use of hierarchical clustering in information retrieval. Information Storage and Retrieval, 7:217--240.
9
10
11
12
 
13
Ponte, J. (2001). Is information retrieval anything more than smoothing? In Proceedings of the Workshop on Language Modeling and Information Retrieval, Carnegie Mellon University, Pittsburgh.
14
 
15
Rosenfeld, R. (2000). Two decades of statistical language modeling: where do we go from here? In Proceedings of the IEEE, 88(8), 2000.
 
16
 
17
 
18
Spitters, M., and Kraaij, W. (2001). TNO at TDT2001: Language model-based topic detection. In Topic Detection and Tracking Workshop Report.
 
19
 
20
 
21
van Rijsbergen, C.J. & Croft, W. B. (1975). Document clustering: An evaluation of some experiments with the Cranfield 1400 collection. Information Processing & Management, 11, pp. 171--182.
22
 
23
 
24
 
25
Willet, P. (1985). Query specific automatic document classification. International Forum on Information and Documentation, 10(2), pp. 28--32.
26
 
27
Yamron, J.P., Carp, I., Gillick, L., Lowe, S.A., and van Mulbregt, P. (1999). Topic tracking in a news stream. In Proceedings of the DARPA Broadcast News Workshop.
28
29

CITED BY  56

Collaborative Colleagues:
Xiaoyong Liu: colleagues
W. Bruce Croft: colleagues