ACM Home Page
Please provide us with feedback. Feedback
Better than the real thing?: iterative pseudo-query processing using cluster-based language models
Full text PdfPdf (236 KB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Salvador, Brazil
SESSION: Theory 1 table of contents
Pages: 19 - 26  
Year of Publication: 2005
ISBN:1-59593-034-5
Authors
Oren Kurland  Cornell University, Ithaca NY and Carnegie Mellon University, Pittsburgh PA
Lillian Lee  Cornell University, Ithaca NY and Carnegie Mellon University, Pittsburgh PA
Carmel Domshlak  Technion, Haifa, Israel
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 72,   Citation Count: 16
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1076034.1076041
What is a DOI?

ABSTRACT

We present a novel approach to pseudo-feedback-based ad hoc retrieval that uses language models induced from both documents and clusters. First, we treat the pseudo-feedback documents produced in response to the original query as a set of pseudo-query that themselves can serve as input to the retrieval process. Observing that the documents returned in response to the pseudo-query can then act as pseudo-query for subsequent rounds, we arrive at a formulation of pseudo-query-based retrieval as an iterative process. Experiments show that several concrete instantiations of this idea, when applied in conjunction with techniques designed to heighten precision, yield performance results rivaling those of a number of previously-proposed algorithms, including the standard language-modeling approach. The use of cluster-based language models is a key contributing factor to our algorithms' success.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
James Allan. HARD track overview in TREC 2003: High accuracy retrieval from documents. In Proceedings of the Twelfth Text Retrieval Conference (TREC-12), pages 24--37, 2003.
 
3
James Allan, Margaret E. Connell, W. Bruce Croft, Fang-Fang Feng, David Fisher, and Xiaoyan Li. INQUERY and TREC-9. In Proceedings of the Ninth Text Retrieval Conference (TREC-9), pages 551--562, 2001. NIST Special Publication 500--249.
4
 
5
Margaret Connell, Ao Feng, Giridhar Kumaran, Hema Raghavan, Chirag Shah, and James Allan. UMass at TDT 2004. TDT2004 System Description, 2004.
 
6
 
7
 
8
Steve Cronen-Townsend, Yun Zhou, and W. Bruce Croft. A language modeling framework for selective query expansion. Technical Report IR-338, Center for Intelligent Information Retrieval, University of Massachusetts, 2004.
 
9
10
11
12
13
14
15
 
16
 
17
Victor Lavrenko, James Allan, Edward DeGuzman, Daniel LaFlamme, Veera Pollard, and Steven Thomas. Relevance models for topic detection and tracking. In Proceedings of the Human Language Technology Conference (HLT), pages 104--110, 2002.
18
 
19
Victor Lavrenko and W. Bruce Croft. Relevance models in information retrieval. In Croft and Lafferty {7}, pages 11--56.
20
21
 
22
Kenney Ng. A maximum likelihood ratio information retrieval model. In Proceedings of the Eighth Text Retrieval Conference (TREC-8), pages 483--492, 2000.
 
23
Paul Ogilvie and Jamie Callan. Experiments using the LEMUR toolkit. In Proceedings of the Tenth Text Retrieval Conference (TREC-10), pages 103--108, 2001.
24
 
25
Joseph John Rocchio. Relevance feedback in information retrieval. In Gerard Salton, editor, The SMART Retrieval System: Experiments in Automatic Document Processing, pages 313--323. Prentice Hall, 1971.
 
26
27
 
28
Tao Tao and ChengXiang Zhai. A mixture clustering model for pseudo feedback in information retrieval. In Proceedings of the International Federation of Classification Societies (IFCS), 2004. Invited paper.
29
30
31
32

CITED BY  16

Collaborative Colleagues:
Oren Kurland: colleagues
Lillian Lee: colleagues
Carmel Domshlak: colleagues