ACM Home Page
Please provide us with feedback. Feedback
Recognition and classification of noun phrases in queries for effective retrieval
Full text PdfPdf (499 KB)
Source
Conference on Information and Knowledge Management archive
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management table of contents
Lisbon, Portugal
SESSION: Natural language II (IR) table of contents
Pages 711-720  
Year of Publication: 2007
ISBN:978-1-59593-803-9
Authors
Wei Zhang  University of Illinois at Chicago, Chicago, IL
Shuang Liu  Ask.com, Edison, NJ
Clement Yu  University of Illinois at Chicago, Chicago, IL
Chaojing Sun  Broadcom Corporation, San Diego, CA
Fang Liu  Microsoft, Redmond, WA
Weiyi Meng  Binghamton University, Binghamton, NY
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 17,   Downloads (12 Months): 107,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1321440.1321540
What is a DOI?

ABSTRACT

It has been shown that using phrases properly in the document retrieval leads to higher retrieval effectiveness. In this paper, we define four types of noun phrases and present an algorithm for recognizing these phrases in queries. The strengths of several existing tools are combined for phrase recognition. Our algorithm is tested using a set of 500 web queries from a query log, and a set of 238 TREC queries. Experimental results show that our algorithm yields high phrase recognition accuracy. We also use a baseline noun phrase recognition algorithm to recognize phrases from the TREC queries. A document retrieval experiment is conducted using the TREC queries (1) without any phrases, (2) with the phrases recognized from a baseline noun phrase recognition algorithm, and (3) with the phrases recognized from our algorithm respectively. The retrieval effectiveness of (3) is better than that of (2), which is better than that of (1). This demonstrates that utilizing phrases in queries does improve the retrieval effectiveness, and better noun phrase recognition yields higher retrieval performance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
4
5
6
 
7
Nancy Chinchor. Overview of MUC-7. In Proc. of MUC. 1998.
 
8
Ken Chow, Robert Luk, Kam-Fai Wong and Kui-Lam. Kwok: Hybrid Term Indexing for Weighted Boolean and Vector Space Models. Int. J. Comput. Proc. Oriental Lang. 14(2): 133--151, 2001.
 
9
 
10
 
11
Glossary of linguistic terms, by E. Loos, S. Anderson, D. Day, P. Jordan, and D. Wingate (editors). SIL International. 2003
 
12
C. Fellbaum. WordNet, An electronic Lexical Database. The MIT Press, 1998.
 
13
 
14
Google: http://www.Google.com/apis/
 
15
16
17
 
18
 
19
D. Lin. Using collocation statistics in information extraction. In Proc. of Message Understanding Conference. 1998.
20
 
21
 
22
 
23
 
24
 
25
G. Miller. WordNet: An On-line Lexical Database, Special Issue, International Journal of Lexicography. 1990.
 
26
S. Robertson and S. Walker. Okapi/Keenbow at TREC-8. In Proc. of TREC. 1999.
 
27
 
28
University of Glasgow, LILT project, www.arts.gla.ac.uk/SESLL/EngLang/LILT/frameset.htm
 
29
 
30
E. Voorhees. Overview of the TREC 2004 Robust Retrieval Track. In Proc. of the 13th TREC. 2004.
 
31
E. Voorhees. Overview of the TREC 2005 Robust Retrieval Track. In Proc. of the 14th TREC. 2005.
 
32
Wikipedia: http://en.wikipedia.org
33


Collaborative Colleagues:
Wei Zhang: colleagues
Shuang Liu: colleagues
Clement Yu: colleagues
Chaojing Sun: colleagues
Fang Liu: colleagues
Weiyi Meng: colleagues