ACM Home Page
Please provide us with feedback. Feedback
Exploiting proximity feature in bigram language model for information retrieval
Full text PdfPdf (260 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Singapore, Singapore
POSTER SESSION: Posters group 4: theory and IR models table of contents
Pages 821-822  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Seung-Hoon Na  Pohang University of Science and Technology (POSTECH), Pohang, South Korea
Jungi Kim  Pohang University of Science and Technology (POSTECH), Pohang, South Korea
In-Su Kang  Kyungsung University, Pusan, South Korea
Jong-Hyeok Lee  Pohang University of Science and Technology (POSTECH), Pohang, South Korea
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 18,   Downloads (12 Months): 146,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1390334.1390522
What is a DOI?

ABSTRACT

Language modeling approaches have been effectively dealing with the dependency among query terms based on N-gram such as bigram or trigram models. However, bigram language models suffer from adjacency-sparseness problem which means that dependent terms are not always adjacent in documents, but can be far from each other, sometimes with distance of a few sentences in a document. To resolve the adjacency-sparseness problem, this paper proposes a new type of bigram language model by explicitly incorporating the proximity feature between two adjacent terms in a query. Experimental results on three test collections show that the proposed bigram language model significantly improves previous bigram model as well as Tao's approach, the state-of-art method for proximity-based method.




Collaborative Colleagues:
Seung-Hoon Na: colleagues
Jungi Kim: colleagues
In-Su Kang: colleagues
Jong-Hyeok Lee: colleagues