ACM Home Page
Please provide us with feedback. Feedback
Do we mean the same?: disambiguation of extracted keyword queries for database search
Full text PdfPdf (482 KB)
Source
International Conference on Management of Data archive
Proceedings of the First International Workshop on Keyword Search on Structured Data table of contents
Providence, Rhode Island
SESSION: Keyword query disambiguation table of contents
Pages 33-38  
Year of Publication: 2009
ISBN:978-1-60558-570-3
Authors
Elena Demidova  L3S Research Center, Hannover, Germany
Irina Oelze  L3S Research Center, Hannover, Germany
Peter Fankhauser  L3S Research Center, Hannover, Germany
Sponsors
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
SIGMOD: ACM Special Interest Group on Management of Data
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 41,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1557670.1557682
What is a DOI?

ABSTRACT

Users often try to accumulate information on a topic of interest from multiple information sources. In this case a user's informational need might be expressed in terms of an available relevant document, e.g. a web-page or an e-mail attachment, rather than a query. Database search engines are mostly adapted to the queries manually created by the users. In case a user's informational need is expressed in terms of a document, we need algorithms that map keyword queries automatically extracted from this document to the database content.

In this paper we analyze the impact of selected document and database statistics on the effectiveness of keyword disambiguation for manually created as well as automatically extracted keyword queries. Our evaluation is performed using a set of user queries from the AOL query log and a set of queries automatically extracted from Wikipedia articles both executed against the Internet Movie Database (IMDB). Our experimental results show that (1) knowledge of the document context is crucial in order to extract meaningful keyword queries; (2) statistics which enable effective disambiguation of user queries are not sufficient to achieve the same quality for the automatically extracted requests.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Agrawal, S., Chaudhuri, S., and Das, G., DBXplorer: A System for Keyword-Based Search over Relational Databases. ICDE 2002.
 
2
 
3
4
 
5
 
6
Gospodnetic, O. and Hatcher, E., Lucene in Action, Manning 2005.
7
 
8
 
9
10
 
11
12
13
 
14
 
15
Matsuo, Y., and Ishizuka, M. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools, 2004
 
16
 
17
18
 
19
Tran, T., P. Cimiano, Rudolph, S., and Studer, R.: Ontology-Based Interpretation of Keywords for Semantic Search. ISWC 2007.
20
 
21
Zhou, Q., Wang, C., Xiong, M., Wang. H. and Yu, Y. SPARK: Adapting Keyword Query to Semantic Search. ISWC 2007.

Collaborative Colleagues:
Elena Demidova: colleagues
Irina Oelze: colleagues
Peter Fankhauser: colleagues