|
ABSTRACT
With the amount of available text data in relational databases growing rapidly, the need for ordinary users to search such information is dramatically increasing. Even though the major RDBMSs have provided full-text search capabilities, they still require users to have knowledge of the database schemas and use a structured query language to search information. This search model is complicated for most ordinary users. Inspired by the big success of information retrieval (IR) style keyword search on the web, keyword search in relational databases has recently emerged as a new research topic. The differences between text databases and relational databases result in three new challenges: (1) Answers needed by users are not limited to individual tuples, but results assembled from joining tuples from multiple tables are used to form answers in the form of tuple trees. (2) A single score for each answer (i.e. a tuple tree) is needed to estimate its relevance to a given query. These scores are used to rank the most relevant answers as high as possible. (3) Relational databases have much richer structures than text databases. Existing IR strategies to rank relational outputs are not adequate. In this paper, we propose a novel IR ranking strategy for effective keyword search. We are the first that conducts comprehensive experiments on search effectiveness using a real world database and a set of keyword queries collected by a major search company. Experimental results show that our strategy is significantly better than existing strategies. Our approach can be used both at the application level and be incorporated into a RDBMS to support keyword-based search in relational databases.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S Agrawal, S Chaudhuri, G Das: DBXplorer: A system for keyword-based search over relational databases. ICDE 2002
|
| |
2
|
|
| |
3
|
A. Balmin, V. Hristidis,Y. Papakonstantinou: Authority Based Keyword Queries in Databases using ObjectRank. VLDB 2004
|
| |
4
|
|
| |
5
|
S. Cohen, Jonathan Mamou, Yaron Kanza, Yehoshua Sagiv: XSEarch: A Semantic Search Engine for XML. VLDB 2003
|
| |
6
|
|
| |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
V. Hristidis and Y. Papakonstantinou. DISCOVER: Keyword search in relational databases. VLDB 2002.
|
| |
11
|
V. Hristidis,L. Gravano,Y. Papakonstantinou:Efficient IR-Style Keyword Search over Relational Databases.VLDB 2003
|
 |
12
|
|
| |
13
|
Y. Li, Cong Yu, H. V. Jagadish: Schema-Free XQuery. VLDB 2004
|
 |
14
|
|
| |
15
|
L. Page, S. Brin, R. Motwani and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web, Technical Report, 1998
|
| |
16
|
R. Sacks-Davis, Tuong Dao, James A. Thom, Justin Zobel Indexing documents for queries on structure, content and attributes. ISDM 1997
|
 |
17
|
|
| |
18
|
A. Singhal. Modern information retrieval: A brief overview. IEEE Data Eng. Bull. 24(4), 2001
|
| |
19
|
E. M. Voorhees. Overview of the TREC-9 Question Answering Track. TREC 2000
|
| |
20
|
Pew Internet & American Life Project Report: Search Engine Users, 2005. www.pewinternet.org/pdfs/PIP_Searchengine_users.pd
|
| |
21
|
Google. www.google.com/ 2005
|
| |
22
|
DB2 Text Information Extender. 2005 http://www.ibm.com/software/data/db2/extenders/textinformation/index.html
|
| |
23
|
Micorsoft SQL Server 2000. www.microsoft.com/sql/ 2005
|
| |
24
|
MySQL. dev.mysql.com/doc/mysql/en/Fulltext_Search.html.
|
 |
25
|
|
| |
26
|
|
| |
27
|
|
CITED BY 23
|
|
|
|
|
|
|
|
Guoliang Li , Beng Chin Ooi , Jianhua Feng , Jianyong Wang , Lizhu Zhou, EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, June 09-12, 2008, Vancouver, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yi Chen , Wei Wang , Ziyang Liu , Xuemin Lin, Keyword search on structured and semi-structured data, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
Guoliang Li , Shengyue Ji , Chen Li , Jianhua Feng, Efficient type-ahead search on relational data: a TASTIER approach, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
Eric Chu , Akanksha Baid , Xiaoyong Chai , AnHai Doan , Jeffrey Naughton, Combining keyword search and forms for ad hoc querying of databases, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
|
|