| Answering aggregate keyword queries on relational databases using minimal group-bys |
| Full text |
Pdf
(582 KB)
|
| Source
|
Extending Database Technology; Vol. 360
archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
table of contents
Saint Petersburg, Russia
SESSION: Research sessions: Database summarization
table of contents
Pages 108-119
Year of Publication: 2009
ISBN:978-1-60558-422-5
|
|
Authors
|
|
Bin Zhou
|
Simon Fraser University, Canada
|
|
Jian Pei
|
Simon Fraser University, Canada
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 81, Citation Count: 1
|
|
|
ABSTRACT
Keyword search has been recently extended to relational databases to retrieve information from text-rich attributes. However, all the existing methods focus on finding individual tuples matching a set of query keywords from one table or the join of multiple tables. In this paper, we motivate a novel problem of aggregate keyword search: finding minimal group-bys covering a set of query keywords well, which is useful in many applications. We develop two interesting approaches to tackle the problem, and further extend our methods to allow partial matches. An extensive empirical evaluation using both real data sets and synthetic data sets is reported to verify the effectiveness of aggregate keyword search and the efficiency of our methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. Agrawal et al. DBXplorer: A system for keyword-based search over relational databases. In ICDE'02.
|
 |
2
|
|
 |
3
|
|
| |
4
|
G. Bhalotia et al. Keyword searching and browsing in databases using banks. In ICDE'02.
|
| |
5
|
S. Chaudhuri et al. Integrating DB and IR technologies: What is the sound of one hand clapping? In CIDR'05.
|
| |
6
|
|
| |
7
|
B. Ding et al. Finding top-k min-cost connected trees in databases. In ICDE'07.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
Jim Gray , Adam Bosworth , Andrew Layman , Hamid Pirahesh, Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total, Proceedings of the Twelfth International Conference on Data Engineering, p.152-159, February 26-March 01, 1996
|
 |
12
|
Jiawei Han , Jian Pei , Guozhu Dong , Ke Wang, Efficient computation of Iceberg cubes with complex measures, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.1-12, May 21-24, 2001, Santa Barbara, California, United States
|
| |
13
|
Donna Harman , R. Baeza-Yates , Edward Fox , W. Lee, Inverted files, Information retrieval: data structures and algorithms, Prentice-Hall, Inc., Upper Saddle River, NJ, 1992
|
 |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
Varun Kacholia , Shashank Pandit , Soumen Chakrabarti , S. Sudarshan , Rushi Desai , Hrishikesh Karambelkar, Bidirectional expansion for keyword search on graph databases, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
 |
18
|
|
 |
19
|
Fang Liu , Clement Yu , Weiyi Meng , Abdur Chowdhury, Effective keyword search in relational databases, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
[doi> 10.1145/1142473.1142536]
|
 |
20
|
|
 |
21
|
Raymond T. Ng , Alan Wagner , Yu Yin, Iceberg-cube computation with PC clusters, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.25-36, May 21-24, 2001, Santa Barbara, California, United States
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
| |
25
|
Dong Xin , Jiawei Han , Xiaolei Li , Benjamin W. Wah, Star-cubing: computing iceberg cubes by top-down and bottom-up integration, Proceedings of the 29th international conference on Very large data bases, p.476-487, September 09-12, 2003, Berlin, Germany
|
 |
26
|
|
CITED BY
|
|
Yi Chen , Wei Wang , Ziyang Liu , Xuemin Lin, Keyword search on structured and semi-structured data, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|