| EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data |
| Full text |
Pdf
(394 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
table of contents
Vancouver, Canada
SESSION: Research Session 19: Keywords on Structure
table of contents
Pages 903-914
Year of Publication: 2008
ISBN:978-1-60558-102-6
|
|
Authors
|
|
Guoliang Li
|
Tsinghua University, Beijing, China
|
|
Beng Chin Ooi
|
National University of Singapore, Singapore, Singapore
|
|
Jianhua Feng
|
Tsinghua University, Beijing, China
|
|
Jianyong Wang
|
Tsinghua University, Beijing, China
|
|
Lizhu Zhou
|
Tsinghua University, Beijing, China
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 42, Downloads (12 Months): 338, Citation Count: 9
|
|
|
ABSTRACT
Conventional keyword search engines are restricted to a given data model and cannot easily adapt to unstructured, semi-structured or structured data. In this paper, we propose an efficient and adaptive keyword search method, called EASE, for indexing and querying large collections of heterogenous data. To achieve high efficiency in processing keyword queries, we first model unstructured, semi-structured and structured data as graphs, and then summarize the graphs and construct graph indices instead of using traditional inverted indices. We propose an extended inverted index to facilitate keyword-based search, and present a novel ranking mechanism for enhancing search effectiveness. We have conducted an extensive experimental study using real datasets, and the results show that EASE achieves both high search efficiency and high accuracy, and outperforms the existing approaches significantly.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
S. Chaudhuri, R. Ramakrishnan, and G. Weikum. Integrating db and ir technologies: What is the sound of one hand clapping? In CIDR, pages 1--12, 2005.
|
| |
4
|
Sara Cohen , Jonathan Mamou , Yaron Kanza , Yehoshua Sagiv, XSEarch: a semantic search engine for XML, Proceedings of the 29th international conference on Very large data bases, p.45-56, September 09-12, 2003, Berlin, Germany
|
| |
5
|
P. DeRose, W. Shen, F. Chen, Y. Lee, D. Burdick, A. Doan, and R. Ramakrishnan. Dblife: A community information management platform for the database research community. In CIDR, 2007.
|
| |
6
|
B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin. Finding top-k min-cost connected trees in databases. In ICDE, 2007.
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
| |
10
|
L. Guo, J. Shanmugasundaram, and G. Yona. Topology search over biological databases. In ICDE, 2007.
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
V. Hristidis, Y. Papakonstantinou, and A. Balmin. Keyword proximity search on XML graphs. In ICDE, pages 367--378, 2003.
|
| |
17
|
Varun Kacholia , Shashank Pandit , Soumen Chakrabarti , S. Sudarshan , Rushi Desai , Hrishikesh Karambelkar, Bidirectional expansion for keyword search on graph databases, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
 |
18
|
Wen-Syan Li , K. Selçuk Candan , Quoc Vu , Divyakant Agrawal, Retrieving and organizing web pages by “information unit”, Proceedings of the 10th international conference on World Wide Web, p.230-244, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372057]
|
 |
19
|
|
 |
20
|
Fang Liu , Clement Yu , Weiyi Meng , Abdur Chowdhury, Effective keyword search in relational databases, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
[doi> 10.1145/1142473.1142536]
|
 |
21
|
|
 |
22
|
|
 |
23
|
|
| |
24
|
M. Mutsuzaki, M. Theobald, A. Keijzer, J. Widom, P. Agrawal, andet al. Trio-one: Layering uncertainty and lineage on a conventional dbms. In CIDR, 2007.
|
 |
25
|
|
| |
26
|
|
 |
27
|
|
 |
28
|
|
 |
29
|
|
CITED BY 9
|
|
|
|
|
|
|
|
Guoliang Li , Shengyue Ji , Chen Li , Jianhua Feng, Efficient type-ahead search on relational data: a TASTIER approach, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
Yueguo Chen , Su Chen , Yu Gu , Mei Hui , Feng Li , Chen Liu , Liangxu Liu , Beng Chin Ooi , Xiaoyan Yang , Dongxiang Zhang , Yuan Zhou, MarcoPolo: a community system for sharing and integrating travel information on maps, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, March 24-26, 2009, Saint Petersburg, Russia
|
|
|
|
|
|
Yi Chen , Wei Wang , Ziyang Liu , Xuemin Lin, Keyword search on structured and semi-structured data, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|
|
|
|