|
ABSTRACT
Keyword search on graph structured data has attracted a lot of attention in recent years. Graphs are a natural "lowest common denominator" representation which can combine relational, XML and HTML data. Responses to keyword queries are usually modeled as trees that connect nodes matching the keywords. In this paper we address the problem of keyword search on graphs that may be significantly larger than memory. We propose a graph representation technique that combines a condensed version of the graph (the "supernode graph") which is always memory resident, along with whatever parts of the detailed graph are in a cache, to form a multi-granular graph representation. We propose two alternative approaches which extend existing search algorithms to exploit multigranular graphs; both approaches attempt to minimize IO by directing search towards areas of the graph that are likely to give good results. We compare our algorithms with a virtual memory approach on several real data sets. Our experimental results show significant benefits in terms of reduction in IO due to our algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Sanjay Agrawal, Surajit Chaudhari, and Gautam Das. DBXplorer: A system for keyword-based search search over relational databases. ICDE, 2002.
|
| |
2
|
|
| |
3
|
Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using BANKS. ICDE, 2002.
|
| |
4
|
Kumar Gaurav Bijay. Towards external memory algorithms for keyword-search in relational databases. Bachelors Thesis, under the guidance of S. Sudarshan, 2006.
|
| |
5
|
Adam L. Buchsbaum , Michael Goldwasser , Suresh Venkatasubramanian , Jeffery R. Westbrook, On external memory graph traversal, Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms, p.859-860, January 09-11, 2000, San Francisco, California, United States
|
| |
6
|
|
 |
7
|
|
 |
8
|
|
| |
9
|
Bolin Ding, Jeffrey Xu Yu, Shan Wang, Lu Qin, Xiao Zhang, and Xuemin Lin. Finding top-k min-cost connected trees in databases. ICDE, 2007.
|
| |
10
|
|
 |
11
|
|
| |
12
|
Nitin Gupta. EMBANKS: Towards disk based algorithms for keyword-search in structured databases. Bachelors Thesis, under the guidance of S. Sudarshan, 2006.
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Varun Kacholia , Shashank Pandit , Soumen Chakrabarti , S. Sudarshan , Rushi Desai , Hrishikesh Karambelkar, Bidirectional expansion for keyword search on graph databases, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
 |
20
|
Fang Liu , Clement Yu , Weiyi Meng , Abdur Chowdhury, Effective keyword search in relational databases, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, June 27-29, 2006, Chicago, IL, USA
[doi> 10.1145/1142473.1142536]
|
 |
21
|
|
| |
22
|
M. H. Nodine, M. T. Goodrich, and J. S. Vitter. Blocking for external graph searching. Algorithmica, 16: 181--214, 1996.
|
| |
23
|
Sriram Raghavan and Hector Garcia-Molina. Representing Web graphs. ICDE, pages 405--416, 2003.
|
| |
24
|
|
| |
25
|
|
CITED BY 2
|
|
Yi Chen , Wei Wang , Ziyang Liu , Xuemin Lin, Keyword search on structured and semi-structured data, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|
|
|
|