ACM Home Page
Please provide us with feedback. Feedback
Keyword search on external memory data graphs
Full text PdfPdf (1.20 MB)
Source
Proceedings of the VLDB Endowment archive
Volume 1 ,  Issue 1  (August 2008) table of contents
SESSION: Mining B and external memory table of contents
Pages 1189-1204  
Year of Publication: 2008
ISSN:2150-8097
Authors
Bhavana Bharat Dalvi  I.I.T. Bombay
Meghana Kshirsagar  I.I.T. Bombay
S. Sudarshan  I.I.T. Bombay
Publisher
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 116,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1453856.1453982
What is a DOI?

ABSTRACT

Keyword search on graph structured data has attracted a lot of attention in recent years. Graphs are a natural "lowest common denominator" representation which can combine relational, XML and HTML data. Responses to keyword queries are usually modeled as trees that connect nodes matching the keywords.

In this paper we address the problem of keyword search on graphs that may be significantly larger than memory. We propose a graph representation technique that combines a condensed version of the graph (the "supernode graph") which is always memory resident, along with whatever parts of the detailed graph are in a cache, to form a multi-granular graph representation. We propose two alternative approaches which extend existing search algorithms to exploit multigranular graphs; both approaches attempt to minimize IO by directing search towards areas of the graph that are likely to give good results. We compare our algorithms with a virtual memory approach on several real data sets. Our experimental results show significant benefits in terms of reduction in IO due to our algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Sanjay Agrawal, Surajit Chaudhari, and Gautam Das. DBXplorer: A system for keyword-based search search over relational databases. ICDE, 2002.
 
2
 
3
Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using BANKS. ICDE, 2002.
 
4
Kumar Gaurav Bijay. Towards external memory algorithms for keyword-search in relational databases. Bachelors Thesis, under the guidance of S. Sudarshan, 2006.
 
5
 
6
7
8
 
9
Bolin Ding, Jeffrey Xu Yu, Shan Wang, Lu Qin, Xiao Zhang, and Xuemin Lin. Finding top-k min-cost connected trees in databases. ICDE, 2007.
 
10
11
 
12
Nitin Gupta. EMBANKS: Towards disk based algorithms for keyword-search in structured databases. Bachelors Thesis, under the guidance of S. Sudarshan, 2006.
13
 
14
 
15
 
16
17
 
18
 
19
20
21
 
22
M. H. Nodine, M. T. Goodrich, and J. S. Vitter. Blocking for external graph searching. Algorithmica, 16: 181--214, 1996.
 
23
Sriram Raghavan and Hector Garcia-Molina. Representing Web graphs. ICDE, pages 405--416, 2003.
 
24
 
25


Collaborative Colleagues:
Bhavana Bharat Dalvi: colleagues
Meghana Kshirsagar: colleagues
S. Sudarshan: colleagues