| Efficiently answering reachability queries on very large directed graphs |
| Full text |
Pdf
(516 KB)
|
Source
|
International Conference on Management of Data
archive
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
table of contents
Vancouver, Canada
SESSION: Research Session 13: Graphs II
table of contents
Pages 595-608
Year of Publication: 2008
ISBN:978-1-60558-102-6
|
|
Authors
|
|
Ruoming Jin
|
Kent State University, Kent, OH, USA
|
|
Yang Xiang
|
Kent State University, Kent, OH, USA
|
|
Ning Ruan
|
Kent State University, Kent, OH, USA
|
|
Haixun Wang
|
IBM T.J. Watson Research, Hawthorne, NY, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 44, Downloads (12 Months): 344, Citation Count: 2
|
|
|
ABSTRACT
Efficiently processing queries against very large graphs is an important research topic largely driven by emerging real world applications, as diverse as XML databases, GIS, web mining, social network analysis, ontologies, and bioinformatics. In particular, graph reachability has attracted a lot of research attention as reachability queries are not only common on graph databases, but they also serve as fundamental operations for many other graph queries. The main idea behind answering reachability queries in graphs is to build indices based on reachability labels. Essentially, each vertex in the graph is assigned with certain labels such that the reachability between any two vertices can be determined by their labels. Several approaches have been proposed for building these reachability labels; among them are interval labeling (tree cover) and 2-hop labeling. However, due to the large number of vertices in many real world graphs (some graphs can easily contain millions of vertices), the computational cost and (index) size of the labels using existing methods would prove too expensive to be practical. In this paper, we introduce a novel graph structure, referred to as path-tree, to help labeling very large graphs. The path-tree cover is a spanning subgraph of G in a tree shape. We demonstrate both analytically and empirically the effectiveness of our new approaches.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
Jiefeng Cheng, Jeffrey Xu Yu, Xuemin Lin, Haixun Wang, and Philip S. Yu. Fast computation of reachability labeling for large graphs. In EDBT, pages 961--979, 2006.
|
| |
4
|
Y. J. Chu and T. H. Liu. On the shortest arborescence of a directed graph. Science Sinica, 14:1396--1400, 1965.
|
| |
5
|
Edith Cohen , Eran Halperin , Haim Kaplan , Uri Zwick, Reachability and distance queries via 2-hop labels, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, p.937-946, January 06-08, 2002, San Francisco, California
|
| |
6
|
|
| |
7
|
|
| |
8
|
J. Edmonds. Optimum branchings. J. Research of the National Bureau of Standards, 71B:233--240, 1967.
|
| |
9
|
|
| |
10
|
A. V. Goldberg, E. Tardos, and R. E. Tarjan. Network Flow Algorithms, pages 101--164. Springer Verlag, 1990.
|
 |
11
|
|
| |
12
|
T. Kameda. On the vector representation of the reachability in planar directed graphs. Information Processing Letters, 3(3), January 1975.
|
| |
13
|
R. Schenkel, A. Theobald, and G. Weikum. HOPI: An efficient connection index for complex XML document collections. In EDBT, 2004.
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
CITED BY 2
|
|
|
|
|
Ruoming Jin , Yang Xiang , Ning Ruan , David Fuhry, 3-HOP: a high-compression indexing scheme for reachability query, Proceedings of the 35th SIGMOD international conference on Management of data, June 29-July 02, 2009, Providence, Rhode Island, USA
|
|