|
ABSTRACT
Shortest-path query processing not only serves as a long established routine for numerous applications in the past but also is of increasing popularity to support novel graph applications in very large databases nowadays. For a large graph, there is the new scenario to query intensively against arbitrary nodes, asking to quickly return node distance or even shortest paths. And traditional main memory algorithms and shortest paths materialization become inadequate. We are interested in graph labelings to encode the underlying graphs and assign labels to nodes to support efficient query processing. Surprisingly, the existing work of this category mainly emphasizes on reachability query processing, while no sufficient effort has been given to distance labelings to support querying exact shortest distances between nodes. Distance labelings must be developed on the graph in whole to correctly retain node distance information. It makes many existing methods to be inapplicable. We focus on fast computing distance-aware 2-hop covers, which can encode the all-pairs shortest paths of a graph in O(|V|·|E|1/2) space. Our approach exploits strongly connected components collapsing and graph partitioning to gain speed, while it can overcome the challenges in correctly retaining node distance information and appropriately encoding all-pairs shortest paths with small overhead. Furthermore, our approach avoids pre-computing all-pairs shortest paths, which can be prohibitive over large graphs. We conducted extensive performance studies, and confirm the efficiency of our proposed new approaches.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
Lars Backstrom , Dan Huttenlocher , Jon Kleinberg , Xiangyang Lan, Group formation in large social networks: membership, growth, and evolution, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, August 20-23, 2006, Philadelphia, PA, USA
[doi> 10.1145/1150402.1150412]
|
| |
4
|
R. Bellman. On a routing problem. Quarterly of Applied Mathematics, 16(1):87--90, 1958.
|
 |
5
|
|
| |
6
|
|
| |
7
|
J. Cheng, J. X. Yu, X. Lin, H. Wang, and P. S. Yu. Fast computation of reachability labeling for large graphs. In Proc. of EDBT'06, 2006.
|
 |
8
|
Jiefeng Cheng , Jeffrey Xu Yu , Xuemin Lin , Haixun Wang , Philip S. Yu, Fast computing reachability labelings for large graphs with high compression rate, Proceedings of the 11th international conference on Extending database technology: Advances in database technology, March 25-29, 2008, Nantes, France
[doi> 10.1145/1353343.1353370]
|
| |
9
|
Edith Cohen , Eran Halperin , Haim Kaplan , Uri Zwick, Reachability and distance queries via 2-hop labels, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, p.937-946, January 06-08, 2002, San Francisco, California
|
| |
10
|
|
| |
11
|
F. Dabek, R. Cox, F. Kaashoek, and R. Morris. Predicting internet network distance with coordinates-based approaches. In Proc. of SIGCOMM '04, 2004.
|
| |
12
|
E. W. Dijkstra. A note on two problems in connection with graphs. Numerische Math., 1:269--271, 1959.
|
 |
13
|
|
| |
14
|
|
| |
15
|
A. V. Goldberg and R. F. Werneck. Computing point-to-point shortest paths from external memory. In Proc. of ALENEX '05, 2005.
|
| |
16
|
A. V. Goldberg and R. F. Werneck. Reach for a*: Efficient point-to-point shortest path algorithms. In Proc. of ALENEX '06, 2006.
|
 |
17
|
|
 |
18
|
|
 |
19
|
|
 |
20
|
|
| |
21
|
|
| |
22
|
D. B. Johnson. Finding all the elementary circuits of a directed graph. SIAM J. Comput., 4(1):77--84, 1975.
|
 |
23
|
|
 |
24
|
|
| |
25
|
|
 |
26
|
|
| |
27
|
I. M. Keseler, J. Collado-Vides, S. Gama-Castro, J. Ingraham, S. Paley, I. T. Paulsen, M. Peralta-Gil, and P. D. Karp. Ecocyc: a comprehensive database resource for escherichia coli. Nucleic Acids Research, 33(D334-7), 2005.
|
 |
28
|
|
| |
29
|
T. S. E. Ng and H. Zhang. Predicting internet network distance with coordiantes-based approaches. In Proc. of INFOCOM '01, 2001.
|
| |
30
|
|
| |
31
|
|
| |
32
|
|
| |
33
|
R. Schenkel, A. Theobald, and G. Weikum. Hopi: An efficient connection index for complex XML document collections. In Proc. of EDBT'04, 2004.
|
| |
34
|
|
| |
35
|
Albrecht Schmidt , Florian Waas , Martin Kersten , Michael J. Carey , Ioana Manolescu , Ralph Busse, XMark: a benchmark for XML data management, Proceedings of the 28th international conference on Very Large Data Bases, p.974-985, August 20-23, 2002, Hong Kong, China
|
| |
36
|
R. E. Tarjan. Enumeration of the elementary circuits of a directed graph. SIAM J. Comput., 2(3):211--216, 1973.
|
 |
37
|
|
 |
38
|
|
| |
39
|
|
| |
40
|
S. Wasserman and K. Faust. Social Network Analysis. Cambridge University Press, 1994.
|
| |
41
|
|
 |
42
|
|
|