| GMX: an XML data partitioning scheme for holistic twig joins |
| Full text |
Pdf
(446 KB)
|
| Source
|
International Conference on Information Integration and web-based Applications and Services
archive
Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
table of contents
Linz, Austria
SESSION: iiWAS 2008: XML data modelling and processing
table of contents
Pages 137-146
Year of Publication: 2008
ISBN:978-1-60558-349-5
|
|
Authors
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 45, Citation Count: 1
|
|
|
ABSTRACT
As traditional partitioning strategies do not serve well for semistructured data, partitioning and distributing heterogeneous XML documents onto a parallel cluster system have lead to such an intricacy issue for maintaining good query processing performance. In this paper, we propose a grid metadata model for XML that gives a conceptual view to partition XML data, specifically for holistic twig joins processing. The proposed model adopts a cost-based model and facilitates a set of partition refinement methods for workload balancing purpose. The model has features of reducing the workload variance significantly on the cluster system, duplicating XML data necessarily to avoid data dependency among cluster nodes, and exploiting inter query parallelism and intra query parallelism. We evaluate the effectiveness of our proposed model in the experiment that our data partitioning method has better workload balance and has an impact on better parallel speed up performance as well.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Niagara Query Engine. http://www.cs.wisc.edu/niagara.
|
| |
2
|
Stanford University Infolab. http://infolab.stanford.edu/pub/movies/dtd.html.
|
| |
3
|
|
| |
4
|
|
| |
5
|
J.-M. Bremer and M. Gertz. On Distributing XML Repositories. In International Workshop on the Web and Databases (WebDB), pages 73--78, 2003.
|
 |
6
|
|
| |
7
|
|
| |
8
|
R. Hockney and M. Berry. Public International Benchmarks for Parallel Computers. Technical report, PARKBENCH Committee on Parallel Benchmarks, 1994.
|
| |
9
|
|
 |
10
|
Ying Guang Li , Stéphane Bressan , Gillian Dobbie , Zoé Lacroix , Mong Li Lee , Ullas Nambiar , Bimlesh Wadhwa, XOO7: applying OO7 benchmark to XML query processing tool, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA
[doi> 10.1145/502585.502614]
|
| |
11
|
|
 |
12
|
|
| |
13
|
Albrecht Schmidt , Florian Waas , Martin Kersten , Michael J. Carey , Ioana Manolescu , Ralph Busse, XMark: a benchmark for XML data management, Proceedings of the 28th international conference on Very Large Data Bases, p.974-985, August 20-23, 2002, Hong Kong, China
|
| |
14
|
|
| |
15
|
|
 |
16
|
Chun Zhang , Jeffrey Naughton , David DeWitt , Qiong Luo , Guy Lohman, On supporting containment queries in relational database management systems, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.425-436, May 21-24, 2001, Santa Barbara, California, United States
|
|