ACM Home Page
Please provide us with feedback. Feedback
GMX: an XML data partitioning scheme for holistic twig joins
Full text PdfPdf (446 KB)
Source International Conference on Information Integration and web-based Applications and Services archive
Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services table of contents
Linz, Austria
SESSION: iiWAS 2008: XML data modelling and processing table of contents
Pages 137-146  
Year of Publication: 2008
ISBN:978-1-60558-349-5
Authors
Imam Machdi  University of Tsukuba, Japan
Toshiyuki Amagasa  University of Tsukuba, Japan
Hiroyuki Kitagawa  University of Tsukuba, Japan
Sponsor
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 45,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1497308.1497338
What is a DOI?

ABSTRACT

As traditional partitioning strategies do not serve well for semistructured data, partitioning and distributing heterogeneous XML documents onto a parallel cluster system have lead to such an intricacy issue for maintaining good query processing performance. In this paper, we propose a grid metadata model for XML that gives a conceptual view to partition XML data, specifically for holistic twig joins processing. The proposed model adopts a cost-based model and facilitates a set of partition refinement methods for workload balancing purpose. The model has features of reducing the workload variance significantly on the cluster system, duplicating XML data necessarily to avoid data dependency among cluster nodes, and exploiting inter query parallelism and intra query parallelism. We evaluate the effectiveness of our proposed model in the experiment that our data partitioning method has better workload balance and has an impact on better parallel speed up performance as well.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Niagara Query Engine. http://www.cs.wisc.edu/niagara.
 
2
Stanford University Infolab. http://infolab.stanford.edu/pub/movies/dtd.html.
 
3
 
4
 
5
J.-M. Bremer and M. Gertz. On Distributing XML Repositories. In International Workshop on the Web and Databases (WebDB), pages 73--78, 2003.
6
 
7
 
8
R. Hockney and M. Berry. Public International Benchmarks for Parallel Computers. Technical report, PARKBENCH Committee on Parallel Benchmarks, 1994.
 
9
10
 
11
12
 
13
 
14
 
15
16


Collaborative Colleagues:
Imam Machdi: colleagues
Toshiyuki Amagasa: colleagues
Hiroyuki Kitagawa: colleagues