ACM Home Page
Please provide us with feedback. Feedback
A sampling approach for XML query selectivity estimation
Full text PdfPdf (2.38 MB)
Source Extending Database Technology; Vol. 360 archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology table of contents
Saint Petersburg, Russia
SESSION: Research sessions: XML, XPath, XQuery table of contents
Pages 335-344  
Year of Publication: 2009
ISBN:978-1-60558-422-5
Authors
Cheng Luo  Coppin State University, Baltimore, MD
Zhewei Jiang  Southern Illinois University, Carbondale, IL
Wen-Chi Hou  Southern Illinois University, Carbondale, IL
Feng Yu  Southern Illinois University, Carbondale, IL
Qiang Zhu  University of Michigan-Dearborn, Dearborn, MI
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): n/a,   Downloads (12 Months): n/a,   Citation Count: 0
Additional Information:

abstract   references   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1516360.1516400
What is a DOI?

ABSTRACT

As the Extensible Markup Language (XML) rapidly establishes itself as the de facto standard for presenting, storing, and exchanging data on the Internet, large volume of XML data and their supporting facilities start to surface. A fast and accurate selectivity estimation mechanism is of practical importance because selectivity estimation plays a fundamental role in XML query optimization. Recently proposed techniques are all based on some forms of structure synopses that could be time-consuming to build and not effective for summarizing complex structure relationships. In this research, we propose an innovative sampling method that can capture the tree structures and intricate relationships among nodes in a simple and effective way. The derived sample tree is stored as a synopsis for selectivity estimation. Extensive experimental results show that, in comparison with the state-of-the-art structure synopses, specifically the TreeSketch and Xseed synopses, our sample tree synopsis applies to a broader range of query types, requires several orders of magnitude less construction time, and generates estimates with considerably better precision for complex datasets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Dblp data set. http://www.informatik.unitrier.de/ley/db/index.html.
2
3
4
 
5
 
6
W. G. Cochran. Sampling Techniques. Wiley, 1977.
7
8
 
9
10
11
12
13
14
 
15
 
16
 
17
 
18
19
20
 
21
 
22
 
23
24
 
25
Collaborative Colleagues:
Cheng Luo: colleagues
Zhewei Jiang: colleagues
Wen-Chi Hou: colleagues
Feng Yu: colleagues
Qiang Zhu: colleagues