| Query ranking in probabilistic XML data |
| Full text |
Pdf
(698 KB)
|
| Source
|
Extending Database Technology; Vol. 360
archive
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
table of contents
Saint Petersburg, Russia
SESSION: Research sessions: XML, XPath, XQuery
table of contents
Pages 156-167
Year of Publication: 2009
ISBN:978-1-60558-422-5
|
|
Authors
|
|
Lijun Chang
|
The Chinese University of Hong Kong, Hong Kong, China
|
|
Jeffrey Xu Yu
|
The Chinese University of Hong Kong, Hong Kong, China
|
|
Lu Qin
|
The Chinese University of Hong Kong, Hong Kong, China
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 18, Downloads (12 Months): 94, Citation Count: 0
|
|
|
ABSTRACT
Twig queries have been extensively studied as a major fragment of XPATH queries to query XML data. In this paper, we study PXML-RANK query, (Q, k), which is to rank top-k probabilities of the answers of a twig query Q in probabilistic XML (PXML) data. A new research issue is how to compute top-k probabilities of answers of a twig query Q in PXML in the presence of containment (ancestor/descendant) relationships. In the presence of the ancestor/descendant relationships, the existing dynamic programming approaches to rank top-k probabilities over a set of tuples cannot be directly applied, because any node/edge in PXML may have impacts on the top-k probabilities of answers. We propose new algorithms to compute PXML-RANK queries efficiently and give conditions under which a PXML-RANK query can be processed efficiently without enumeration of all the possible worlds. We conduct extensive performance studies using both real and large benchmark datasets, and confirm the efficiency of our algorithms.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. Abiteboul and P. Senellart. Querying and updating probabilistic information in XML. In Proc. of EDBT'06, pages 1059--1068, 2006.
|
| |
2
|
Sihem Amer-Yahia , Nick Koudas , Amélie Marian , Divesh Srivastava , David Toman, Structure and content scoring for XML, Proceedings of the 31st international conference on Very large data bases, August 30-September 02, 2005, Trondheim, Norway
|
| |
3
|
|
 |
4
|
|
 |
5
|
|
| |
6
|
Sara Cohen , Jonathan Mamou , Yaron Kanza , Yehoshua Sagiv, XSEarch: a semantic search engine for XML, Proceedings of the 29th international conference on Very large data bases, p.45-56, September 09-12, 2003, Berlin, Germany
|
 |
7
|
|
 |
8
|
|
| |
9
|
M. Hua, J. Pei, W. Zhang, and X. Lin. Efficiently answering probabilistic threshold top-k queries on uncertain data. In Proc. of ICDE'08, pages 1403--1405, 2008.
|
 |
10
|
|
| |
11
|
|
| |
12
|
E. Hung, L. Getoor, and V. S. Subrahmanian. PXML: A probabilistic semistructured data model and algebra. In Proc. of ICDE'03, 2003.
|
| |
13
|
|
 |
14
|
|
| |
15
|
B. Kimelfeld and Y. Sagiv. Twig patterns: From XML trees to graphs. In Proc. of WebDB'06, 2006.
|
| |
16
|
|
| |
17
|
L. Qin, J. X. Yu, and B. Ding. TwigList: Make twig pattern matching fast. In Proc. of DASFAA'07, pages 850--862, 2007.
|
| |
18
|
C. Re, N. N. Dalvi, and D. Suciu. Efficient top-k query evaluation on probabilistic data. In Proc. of ICDE'07, pages 886--895, 2007.
|
| |
19
|
|
 |
20
|
|
| |
21
|
M. A. Soliman, I. F. Ilyas, and K. C.-C. Chang. Top-k query processing in uncertain databases. In Proc. of ICDE'07, pages 896--905, 2007.
|
| |
22
|
|
| |
23
|
K. Yi, F. Li, G. Kollios, and D. Srivastava. Efficient processing of top-k queries in uncertain databases. In Proc. of ICDE'08, 2008.
|
| |
24
|
K. Yi, F. Li, G. Kollios, and D. Srivastava. Efficient processing of top-k queries in uncertain databases with x-relations. In Proc. of TKDE'08, 2008.
|
| |
25
|
X. Zhang and J. Chomicki. On the semantics and evaluation of top-k queries in probabilistic databases. In Proc. of DBRank'08, pages 556--563, 2008.
|
|