|
ABSTRACT
We study the complexity and expressive power of conjunctive queries over unranked labeled trees, where the tree structure are represented using "axis relations" such as "child", "descendant", and "following" (we consider a superset of the XPath axes) as well as unary relations for node labels. (Cyclic) conjunctive queries over trees occur in a wide range of data management scenarios related to XML, the Web, and computational linguistics. We establish a framework for characterizing structures representing trees for which conjunctive queries can be evaluated efficiently. Then we completely chart the tractability frontier of the problem for our axis relations, i.e., we find all subset maximal sets of axes for which query evaluation is in polynomial time. All polynomial-time results are obtained immediately using the proof techniques from our framework. Finally, we study the expressiveness of conjunctive queries over trees and compare it to the expressive power of fragments of XPath. We show that for each conjunctive query, there is an equivalent acyclic positive query (i.e., a set of acyclic conjunctive queries), but that in general this query is not of polynomial size.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
A. Deutsch and V. Tannen. "MARS: A System for Publishing XML from Mixed and Redundant Storage". In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03), pages 201--212, 2003.
|
| |
9
|
|
| |
10
|
H.-D. Ebbinghaus and J. Flum. Finite Model Theory. Springer-Verlag, 1999. Second edition.
|
| |
11
|
|
| |
12
|
|
 |
13
|
|
| |
14
|
G. Gottlob, C. Koch, and R. Pichler. "Efficient Algorithms for Processing XPath Queries". In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02), Hong Kong, China, 2002.
|
 |
15
|
|
| |
16
|
J. Hidders. "Satisfiability of XPath Expressions". In Proc. DBPL, 2003.
|
| |
17
|
LDC. "The Penn Treebank Project", 1999. http://www.cis.upenn.edu/~treebank/home.html.
|
| |
18
|
|
| |
19
|
|
 |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
D. Olteanu, H. Meuss, T. Furche, and F. Bry. "Symmetry in XPath". In Proc. EDBT Workshop on XML Data Management, 2002.
|
 |
24
|
|
| |
25
|
|
| |
26
|
M. Schmidt-Schauß and J. Stuber. "On the Complexity of Linear and Stratified Context Matching Problems", 2001. Unpublished manuscript.
|
| |
27
|
|
| |
28
|
World Wide Web Consortium. XML Path Language (XPath) Recommendation. http://www.w3c.org/TR/xpath/, Nov. 1999.
|
| |
29
|
M. Yannakakis. "Algorithms for Acyclic Database Schemes". In Proceedings of the 7th International Conference on Very Large Data Bases (VLDB'81), 1981.
|
CITED BY 18
|
|
|
|
|
|
|
|
Georg Gottlob , Christoph Koch , Robert Baumgartner , Marcus Herzog , Sergio Flesca, The Lixto data extraction project: back and forth between theory and practice, Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, June 14-16, 2004, Paris, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
François Bry , Tim Furche , Benedikt Linse , Andreas Schroeder, Efficient evaluation of n-ary conjunctive queries over trees and graphs, Proceedings of the eighth ACM international workshop on Web information and data management, November 10-10, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|