|
ABSTRACT
This paper takes a first step towards the design and normalization theory for XML documents. We show that, like relational databases, XML documents may contain redundant information, and may be prone to update anomalies. Furthermore, such problems are caused by certain functional dependencies among paths in the document. Our goal is to find a way of converting an arbitrary DTD into a well-designed one, that avoids these problems. We first introduce the concept of a functional dependency for XML, and define its semantics via a relational representation of XML. We then define an XML normal form, XNF, that avoids update anomalies and redundancies. We study its properties and show that it generalizes BCNF and a normal form for nested relations when those are appropriately coded as XML documents. Finally, we present a lossless algorithm for converting any DTD into one in XNF.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
C. Beeri, P. Bernstein, N. Goodman. A sophisticate's introduction to database normalization theory. VLDB'78, pages 113-124.
|
 |
5
|
Peter Buneman , Susan Davidson , Wenfei Fan , Carmem Hara , Wang-Chiew Tan, Keys for XML, Proceedings of the 10th international conference on World Wide Web, p.201-210, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.371984]
|
| |
6
|
|
| |
7
|
|
| |
8
|
DBLP. http://dblp.uni-trier.de/.
|
| |
9
|
W. F. Dowling and J. H. Gallier. Linear-time algorithms for testing the satisfiability of propositional Horn formulae. JLP 1(3): 267-284 (1984).
|
| |
10
|
ebXML. Business Process Specification Schema v1.01. http://www.ebxml.org/specs/.
|
 |
11
|
|
 |
12
|
|
| |
13
|
|
| |
14
|
D. Florescu, D. Kossmann. Storing and querying XML data using an RDMBS. IEEE Data Eng. Bull. 22 (1999), 27-34.
|
| |
15
|
|
| |
16
|
|
| |
17
|
J. Higgins, R. Jelliffe QAML Version 2.4. http://xml.ascc.net/resource/qaml-xml.dtd, 1999.
|
| |
18
|
|
 |
19
|
|
| |
20
|
|
| |
21
|
|
 |
22
|
|
 |
23
|
|
 |
24
|
|
| |
25
|
Jayavel Shanmugasundaram , Kristin Tufte , Chun Zhang , Gang He , David J. DeWitt , Jeffrey F. Naughton, Relational Databases for Querying XML Documents: Limitations and Opportunities, Proceedings of the 25th International Conference on Very Large Data Bases, p.302-314, September 07-10, 1999
|
| |
26
|
|
 |
27
|
|
 |
28
|
Igor Tatarinov , Zachary G. Ives , Alon Y. Halevy , Daniel S. Weld, Updating XML, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.413-424, May 21-24, 2001, Santa Barbara, California, United States
|
| |
29
|
|
| |
30
|
W3C. XML-Data. W3C Note, Jan. 1998.
|
| |
31
|
W3C. XML Schema. W3C Working Draft, May 2001.
|
| |
32
|
W3C. XQuery 1.0: An XML Query Language. W3C Working Draft, June 2001.
|
CITED BY 21
|
|
|
|
|
|
|
|
|
|
|
|
|
|
H. V. Jagadish , Laks V. S. Lakshmanan , Monica Scannapieco , Divesh Srivastava , Nuwee Wiwatwattana, Colorful XML: one hierarchy isn't enough, Proceedings of the 2004 ACM SIGMOD international conference on Management of data, June 13-18, 2004, Paris, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yi Chen , Susan Davidson , Carmem Hara , Yifeng Zheng, RRXS: redundancy reducing XML storage in relations, Proceedings of the 29th international conference on Very large data bases, p.189-200, September 09-12, 2003, Berlin, Germany
|
|
|
|
REVIEW
"Herman Fischer : Reviewer"
Arenas and Libkin define the background and requirements of a normal form of Extensible Markup Language (XML), with the goal of converting arbitrary XML into well-formed XML. Since my own work is with Extensible Business Reporting Language (XBRL)
more...
|