A normal form for XML documents
Source: Symposium on Principles of Database Systems
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems
Madison, Wisconsin
SESSION: Research sessions 2 and 3: information processing on WWW and XML
Pages: 85 - 96  
Year of Publication: 2002
ISBN:1-58113-507-6
Authors
Marcelo Arenas  University of Toronto
Leonid Libkin  University of Toronto
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGMOD: ACM Special Interest Group on Management of Data
SIGACT: ACM Special Interest Group on Algorithms and Computation Theory
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/543613.543625

ABSTRACT

This paper takes a first step towards the design and normalization theory for XML documents. We show that, like relational databases, XML documents may contain redundant information, and may be prone to update anomalies. Furthermore, such problems are caused by certain functional dependencies among paths in the document. Our goal is to find a way of converting an arbitrary DTD into a well-designed one that avoids these problems. We first introduce the concept of a functional dependency for XML, and define its semantics via a relational representation of XML. We then define an XML normal form, XNF, that avoids update anomalies and redundancies. We study its properties and show that it generalizes BCNF and a normal form for nested relations when those are appropriately coded as XML documents. Finally, we present a lossless algorithm for converting any DTD into one in XNF.
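
To make the idea of a functional dependency among paths more concrete, here is a minimal Python sketch. It is not the paper's formal definition or algorithm: it simply flattens a small, hypothetical bibliographic document into flat tuples and checks a dependency over them the way one would for a relation. The document layout, the element and attribute names, and the helpers `to_tuples` and `satisfies_fd` are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: every paper repeats the year of its issue.
DOC = """
<db>
  <conf title="PODS">
    <issue>
      <inproceedings key="p1" year="2002"/>
      <inproceedings key="p2" year="2002"/>
    </issue>
    <issue>
      <inproceedings key="p3" year="2001"/>
    </issue>
  </conf>
</db>
"""


def to_tuples(root):
    """Flatten the tree into one flat tuple (dict) per inproceedings element."""
    rows = []
    for conf in root.findall("conf"):
        for issue_no, issue in enumerate(conf.findall("issue")):
            for paper in issue.findall("inproceedings"):
                rows.append({
                    "conf": conf.get("title"),
                    # (title, position) stands in for the identity of the issue node
                    "issue": (conf.get("title"), issue_no),
                    "key": paper.get("key"),
                    "year": paper.get("year"),
                })
    return rows


def satisfies_fd(rows, lhs, rhs):
    """Return True if rows that agree on the lhs columns also agree on the rhs columns."""
    seen = {}
    for row in rows:
        left = tuple(row[c] for c in lhs)
        right = tuple(row[c] for c in rhs)
        # Remember the first rhs value seen for this lhs value; any later mismatch violates the FD.
        if seen.setdefault(left, right) != right:
            return False
    return True


rows = to_tuples(ET.fromstring(DOC))
print(satisfies_fd(rows, ["issue"], ["year"]))  # issue determines year
print(satisfies_fd(rows, ["year"], ["key"]))    # year does not determine key
```

In this toy document the year is stored once per paper even though the issue alone determines it, which is the kind of redundancy a normalization step would aim to remove, for example by storing the year once on the issue.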





REVIEW

Reviewer: Herman Fischer

Arenas and Libkin define the background and requirements of a normal form of Extensible Markup Language (XML), with the goal of converting arbitrary XML into well-formed XML. Since my own work is with Extensible Business Reporting Language (XBRL) ...
