ACM Home Page
Please provide us with feedback. Feedback
Merging changes in XML documents using reliable context fingerprints
Full text PdfPdf (324 KB)
Source
Document Engineering archive
Proceeding of the eighth ACM symposium on Document engineering table of contents
Sao Paulo, Brazil
SESSION: Structured documents table of contents
Pages 52-61  
Year of Publication: 2008
ISBN:978-1-60558-081-4
Authors
Sebastian Rönnau  Universität der Bundeswehr München, Neubiberg, Germany
Christian Pauli  Universität der Bundeswehr München, Neubiberg, Germany
Uwe M. Borghoff  Universität der Bundeswehr München, Neubiberg, Germany
Sponsors
SIGDOC : ACM Special Interest Group on Systems Documentation
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 112,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1410140.1410151
What is a DOI?

ABSTRACT

Different dialects of XML have emerged as ubiquitous document exchange formats. For effective collaboration based on such documents, the capability to propagate edit operations performed on a document is indispensable. In order to avoid the transmission of whole documents, deltas are used to describe these edit operations, allowing the construction of a new version of a document. However, patching a document with a delta it was not generated for is error-prone, and any insert or delete operations performed on the document are likely to affect all subsequent paths within that document.

In this paper, we present a delta format for XML documents that uses context-aware fingerprints to identify edit operations. This allows our XML patch procedure to find the correct position of an edit operation, even if the document was updated in the meantime. Possible conflicts are detected. Experimental results show the reliability of the presented fingerprinting technique and prove the high quality of the resulting patched documents.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
J. Boyer. Canonical XML version 1.0, 2001.
3
 
4
D. Chamberlin, D. Florescu, J. Melton, J. Robie, and J. Siméon. XQuery Update Facility 1.0, 2008.
5
 
6
J. Clark and S. deRose. XML Path Language (XPath). Technical report, World Wide Web Consortium, 1999.
 
7
8
 
9
D. Eastlake, J. Reagle, and D. Solo. XML-Signature syntax and processing, 2002.
 
10
R. L. Fontaine. Merging XML files: a new approach providing intelligent merge of XML data sets. In Proceedings of XML Europe 2002, 2002.
 
11
Free Software Foundation. Comparing and Merging Files, 2002.
 
12
C.-L. Ignat and M. C. Norrie. Flexible collaboration over XML documents. In CDVE, pages 267--274, 2006.
 
13
S. Khanna, K. Kunal, and B. C. Pierce. A formal investigation of diff3. In Arvind and Prasad, editors, Foundations of Software Technology and Theoretical Computer Science (FSTTCS), Dec. 2007.
 
14
H.-K. Ko and S. Lee. An efficient scheme to completely avoid re-labeling in XML updates. In WISE, pages 259--264, 2006.
 
15
J. Kornblum. Identifying almost identical files using context triggered piecewise hashing. Digital Investigation, 3(Supplement-1):91--97, 2006.
16
 
17
E. Leonardi, S. S. Bhowmick, and S. K. Madria. Xandy: Detecting changes on large unordered XML documents using relational databases. In L. Zhou, B. C. Ooi, and X. Meng, editors, DASFAA, volume 3453 of Lecture Notes in Computer Science, pages 711--723. Springer, 2005.
18
19
20
 
21
 
22
H. Maruyama, K. Tamura, and N. Uramoto. Digest Values for DOM (DOMHASH), 2000.
 
23
R. Rivest. The md5 message-digest algorithm, 1992.
 
24
S. Rönnau and U. M. Borghoff. Intelligent merging of XML documents for distributed collaboration. In Proceedings of the Distributed Intelligent Systems and Technologies Workshop, pages 71--78, St. Petersburg, Russia, 2008.
25
 
26
 
27
B. Stein. Fuzzy-fingerprints for text-based information retrieval. In I-KNOW'05: Proceedings of the 5th International Conference on Knowledge Management, pages 572--579. Journal of Universal Computer Science, 2005.
28


Collaborative Colleagues:
Sebastian Rönnau: colleagues
Christian Pauli: colleagues
Uwe M. Borghoff: colleagues