ACM Home Page
Please provide us with feedback. Feedback
Approximate XML document matching
Full text PdfPdf (123 KB)
Source Symposium on Applied Computing archive
Proceedings of the 2005 ACM symposium on Applied computing table of contents
Santa Fe, New Mexico
SESSION: Document engineering (DE): poster paper table of contents
Pages: 787 - 788  
Year of Publication: 2005
ISBN:1-58113-964-0
Authors
E. Rodney Canfield  University of Georgia, Athens, GA
Guangming Xing  Western Kentucky University, Bowling Green, KY
Sponsor
SIGAPP: ACM Special Interest Group on Applied Computing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 19,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1066677.1066857
What is a DOI?

ABSTRACT

Regular Hedge Grammar is a formal method to specify XML schema. XML document can be viewed as an ordered labeled tree. Computing the approximate matching between an XML document with a schema with minimum cost is not only theoretically interesting. This problem can be modeled as: Given an ordered labeled tree F and a regular hedge grammar P, how to compute the minimum edit distance to transform the forest F into F' so that F' is exactly matched by P. In this paper, with the introduction of leaf forest, we gave an algorithm for this problem in O(F2P(F + log P)) time, where F is the size of the forest and P is the size of the grammar. From the authors' knowledge, this is the first algorithm to transform an XML document (ordered labeled tree) to conform to a schema (tree grammar).


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Shasha, D., Zhang, K. Approximate Tree Pattern Matching, chapter 14 Pattern Matching Algorithms (eds. Apostolico, A. and Galil, Z.), Oxford University Press, June 1997.
 
2
Murata, M Hedge Automata: A Formal Model for XML Schemata http://www.xml.gr.jp/relax/hedge_nice.html
 
3



REVIEW

"Anthony Joseph Duben : Reviewer"

The estimation of the cost of evaluating the match between an Extensible Markup Language (XML) document and a schema is a problem of practical significance. Since XML is becoming increasingly prevalent as the underlying language for describing doc  more...

Collaborative Colleagues:
E. Rodney Canfield: colleagues
Guangming Xing: colleagues