An approach to XML path matching
Workshop On Web Information And Data Management
Proceedings of the 9th annual ACM international workshop on Web information and data management
Lisbon, Portugal
SESSION: XML and semi-structured data
Pages 17-24
Year of Publication: 2007
ISBN: 978-1-59593-829-9
ABSTRACT
XML path matching plays a central role in applications that integrate XML data and query XML instances. This paper presents an approach to matching XML paths that consists of (1) PathSim, a similarity function designed specifically for matching XML paths, and (2) a set of pre-processing functions that are applied to XML paths before a similarity function compares them. The reported experiments show that PathSim produces matches of higher quality than a similarity function for XML paths found in the literature. They also show that match quality improves when the proposed pre-processing functions are employed.
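The abstract does not define PathSim or the pre-processing functions, so the following is only an illustrative sketch of the general pipeline it describes: pre-process two XML paths, then score them with a similarity function. Everything here is an assumption made for illustration, not the authors' method. The similarity is a plain element-level edit distance normalised to [0, 1], and the hypothetical `preprocess` step merely lowercases element names and strips namespace prefixes.

```python
from typing import List


def split_path(path: str) -> List[str]:
    """Split a path such as '/library/book/title' into element names."""
    return [step for step in path.split("/") if step]


def preprocess(steps: List[str]) -> List[str]:
    """Hypothetical pre-processing (an assumption, not from the paper):
    drop namespace prefixes and lowercase each element name."""
    return [step.split(":")[-1].lower() for step in steps]


def edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance where the edited units are whole element names."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            cur.append(min(prev[j] + 1,          # delete x
                           cur[j - 1] + 1,       # insert y
                           prev[j - 1] + cost))  # substitute x with y
        prev = cur
    return prev[-1]


def path_similarity(p1: str, p2: str, use_preprocessing: bool = True) -> float:
    """Stand-in similarity (NOT PathSim): 1 - normalised edit distance."""
    s1, s2 = split_path(p1), split_path(p2)
    if use_preprocessing:
        s1, s2 = preprocess(s1), preprocess(s2)
    longest = max(len(s1), len(s2))
    if longest == 0:
        return 1.0  # two empty paths are treated as identical
    return 1.0 - edit_distance(s1, s2) / longest


if __name__ == "__main__":
    a, b = "/Library/Book/Title", "/library/book/title"
    print(path_similarity(a, b, use_preprocessing=False))
    print(path_similarity(a, b, use_preprocessing=True))
```

In this toy setting, the two paths differ only in letter case, so they score as entirely dissimilar without pre-processing and as identical with it. This mirrors, in a very simplified way, the claim that pre-processing can improve match quality.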