ABSTRACT
The Document Object Model (DOM) is a widely used approach for retrieving data from an XML document. If the XML document is very large, however, DOM-based retrieval may run out of memory while building the associated XML tree in main memory. To alleviate this problem, we propose a method that splits the very large XML document into n small XML documents, retrieves data from the XML tree built from each of these small documents, and combines the results from all n XML trees to generate the final result. With this approach, the memory space and processing time required to retrieve data from the very large XML document using DOM are reduced to a level that a single general-purpose personal computer can handle.
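The split, query, and combine steps can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not say how the document is split, so this sketch assumes the records are direct children of the root element, uses a streaming parser (`xml.etree.ElementTree.iterparse`) to cut the input into small documents, and builds a DOM tree with `xml.dom.minidom` for one small document at a time. The function names, the `<chunk>` wrapper element, and the `chunk_size` parameter are hypothetical, and namespaces are ignored.

```python
import io
import xml.etree.ElementTree as ET
from xml.dom import minidom


def split_records(source, record_tag, chunk_size):
    """Yield small XML documents (strings) holding up to chunk_size records each.

    Assumption: records are direct children of the root element. The split
    uses a streaming parser, so the full tree is never held in memory.
    """
    batch = []
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)  # first event is the start of the root element
    for event, elem in context:
        if event == "end" and elem.tag == record_tag:
            elem.tail = None  # ignore whitespace between records
            batch.append(ET.tostring(elem, encoding="unicode"))
            root.clear()  # discard records that were already serialized
            if len(batch) == chunk_size:
                yield "<chunk>" + "".join(batch) + "</chunk>"
                batch = []
    if batch:
        yield "<chunk>" + "".join(batch) + "</chunk>"


def query_chunk(chunk_xml, field_tag):
    """Build a DOM tree for one small document and collect the text of field_tag."""
    dom = minidom.parseString(chunk_xml)
    try:
        return [
            node.firstChild.data
            for node in dom.getElementsByTagName(field_tag)
            if node.firstChild is not None
        ]
    finally:
        dom.unlink()  # release the small tree before the next one is built


def query_large_document(source, record_tag, field_tag, chunk_size):
    """Combine the per-chunk results into the final result, in document order."""
    results = []
    for chunk_xml in split_records(source, record_tag, chunk_size):
        results.extend(query_chunk(chunk_xml, field_tag))
    return results
```

In this sketch only one small DOM tree exists at any time, so peak memory depends on `chunk_size` rather than on the size of the whole document. Queries that relate elements in different chunks would need extra handling that the sketch does not attempt.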