|
ABSTRACT
Systems for managing and querying semistructured-data sources often store data in proprietary object repositories or in a tagged-text format. We describe a technique that can use relational database management systems to store and manage semistructured data. Our technique relies on a mapping between the semistructured data model and the relational data model, expressed in a query language called STORED. When a semistructured data instance is given, a STORED mapping can be generated automatically using data-mining techniques. We are interested in applying STORED to XML data, which is an instance of semistructured data. We show how a document-type-descriptor (DTD), when present, can be exploited to further improve performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The Lorel query language for semistructured data. International Journal on Digital Libraries, 1(1):68- 88, April 1997.
|
 |
2
|
Rakesh Agrawal , Tomasz Imieliński , Arun Swami, Mining association rules between sets of items in large databases, Proceedings of the 1993 ACM SIGMOD international conference on Management of data, p.207-216, May 25-28, 1993, Washington, D.C., United States
|
| |
3
|
|
 |
4
|
Peter Buneman , Susan Davidson , Gerd Hillebrand , Dan Suciu, A query language and optimization techniques for unstructured data, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.505-516, June 04-06, 1996, Montreal, Quebec, Canada
|
 |
5
|
V. Christophides , S. Abiteboul , S. Cluet , M. Scholl, From structured documents to novel query facilities, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.313-324, May 24-27, 1994, Minneapolis, Minnesota, United States
|
 |
6
|
Mary Fernández , Daniela Florescu , Jaewoo Kang , Alon Levy , Dan Suciu, Catching the boat with Strudel: experiences with a Web-site management system, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.414-425, June 01-04, 1998, Seattle, Washington, United States
|
 |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
 |
11
|
Alon Y. Levy , Alberto O. Mendelzon , Yehoshua Sagiv, Answering queries using views (extended abstract), Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.95-104, May 22-25, 1995, San Jose, California, United States
[doi> 10.1145/212433.220198]
|
| |
12
|
|
 |
13
|
Svetlozar Nestorov , Serge Abiteboul , Rajeev Motwani, Extracting schema from semistructured data, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.295-306, June 01-04, 1998, Seattle, Washington, United States
|
 |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
 |
22
|
Tian Zhang , Raghu Ramakrishnan , Miron Livny, BIRCH: an efficient data clustering method for very large databases, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.103-114, June 04-06, 1996, Montreal, Quebec, Canada
|
CITED BY 106
|
|
|
|
|
|
|
|
|
|
|
Jayavel Shanmugasundaram , Eugene Shekita , Jerry Kiernan , Rajasekar Krishnamurthy , Efstratios Viglas , Jeffrey Naughton , Igor Tatarinov, A general technique for querying XML documents using a relational database system, ACM SIGMOD Record, v.30 n.3, September 2001
|
|
|
Menzo Windhouwer , Albrecht Schmidt , Roelof van Zwol , Milan Petkovic , Henk Ernst Blok, Flexible digital library search, Web-enabled systems integration: practices and challenges, Idea Group Publishing, Hershey, PA, 2003
|
|
|
|
|
|
Minos N. Garofalakis , Rajeev Rastogi , S. Seshadri , Kyuseok Shim, Data mining and the Web: past, present and future, Proceedings of the 2nd international workshop on Web information and data management, p.43-47, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ming-Ling Lo , Shyh-Kwei Chen , Sriram Padmanabhan , Jen-Yao Chung, XAS: a system for accessing componentized, virtual XML documents, Proceedings of the 23rd International Conference on Software Engineering, p.493-502, May 12-19, 2001, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Albrecht Schmidt , Florian Waas , Martin Kersten , Daniela Florescu , Michael J. Carey , Ioana Manolescu , Ralph Busse, Why and how to benchmark XML databases, ACM SIGMOD Record, v.30 n.3, September 2001
|
|
|
|
|
|
Juliana Freire , Jayant R. Haritsa , Maya Ramanath , Prasan Roy , Jérôme Siméon, StatiX: making XML count, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
|
|
|
Igor Tatarinov , Stratis D. Viglas , Kevin Beyer , Jayavel Shanmugasundaram , Eugene Shekita , Chun Zhang, Storing and querying ordered XML using a relational database system, Proceedings of the 2002 ACM SIGMOD international conference on Management of data, June 03-06, 2002, Madison, Wisconsin
|
|
|
Alberto H. F. Laender , Altigran S. da Silva , Paolo B. Golgher , Berthier Ribeiro-Neto , Irna M. R. Evangelista-Filha , Karine V. Magalhães, The Debye Environment for Web Data Management, IEEE Internet Computing, v.6 n.4, p.60-69, July 2002
|
|
|
|
|
|
Alberto H. F. Laender , Altigran S. da Silva , Paolo B. Golgher , Berthier Ribeiro-Neto , Irna M. R. Evangelista-Filha , Karine V. Magalhães, The Debye Environment for Web Data Management, IEEE Internet Computing, v.6 n.4, p.60-69, July 2002
|
|
|
|
|
|
|
|
|
Jun Huan , Wei Wang , Jan Prins , Jiong Yang, SPIN: mining maximal frequent subgraphs from graph databases, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hongjun Lu , Jeffrey Xu Yu , Guoren Wang , Shihui Zheng , Haifeng Jiang , Ge Yu , Aoying Zhou, What makes the differences: benchmarking XML database implementations, ACM Transactions on Internet Technology (TOIT), v.5 n.1, p.154-194, February 2005
|
|
|
|
|
|
|
|
|
William M. Shui , Franky Lam , Damien K. Fisher , Raymond K. Wong, Querying and maintaining ordered XML data using relational databases, Proceedings of the sixteenth Australasian database conference, p.85-94, January 01, 2005, Newcastle, Australia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Zhiyuan Chen , Johannes Gehrke , Flip Korn , Nick Koudas , Jayavel Shanmugasundaram , Divesh Srivastava, Index structures for matching XML twigs using relational query processors, Data & Knowledge Engineering, v.60 n.2, p.283-302, February, 2007
|
|
|
Mustafa Atay , Artem Chebotko , Dapeng Liu , Shiyong Lu , Farshad Fotouhi, Efficient schema-based XML-to-Relational data mapping, Information Systems, v.32 n.3, p.458-476, May, 2007
|
|
|
|
|
|
Krzysztof Walczak , Jacek Chmielewski , Miroslaw Stawniak , Sergiusz Strykowski, Extensible metadata framework for describing virtual reality and multimedia contents, Proceedings of the 24th IASTED international conference on Database and applications, p.168-175, February 13-15, 2006, Innsbruck, Austria
|
|
|
|
|
|
|
|
|
Anuj R. Jaiswal , C. Lee Giles , Prasenjit Mitra , James Z. Wang, An architecture for creating collaborative semantically capable scientific data sharing infrastructures, Proceedings of the eighth ACM international workshop on Web information and data management, November 10-10, 2006, Arlington, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
Yi Chen , Susan Davidson , Carmem Hara , Yifeng Zheng, RRXS: redundancy reducing XML storage in relations, Proceedings of the 29th international conference on Very large data bases, p.189-200, September 09-12, 2003, Berlin, Germany
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Philip Bohannon , Juliana Freire , Jayant R. Haritsa , Prasan Roy , Jérôme Siméon, LegoDB: customizing relational storage for XML documents, Proceedings of the 28th international conference on Very Large Data Bases, p.1091-1094, August 20-23, 2002, Hong Kong, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
T. Fiebig , S. Helmer , C.-C. Kanne , G. Moerkotte , J. Neumann , R. Schiele , T. Westmann, Anatomy of a native XML base management system, The VLDB Journal — The International Journal on Very Large Data Bases, v.11 n.4, p.292-314, December 2002
|
|
|
Jayavel Shanmugasundaram , Eugene Shekita , Rimon Barr , Michael Carey , Bruce Lindsay , Hamid Pirahesh , Berthold Reinwald, Efficiently publishing relational data as XML documents, The VLDB Journal — The International Journal on Very Large Data Bases, v.10 n.2-3, p.133-154, September 2001
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jayavel Shanmugasundaram , Eugene J. Shekita , Rimon Barr , Michael J. Carey , Bruce G. Lindsay , Hamid Pirahesh , Berthold Reinwald, Efficiently Publishing Relational Data as XML Documents, Proceedings of the 26th International Conference on Very Large Data Bases, p.65-76, September 10-14, 2000
|
|
|
|
|
|
|
|
|
|
|
|
Jayavel Shanmugasundaram , Kristin Tufte , Chun Zhang , Gang He , David J. DeWitt , Jeffrey F. Naughton, Relational Databases for Querying XML Documents: Limitations and Opportunities, Proceedings of the 25th International Conference on Very Large Data Bases, p.302-314, September 07-10, 1999
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Junichi Tatemura , Oliver Po , Arsany Sawires , Divyakant Agrawal , K. Selçuk Candan, WReX: a scalable middleware architecture to enable XML caching for web-services, Proceedings of the ACM/IFIP/USENIX 2005 International Conference on Middleware, p.124-143, November 01-01, 2005, Grenoble, France
|
|
|
|
|
|
|
|
|
|
|