ACM Home Page
Please provide us with feedback. Feedback
Query optimization in xml-based information integration
Full text PdfPdf (194 KB)
Source
Conference on Information and Knowledge Management archive
Proceeding of the 17th ACM conference on Information and knowledge management table of contents
Napa Valley, California, USA
POSTER SESSION: Poster session 2 database table of contents
Pages 1405-1406  
Year of Publication: 2008
ISBN:978-1-59593-991-3
Authors
Dongfeng Chen  NC State University, Raleigh, NC, USA
Rada Chirkova  NC State University, Raleigh, NC, USA
Maxim Kormilitsin  NC State University, Raleigh, NC, USA
Fereidoon Sadri  UNC-Greensboro, Greensboro, NC, USA
Timo J. Salo  IBM RTP, Research Triangle Park, NC, USA
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 89,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1458082.1458303
What is a DOI?

ABSTRACT

The problem of decentralized data sharing is relevant for a wide range of applications and is still a source of major theoretical and practical challenges, in spite of many years of sustained research in information integration. We focus on the challenge of efficiency of query evaluation in information-integration systems, with the objective of developing query-processing strategies that are widely applicable and easy to implement in real-life applications. In our algorithms we take into account important features of today's data-sharing applications, namely: XML as likely interface to or representation for data sources; potential for information overlap across data sources; and the need for inter-source processing (i.e., joins of data across data sources) in many applications.

To the best of our knowledge, our methods are the first to account for the practical issues of information overlap across data sources and of inter-source processing. While most of our algorithms are platform- and implementation-independent, we also propose XML-specific optimization techniques that allow for system-level tuning of query processing performance. Finally, using real-life datasets and our implementation of an information-integration system shell, we provide experimental results that demonstrate that our algorithms are efficient and competitive in the information-integration setting. For all the details, please see [1].


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Chirkova, D. Chen, F. Sadri, and T. Salo. Efficient query processing in XML-based information integration. Tech. Report NCSU CSC. www.csc.ncsu.edu/research/tech/reports.php, 2008.
 
2
CiteSeer. http://citeseer.ist.psu.edu/.
 
3
dblp. http://www.informatik.uni-trier.de/~ley/db/index.html.
 
4
NCSU-UNCG Information Integration, see dbgroup.ncsu.edu.
 
5
PostgreSQL. http://www.postgresql.org/.
 
6
SAXONICA XSLT/XQuery Processing www.saxonica.com/.
 
7
SIGMOD. http://www.sigmod.org/.
8
 
9

Collaborative Colleagues:
Dongfeng Chen: colleagues
Rada Chirkova: colleagues
Maxim Kormilitsin: colleagues
Fereidoon Sadri: colleagues
Timo J. Salo: colleagues