ACM Home Page
Please provide us with feedback. Feedback
FleXPath: flexible structure and full-text querying for XML
Full text PdfPdf (438 KB)
Source International Conference on Management of Data archive
Proceedings of the 2004 ACM SIGMOD international conference on Management of data table of contents
Paris, France
SESSION: Research sessions: Web, XML and IR table of contents
Pages: 83 - 94  
Year of Publication: 2004
ISBN:1-58113-859-8
Authors
Sihem Amer-Yahia  AT&T Labs-Research, Florham Park, NJ
Laks V. S. Lakshmanan  University of British Columbia, Vancouver, CA
Shashank Pandit  IIT Bombay, Mumbaî, India
Sponsor
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 109,   Citation Count: 26
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1007568.1007581
What is a DOI?

ABSTRACT

Querying XML data is a well-explored topic with powerful database-style query languages such as XPath and XQuery set to become W3C standards. An equally compelling paradigm for querying XML documents is full-text search on textual content. In this paper, we study fundamental challenges that arise when we try to integrate these two querying paradigms.While keyword search is based on approximate matching, XPath has exact match semantics. We address this mismatch by considering queries on structure as a "template", and looking for answers that best match this template and the full-text search. To achieve this, we provide an elegant definition of relaxation on structure and define primitive operators to span the space of relaxations. Query answering is now based on ranking potential answers on structural and full-text search conditions. We set out certain desirable principles for ranking schemes and propose natural ranking schemes that adhere to these principles. We develop efficient algorithms for answering top-K queries and discuss results from a comprehensive set of experiments that demonstrate the utility and scalability of the proposed framework and algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Al-Khalifa et al. Structural joins: A primitive for efficient XML query pattern matching. In ICDE, 2002.
2
 
3
 
4
 
5
J. M. Bremer and M. Gertz. XQuery/IR: Integrating XML Document and Data Retrieval. WebDB 2002.
6
7
8
9
 
10
C. Chen and Y. Ling. A Sampling-Based Estimator for Top-K Query. In ICDE 2002.
 
11
T. T. Chinenyanga and N. Kushmerick. Expressive and Efficient Ranked Querying of XML Data. 4th International Workshop on the Web and Databases (WebDB). Santa Barbara, California, 2001.
 
12
S. Cohen et al. XSEarch: A Semantic Search Engine for XML. In VLDB 2003.
 
13
M. Cutler et al. Using the Structure of HTML Documents to Improve Retrieval. USENIX Symposium on Internet Technologies and Systems. California 1997.
 
14
 
15
C. Delobel and M. C. Rousset. A Uniform Approach for Querying Large Tree-structured Data through a Mediated Schema. International Workshop on Foundations of Models for Information Integration (FMII-2001).
 
16
S. Flesca et al. On the minimization of XPath queries. In VLDB 2003: 153--164
 
17
 
18
N. Fuhr and K. Grossjohann. XIRQL: An Extension of XQL for Information Retrieval. ACM SIGIR Workshop on XML and Information Retrieval. Athens, Greece, 2000.
 
19
20
 
21
Y. Hayashi et al. Searching Text-rich XML Documents with Relevance Ranking. ACM SIGIR 2000 Workshop on XML and Information Retrieval, Greece, 2000.
22
 
23
P. Kilpelainen. Tree Matching Problems with Applications to Structured Text Databases. PhD thesis, University of Helsinki, Finland, November 1992.
24
25
 
26
J. Naughton et al. The Niagara Internet Query System. http://www.cs.wisc.edu/niagara/Publications.html
 
27
 
28
 
29
 
30
T. Schlieder. Similarity Search in XML Data using Cost-Based Query Transformations. ACM SIGMOD 2001 Web and Databases Workshop. May, 2001. Santa Barbara, California.
 
31

CITED BY  26
Collaborative Colleagues:
Sihem Amer-Yahia: colleagues
Laks V. S. Lakshmanan: colleagues
Shashank Pandit: colleagues