Score region algebra: building a transparent XML-R database
Source Conference on Information and Knowledge Management archive
Proceedings of the 14th ACM international conference on Information and knowledge management table of contents
Bremen, Germany
SESSION: Paper session IR-1 (information retrieval): XML retrieval table of contents
Pages: 12 - 19  
Year of Publication: 2005
ISBN:1-59593-140-6
Authors
Vojkan Mihajlović, University of Twente, The Netherlands
Henk Ernst Blok, University of Twente, The Netherlands
Djoerd Hiemstra, University of Twente, The Netherlands
Peter M. G. Apers, University of Twente, The Netherlands
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1099554.1099560

ABSTRACT

A unified database framework that enables better comprehension of ranked XML retrieval remains a challenge in the XML database field. We propose a logical algebra, named score region algebra, that enables transparent specification of information retrieval (IR) models for XML databases. Transparency is achieved by allowing various retrieval models to be instantiated through abstract score functions within the algebra operators, while the logical query plan and operator definitions remain unchanged. Our algebra operators model three important aspects of XML retrieval: element relevance score computation, element score propagation, and element score combination. To illustrate the usefulness of our algebra, we instantiate four different, well-known IR scoring models and combine them with different score propagation and combination functions. We implemented the algebra operators in a prototype system on top of a low-level database kernel. The system is evaluated on a collection of IEEE articles in XML format provided by INEX. We argue that state-of-the-art XML IR models can be transparently implemented using our score region algebra framework on top of any low-level physical database engine or existing RDBMS, allowing a more systematic investigation of retrieval model behavior.
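
The idea of keeping operators fixed while swapping score functions can be illustrated with a minimal sketch. This is not the algebra defined in the paper: the `Region` type, the three operator names (`score_elements`, `propagate_up`, `combine_scores`), and the example score functions are all hypothetical, chosen only to mirror the three aspects named in the abstract (relevance score computation, score propagation, score combination). Regions are assumed to be (start, end) position pairs with a score, and terms are assumed to be zero-length regions.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Region:
    """Hypothetical scored region: a text span with a name and a score."""
    start: int
    end: int
    name: str
    score: float = 1.0


def contains(outer: Region, inner: Region) -> bool:
    """True if `inner` lies strictly inside `outer`."""
    return outer.start < inner.start and inner.end < outer.end


# Abstract score functions: the operators below never look inside them.
Relevance = Callable[[Region, List[Region]], float]
Aggregate = Callable[[Sequence[float]], float]
Combine = Callable[[float, float], float]


def score_elements(elements, terms, relevance: Relevance):
    """Element relevance: rescore each element from the terms it contains."""
    out = []
    for e in elements:
        hits = [t for t in terms if contains(e, t)]
        out.append(replace(e, score=e.score * relevance(e, hits)))
    return out


def propagate_up(ancestors, descendants, aggregate: Aggregate):
    """Score propagation: push descendant scores to containing ancestors."""
    out = []
    for a in ancestors:
        inner = [d.score for d in descendants if contains(a, d)]
        out.append(replace(a, score=a.score * aggregate(inner)))
    return out


def combine_scores(left, right, combine: Combine):
    """Score combination: merge scores of regions with the same extent."""
    by_extent = {(r.start, r.end): r.score for r in right}
    return [
        replace(l, score=combine(l.score, by_extent[(l.start, l.end)]))
        for l in left
        if (l.start, l.end) in by_extent
    ]


# Two interchangeable (illustrative) relevance functions.
def term_density(element: Region, hits: List[Region]) -> float:
    return len(hits) / (element.end - element.start)


def boolean_match(element: Region, hits: List[Region]) -> float:
    return 1.0 if hits else 0.0


# Toy data: one article with two sections and a few term occurrences.
article = [Region(0, 19, "article")]
sections = [Region(1, 9, "sec"), Region(10, 18, "sec")]
xml_terms = [Region(p, p, "xml") for p in (3, 5, 12)]
algebra_terms = [Region(4, 4, "algebra")]

# The same "plan" evaluated with different score functions plugged in.
about_xml = score_elements(sections, xml_terms, term_density)
about_algebra = score_elements(sections, algebra_terms, term_density)
about_both = combine_scores(about_xml, about_algebra, lambda a, b: a * b)
article_sum = propagate_up(article, about_xml, sum)
article_max = propagate_up(article, about_xml, lambda s: max(s, default=0.0))
about_xml_bool = score_elements(sections, xml_terms, boolean_match)
```

In this sketch, replacing `term_density` with `boolean_match`, or `sum` with `max`, changes the ranking behavior without touching the operator definitions or the sequence of operator calls, which is the kind of transparency the abstract describes.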

