ACM Home Page
Please provide us with feedback. Feedback
Predicate rewriting for translating Boolean queries in a heterogeneous information system
Full text PdfPdf (351 KB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 17 ,  Issue 1  (January 1999) table of contents
Pages: 1 - 39  
Year of Publication: 1999
ISSN:1046-8188
Authors
Chen-Chuan K. Chang  Stanford Univ., Stanford, CA
Héctor Garcia-Molina  Stanford Univ., Stanford, CA
Andreas Paepcke  Stanford Univ., Stanford, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 36,   Citation Count: 8
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/297117.297120
What is a DOI?

ABSTRACT

Searching over heterogeneous information sources is difficult in part because of the nonuniform query languages. Our approach is to allow users to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final results. In this article we introduce the architecture and associated mechanism for query translation. In particular, we discuss techniques for rewriting predicates in Boolean queries into native subsuming forms, which is a basis of translating complex queries. In addition, we present experimental results for evaluating the cost of postfiltering. We also discuss the drawbacks of this approach and cases when it may not be effective. We have implemented prototype versions of these mechanisms and demonstrated them on heterogeneous Boolean systems.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
BALDONADO, M., CHANG, C.-C. K., GRAVANO, L., AND PAEPCKE, A. 1997a. The Stanford Digital Library metadata architecture. Int. J. Dig. Lib. 1, 2 (Sept.), 108-121.
3
4
5
 
6
 
7
 
8
CRYSTAL, M. I. AND JAKOBSON, G. E. 1982. FRED, a front end for databases. Online 6, 5 (Sept.), 27-30.
9
10
 
11
 
12
GARC A-MOLINA, H., HAMMER, J., IRELAND, K., PAPAKONSTANTINOU, Y., ULLMAN, J., AND WIDOM, J. 1995. Integrating and accessing heterogeneous information sources in TSIMMIS. In Proceedings of the AAAI Spring Symposium on Information Gathering (Stanford, CA, Mar.). AAAI Press, Menlo Park, CA, 61-64.
 
13
 
14
15
16
 
17
HARMAN, D. 1993. Document detection overview. In Proceedings of the TIPSTER Text Program (Phase I) (Fredricksburg, VA, Sept.). Morgan Kaufmann Publishers Inc., San Francisco, CA.
 
18
HAWKINS, D. T. AND LEVY, L. R. 1985. Front end software for online database searching Part 1: Definitions, system features, and evaluation. Online 9, 6 (Nov.), 30-37.
 
19
HOWE, D. 1997. The free on-line dictionary of computing. Available via http://wombat.doc. ic.ac.uk/.
 
20
IEE. 1991. INSPEC Thesaurus. The Institution of Electrical Engineers, London, UK.
 
21
ISO. 1993. Information and Documentation--Commands for Interactive Text Searching. 1st ed. IOS Press, Amsterdam, The Netherlands.
22
 
23
KIRK, T., LEVY, A. Y., SAGIV, Y., AND SRIVASTAVA, D. 1995. The information manifold. In Papers from the AAAI Spring Symposium on Information Gathering in Distributed Heterogenous Environments (Menlo Park, CA, Mar.), C. Knoblock and A. Levy, Eds. AAAI Press, Menlo Park, CA, 85-91.
24
 
25
LOVINS, J. B. 1968. Development of a stemming algorithm. Mech. Trans. Comput. Ling. 11, 1-2, 22-31.
 
26
MARCUS, R. S. 1982. User assistance in bibliographic retrieval networks through a computer intermediary. IEEE Trans. Syst. Man Cybern. SMC-12, 2, 116-133.
 
27
MARTIN, T. H. 1974. A feature analysis of interactive retrieval systems. Rep. SU-COMM-ICR-74-1. Institute of Communication Research, Stanford University, Stanford, CA.
 
28
29
 
30
NISO. 1993. Z39.58-1992 Common Command Language for Online Interactive Information Retrieval. NISO Press, Bethesda, MD.
 
31
NISO. 1995. Information Retrieval (Z39.50): Application service definition and protocol specification. Tech. Rep. ANSI/NISO Z39.50-1995. NISO Press, Bethesda, MD. Available via http://lcweb.loc.gov/z3950/agency/.
32
 
33
NEGUS, A. E. 1979. Development of the Euronet-Diane Common Command Language. In Proceedings of the 3rd International Online Information Meeting. Learned Information Ltd., Oxford, UK, 95-98.
 
34
 
35
 
36
PORTER, M. F. 1980. An algorithm for suffix stripping. Program 14, 3 (July), 130-137.
 
37
PREECE, S. AND WILLIAMS, M. 1980. Software for the searcher's workbench. In Proceedings of the 43rd American Society for Information Science Annual Meeting (Palo Alto, CA, Oct.). Knowledge Industry Publications, Inc., White Plains, NY, 403-405.
 
38
RAO, R. AND RAJARAMAN, A. 1994. GAIA technical overview. Tech. Rep. Xerox PARC, Palo Alto, CA.
39
 
40
 
41
SELBERG, E. AND ETZIONI, O. 1995. Multi-service search and comparison using the MetaCrawler. In Proceedings of the 4th International WWW Conference (Boston, MA, Dec.).
 
42
TOLIVER, D. E. 1982. OL'SAM: An intelligent front-end for bibliographic information retrieval. Inf. Technol. Libr. 1, 4, 317-326.
 
43
 
44
45
 
46
 
47
 
48

CITED BY  8

Collaborative Colleagues:
Chen-Chuan K. Chang: colleagues
Héctor Garcia-Molina: colleagues
Andreas Paepcke: colleagues