|
ABSTRACT
Searching over heterogeneous information sources is difficult in part because of the nonuniform query languages. Our approach is to allow users to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final results. In this article we introduce the architecture and associated mechanism for query translation. In particular, we discuss techniques for rewriting predicates in Boolean queries into native subsuming forms, which is a basis of translating complex queries. In addition, we present experimental results for evaluating the cost of postfiltering. We also discuss the drawbacks of this approach and cases when it may not be effective. We have implemented prototype versions of these mechanisms and demonstrated them on heterogeneous Boolean systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
BALDONADO, M., CHANG, C.-C. K., GRAVANO, L., AND PAEPCKE, A. 1997a. The Stanford Digital Library metadata architecture. Int. J. Dig. Lib. 1, 2 (Sept.), 108-121.
|
 |
3
|
Michelle Baldonado , Chen-Chuan K. Chang , Luis Gravano , Andreas Paepcke, Metadata for digital libraries: architecture and design rationale, Proceedings of the second ACM international conference on Digital libraries, p.47-56, July 23-26, 1997, Philadelphia, Pennsylvania, United States
[doi> 10.1145/263690.263791]
|
 |
4
|
|
 |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
CRYSTAL, M. I. AND JAKOBSON, G. E. 1982. FRED, a front end for databases. Online 6, 5 (Sept.), 27-30.
|
 |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
GARC A-MOLINA, H., HAMMER, J., IRELAND, K., PAPAKONSTANTINOU, Y., ULLMAN, J., AND WIDOM, J. 1995. Integrating and accessing heterogeneous information sources in TSIMMIS. In Proceedings of the AAAI Spring Symposium on Information Gathering (Stanford, CA, Mar.). AAAI Press, Menlo Park, CA, 61-64.
|
| |
13
|
|
| |
14
|
|
 |
15
|
Luis Gravano , Chen-Chuan K. Chang , Héctor García-Molina , Andreas Paepcke, STARTS: Stanford proposal for Internet meta-searching, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.207-218, May 11-15, 1997, Tucson, Arizona, United States
|
 |
16
|
Luis Gravano , Héctor García-Molina , Anthony Tomasic, The effectiveness of GIOSS for the text database discovery problem, Proceedings of the 1994 ACM SIGMOD international conference on Management of data, p.126-137, May 24-27, 1994, Minneapolis, Minnesota, United States
|
| |
17
|
HARMAN, D. 1993. Document detection overview. In Proceedings of the TIPSTER Text Program (Phase I) (Fredricksburg, VA, Sept.). Morgan Kaufmann Publishers Inc., San Francisco, CA.
|
| |
18
|
HAWKINS, D. T. AND LEVY, L. R. 1985. Front end software for online database searching Part 1: Definitions, system features, and evaluation. Online 9, 6 (Nov.), 30-37.
|
| |
19
|
HOWE, D. 1997. The free on-line dictionary of computing. Available via http://wombat.doc. ic.ac.uk/.
|
| |
20
|
IEE. 1991. INSPEC Thesaurus. The Institution of Electrical Engineers, London, UK.
|
| |
21
|
ISO. 1993. Information and Documentation--Commands for Interactive Text Searching. 1st ed. IOS Press, Amsterdam, The Netherlands.
|
 |
22
|
Steven P. Ketchpel , Hector Garcia-Molina , Andreas Paepcke, Shopping models: a flexible architecture for information commerce, Proceedings of the second ACM international conference on Digital libraries, p.65-74, July 23-26, 1997, Philadelphia, Pennsylvania, United States
[doi> 10.1145/263690.263795]
|
| |
23
|
KIRK, T., LEVY, A. Y., SAGIV, Y., AND SRIVASTAVA, D. 1995. The information manifold. In Papers from the AAAI Spring Symposium on Information Gathering in Distributed Heterogenous Environments (Menlo Park, CA, Mar.), C. Knoblock and A. Levy, Eds. AAAI Press, Menlo Park, CA, 85-91.
|
 |
24
|
|
| |
25
|
LOVINS, J. B. 1968. Development of a stemming algorithm. Mech. Trans. Comput. Ling. 11, 1-2, 22-31.
|
| |
26
|
MARCUS, R. S. 1982. User assistance in bibliographic retrieval networks through a computer intermediary. IEEE Trans. Syst. Man Cybern. SMC-12, 2, 116-133.
|
| |
27
|
MARTIN, T. H. 1974. A feature analysis of interactive retrieval systems. Rep. SU-COMM-ICR-74-1. Institute of Communication Research, Stanford University, Stanford, CA.
|
| |
28
|
|
 |
29
|
|
| |
30
|
NISO. 1993. Z39.58-1992 Common Command Language for Online Interactive Information Retrieval. NISO Press, Bethesda, MD.
|
| |
31
|
NISO. 1995. Information Retrieval (Z39.50): Application service definition and protocol specification. Tech. Rep. ANSI/NISO Z39.50-1995. NISO Press, Bethesda, MD. Available via http://lcweb.loc.gov/z3950/agency/.
|
 |
32
|
|
| |
33
|
NEGUS, A. E. 1979. Development of the Euronet-Diane Common Command Language. In Proceedings of the 3rd International Online Information Meeting. Learned Information Ltd., Oxford, UK, 95-98.
|
| |
34
|
|
| |
35
|
|
| |
36
|
PORTER, M. F. 1980. An algorithm for suffix stripping. Program 14, 3 (July), 130-137.
|
| |
37
|
PREECE, S. AND WILLIAMS, M. 1980. Software for the searcher's workbench. In Proceedings of the 43rd American Society for Information Science Annual Meeting (Palo Alto, CA, Oct.). Knowledge Industry Publications, Inc., White Plains, NY, 403-405.
|
| |
38
|
RAO, R. AND RAJARAMAN, A. 1994. GAIA technical overview. Tech. Rep. Xerox PARC, Palo Alto, CA.
|
 |
39
|
|
| |
40
|
|
| |
41
|
SELBERG, E. AND ETZIONI, O. 1995. Multi-service search and comparison using the MetaCrawler. In Proceedings of the 4th International WWW Conference (Boston, MA, Dec.).
|
| |
42
|
TOLIVER, D. E. 1982. OL'SAM: An intelligent front-end for bibliographic information retrieval. Inf. Technol. Libr. 1, 4, 317-326.
|
| |
43
|
|
| |
44
|
|
 |
45
|
|
| |
46
|
|
| |
47
|
|
| |
48
|
|
|