ABSTRACT
A model for querying document databases by both content and structure is presented. The goal is a query language that is expressive in practice and also efficiently implementable, two properties that previous work has not offered together. The key ideas of the model are a set-oriented query language based on operations on nearby structure elements of one or more hierarchies, together with content and structural indexing and bottom-up evaluation. The model is evaluated with regard to expressiveness and efficiency, showing that it provides a good trade-off between the two goals. Finally, it is shown how to include media other than text in the model.
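The ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' language or implementation: it assumes that each structural element is a (start, end) segment of word positions, that a structural index maps element names to segment lists, and that a content index maps words to positions. The operator names `containing` and `inside` and the tuple-based query form are hypothetical, chosen only for illustration.

```python
from bisect import bisect_left

# Assumption: a structural element is a (start, end) segment of word positions,
# and every operator takes sets (lists) of segments and returns a set of segments.

def build_content_index(words):
    """Content index: map each word to the sorted positions where it occurs."""
    index = {}
    for pos, word in enumerate(words):
        index.setdefault(word, []).append(pos)
    return index

def containing(elements, positions):
    """Keep the elements whose segment covers at least one of the positions."""
    result = []
    for start, end in elements:
        i = bisect_left(positions, start)  # first position >= start
        if i < len(positions) and positions[i] <= end:
            result.append((start, end))
    return result

def inside(inner, outer):
    """Keep the inner elements that lie within some outer element."""
    return [(s, e) for s, e in inner
            if any(o_s <= s and e <= o_e for o_s, o_e in outer)]

def evaluate(query, structure_index, content_index):
    """Evaluate a query tree bottom-up: leaves read an index, inner nodes combine sets."""
    op = query[0]
    if op == "struct":
        return structure_index.get(query[1], [])
    if op == "word":
        return content_index.get(query[1], [])
    left = evaluate(query[1], structure_index, content_index)
    right = evaluate(query[2], structure_index, content_index)
    if op == "containing":
        return containing(left, right)
    if op == "inside":
        return inside(left, right)
    raise ValueError(f"unknown operator: {op}")

# Hypothetical toy document: one chapter split into two sections.
words = "the model queries text the index stores structure".split()
content_index = build_content_index(words)
structure_index = {"chapter": [(0, 7)], "section": [(0, 3), (4, 7)]}

# Sections that contain the word "index".
hits = evaluate(("containing", ("struct", "section"), ("word", "index")),
                structure_index, content_index)
```

Each operator consumes and produces whole sets of elements, and the leaves of the query tree are answered directly from the two indices, so the query is resolved from the leaves upward without scanning the text itself.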
CITED BY 29
Marcos André Gonçalves, Edward A. Fox, Layne T. Watson, Neill A. Kipp, Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries, ACM Transactions on Information Systems (TOIS), v.22 n.2, p.270-312, April 2004
Marcos André Gonçalves, Edward A. Fox, Aaron Krowne, Pável Calado, Alberto H. F. Laender, Altigran S. da Silva, Berthier Ribeiro-Neto, The effectiveness of automatically structured queries in digital libraries, Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, June 07-11, 2004, Tucson, AZ, USA
Robert W.P. Luk, H. V. Leong, Tharam S. Dillon, Alvin T.S. Chan, W. Bruce Croft, James Allan, A survey in indexing and searching XML documents, Journal of the American Society for Information Science and Technology, v.53 n.6, p.415-437, May, 2002
REVIEW
"Jaroslav Pokorny : Reviewer"
The authors present a model for structuring and querying document
databases. Theirs is an advanced method, combining structure and content
in queries. On a prototype system, the authors argue that their model
not only is sufficiently expressiv