ACM Home Page
Please provide us with feedback. Feedback
Proximal nodes: a model to query document databases by content and structure
Full text PdfPdf (550 KB)
Source ACM Transactions on Information Systems (TOIS) archive
Volume 15 ,  Issue 4  (October 1997) table of contents
Pages: 400 - 435  
Year of Publication: 1997
ISSN:1046-8188
Authors
Gonzalo Navarro  Univ. of Chile, Blanco, Chile
Ricardo Baeza-Yates  Univ. of Chile, Blanco, Chile
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 64,   Citation Count: 29
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/263479.263482
What is a DOI?

ABSTRACT

A model to query document databases by both their content and structure is presented. The goal is to obtain a query language that is expressive in practice while being efficiently implementable, features not present at the same time in previous work. The key ideas of the model are a set-oriented query language based on operations on nearby structure elements of one or more hierarchies, together with content and structural indexing and bottom-up evaluation. The model is evaluated in regard to expressiveness and efficiency, showing that it provides a good trade-off between both goals. Finally, it is shown how to include in the model other media different from text.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
ARNOLD-MOORE, T., FULLER, M., LOWE, B., THOM, J., AND WILKINSON, R. 1995. The ELF data model and SGQL query language for structured document databases. In Proceedings of the 6th Australasian Database Conference. 17-26.
 
3
ARS INNOVANDI. 1992. Search City 1.1 Text Retrieval for Windows Power Users. Ars Innovandi, Santiago, Chile.
 
4
BAEZA-YATES, R. 1994. An hybrid query model for full text retrieval systems. Tech. Rep. DCC-1994-2, Dept. of Computer Science, Univ. of Chile, Santiago, Chile.
 
5
BAEZA-YATES, R. 1996. An extended model for full-text databases. J. Braz. CS Soc. 3, 2 (Apr.), 57-64.
6
7
8
 
9
CATTELL, R. 1991. Object Data Management. Addison-Wesley, Reading, Mass.
 
10
11
 
12
CLARKE, C., CORMACK, G., AND BURKOWSKI, F. 1995. An algebra for structured text search and a framework for its implementation. Comput. J.
13
 
14
15
16
17
 
18
DAO, T., SACKS-DAVIS, R., AND THOM, J. 1996. Indexing structured text for queries on containment relationships. In Proceedings of the 7th Australasian Database Conference.
 
19
 
20
DESAI, B., GOYAL, P., AND SADRI, S. 1986. A data model for use with formatted and textual data. J. ASIS 37, 3, 158-165.
 
21
D{IGITAL. 1991. CDA-DDIF Technical Specification. Digital Equipment Corp., Maynard, Mass.
 
22
 
23
FAWCETT, H. 1989. PAT 3.3 User's Guide. UW Centre for the New OED and Text Research, Univ. of Waterloo, Ontario, Canada.
 
24
 
25
GAREY, M. AND JOHNSON, D. 1979. Computers and Intractability. W. Freeman and Company.
 
26
27
 
28
ISO. 1986. Information processing--Text and office systems--Standard generalized markup language (SGML). ISO 8879-1986, International Standards Organization, Geneva, Switzerland.
 
29
ISO. 1991. Information processing--Text composition--Standard page description language (SPDL). ISO/IEC DIS 10180, International Standards Organization, Geneva, Switzerland.
 
30
ISO. 1992. Information technology--hypermedia/time-based structuring language (Hy- Time). ISO/IEC 10744, International Standards Organization, Geneva, Switzerland.
 
31
ISO. 1994. Information technology--Text and office systems--Document style semantics and specification language (DSSSL). ISO/IEC DIS 10179.2, International Standards Organization, Geneva, Switzerland.
 
32
 
33
34
35
 
36
 
37
 
38
MACKIE, E. AND ZOBEL, g. 1992. Retrieval of tree-structured data from disc. In Proceedings of the 3rd Australasian Database Conference. 209-216.
 
39
 
40
 
41
NAVARRO, G. 1995. A language for queries on structure and contents of textual databases. Master's thesis, Dept. of Computer Science, Univ. of Chile, Santiago, Chile.
 
42
NAVARRO, G. AND BAEZA-YATES, R. 1995a. Expressive power of a new model for structured text databases. In Proceedings of PANEL 95. 1151-1162.
43
 
44
SACKS-DAVIS, R., ARNOLD-MOORE, T., AND ZOBEL, J. 1994. Database systems for structured documents. In Proceedings of ADTI 94. 272-283.
 
45
SACKS-DAVIS, R., ZOBEL, J., AND RAMAMOHANARAO, K. 1992. Advanced database systems for text retrieval. In Proceedings of the 3rd Australian Database Conference. 1-8.
 
46
SALMINEN, A. AND TOMPA, F. 1992. PAT expressions: An algebra for text search. In COM- PLEX 92. 309-332.
 
47
48
 
49
SUBRAHMANIAN, M. AND JAJODIA, S. Eds. 1996. Multimedia Database Systems. Springer- Verlag, New York.
50
 
51

CITED BY  29


REVIEW

"Jaroslav Pokorny : Reviewer"

The authors present a model for structuring and querying document databases. Theirs is an advanced method, combining structure and content in queries. On a prototype system, the authors argue that their model not only is sufficiently expressiv  more...

Collaborative Colleagues:
Gonzalo Navarro: colleagues
Ricardo Baeza-Yates: colleagues