Retrieval from hierarchical texts by partial patterns
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Pittsburgh, Pennsylvania, United States
Pages: 214 - 222  
Year of Publication: 1993
ISBN:0-89791-605-0
Authors
Pekka Kilpeläinen, Heikki Mannila
Sponsor
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/160688.160722

ABSTRACT

Structured texts (for example, dictionaries and user manuals) typically have a hierarchical (tree-like) structure. We describe a query language for retrieving information from collections of hierarchical text. The language is based on a tree pattern matching notion called tree inclusion. Tree inclusion allows easy expression of queries that use both the structure and the content of the document. A user of the language need not be aware of the whole structure of the database. A language based on tree inclusion is therefore data independent, a property made necessary by the great variance in the structure of the texts.
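
As an illustration of the tree inclusion idea named in the abstract, the sketch below treats a pattern tree as included in a document tree when the pattern can be obtained from the document by deleting nodes, with the children of a deleted node taking its place under its parent. This is a minimal, memoized brute-force check for the ordered case only (sibling order is preserved); it is not the algorithm or the query language of the paper. The helper names (node, forest_included, tree_included) and the dictionary-entry labels are assumptions made up for this example.

```python
from functools import lru_cache


def node(label, *children):
    """Build a tree as a hashable (label, children) tuple. Hypothetical helper."""
    return (label, tuple(children))


@lru_cache(maxsize=None)
def forest_included(pattern, target):
    """True if the pattern forest can be obtained from the target forest
    by deleting nodes, keeping labels, ancestry, and left-to-right order."""
    if not pattern:
        return True   # every remaining target node can be deleted
    if not target:
        return False  # pattern nodes left, nothing to match them against
    (t_label, t_children), t_rest = target[0], target[1:]
    # Option 1: delete the leftmost target root; its children move up.
    if forest_included(pattern, t_children + t_rest):
        return True
    # Option 2: match the leftmost pattern root to the leftmost target root.
    (p_label, p_children), p_rest = pattern[0], pattern[1:]
    return (p_label == t_label
            and forest_included(p_children, t_children)
            and forest_included(p_rest, t_rest))


def tree_included(pattern, target):
    """Ordered tree inclusion for single trees."""
    return forest_included((pattern,), (target,))


# Assumed example: a dictionary entry and a partial pattern that names
# only the parts of the structure the user cares about.
doc = node("entry",
           node("headword"),
           node("sense",
                node("definition"),
                node("quotation", node("author"), node("text"))))
query = node("entry", node("headword"), node("author"))

print(tree_included(query, doc))
```

The query omits the sense and quotation levels, yet it is still included in the document because those intermediate nodes can be deleted. This is the sense in which a user need not know the whole structure of the database.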



Collaborative Colleagues:
Pekka Kilpeläinen: colleagues
Heikki Mannila: colleagues