Inferring structure in semistructured data
Source
ACM SIGMOD Record
Volume 26, Issue 4 (December 1997)
Pages: 39-43
Year of Publication: 1997
ISSN: 0163-5808
Authors
Svetlozar Nestorov, Department of Computer Science, Stanford University, Stanford, CA
Serge Abiteboul, Department of Computer Science, Stanford University, Stanford, CA
Rajeev Motwani, Department of Computer Science, Stanford University, Stanford, CA
Bibliometrics
Downloads (6 Weeks): 2, Downloads (12 Months): 20, Citation Count: 14
ABSTRACT
When dealing with semistructured data such as that available on the Web, it becomes important to infer the inherent structure, both for the user (e.g., to facilitate querying) and for the system (e.g., to optimize access). In this paper, we consider the problem of identifying some underlying structure in large collections of semistructured data. Since we expect the data to be fairly irregular, this structure consists of an approximate classification of objects into a hierarchical collection of types. We propose a notion of a type hierarchy for such data, and outline a method for deriving the type hierarchy, and rules for assigning types to data elements.
CITED BY 14
Jinlin Chen, Baoyao Zhou, Jin Shi, Hongjiang Zhang, Qiu Fengwu, Function-based object model towards website adaptation, Proceedings of the 10th international conference on World Wide Web, p.587-596, May 01-05, 2001, Hong Kong, Hong Kong
Reo-Jo Yamashita, Tetsuro Ito, Hsiu-Hsen Yao, ESSQL: an enhanced semi-structured query language for composite document retrievals, Proceedings of the 16th annual international conference on Computer documentation, p.120-126, September 24-26, 1998, Quebec, Quebec, Canada
Marcos André Gonçalves, Edward A. Fox, Layne T. Watson, Neill A. Kipp, Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries, ACM Transactions on Information Systems (TOIS), v.22 n.2, p.270-312, April 2004
Guija Choe, Young-Kwang Nam, Joseph Goguen, Guilian Wang, Query generation for retrieving data from distributed semistructured documents using a metadata interface, Computer Languages, Systems and Structures, v.35 n.4, p.422-434, December 2009