ACM Home Page
Please provide us with feedback. Feedback
Searching for dependencies at multiple abstraction levels
Full text PdfPdf (411 KB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 27 ,  Issue 3  (September 2002) table of contents
Pages: 229 - 260  
Year of Publication: 2002
ISSN:0362-5915
Authors
Toon Calders  University of Antwerp, Wilrijk, Belgium
Raymond T. Ng  University of British Columbia, Canada
Jef Wijsen  University of Mons-Hainaut, Mons, Belgium
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 46,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/581751.581752
What is a DOI?

ABSTRACT

The notion of roll-up dependency (RUD) extends functional dependencies with generalization hierarchies. RUDs can be applied in OLAP and database design. The problem of discovering RUDs in large databases is at the center of this paper. An algorithm is provided that relies on a number of theoretical results. The algorithm has been implemented; results on two real-life datasets are given. The extension of functional dependency (FD) with roll-ups turns out to capture meaningful rules that are outside the scope of classical FD mining. Performance figures show that RUDs can be discovered in linear time in the number of tuples of the input dataset.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
Bay, S. 1999. The UCI KDD archive, http://kdd.ics.uci.edu/. University of California, Irvine, Dept. of Information and Computer Sciences.
 
6
Bell, S. 1995. Discovery and maintenance of functional dependencies by independencies. In Proceedings of the 1st International Conference on Knowledge Discovery in Databases. AAAI Press, Montreal, Canada, 27--32.
 
7
Bettini, C., Dyreson, C., Evans, W., Snodgrass, R., and Wang, X. 1998. A glossary of time granularity concepts. In Temporal Databases: Research and Practice, O. Etzion, S. Jajodia, and S. Sripada, Eds. Number 1399 in LNCS State-of-the-art Survey. Springer-Verlag, 406--413.
 
8
 
9
 
10
CIA. 2001. The world factbook 2001, http://www.cia.gov/cia/publications/factbook.
 
11
Davey, B. and Priestley, H. 1990. Introduction to Lattices and Order. Cambridge University Press.
 
12
 
13
Han, J. 1997. OLAP mining: An integration of OLAP with data mining. In Proceedings of the 7th IFIP 2.6 Working Conference on Database Semantics (DS-7). Leysin, Switzerland, 1--9.
14
 
15
Huhtala, Y., Kärkkäinen, J., Porkka, P., and Toivonen, H. 1999. TANE: An efficient algorithm for discovering functional and approximate dependencies. Comput. J. 42, 2, 100--111.
 
16
Kantola, M., Mannila, H., Räihä, K.-J., and Siirtola, H. 1992. Discovering functional and inclusion dependencies in relational databases. Int. J. Intelligent Systems 7, 591--607.
 
17
 
18
 
19
 
20
 
21
 
22
May, W. 1999. Information extraction and integration with Florid: The Mondial case study. Tech. Rep. 131, Universität Freiburg, Institut für Informatik.
 
23
 
24
Schlimmer, J. C. 1993. Efficiently inducing determinations: A complete and systematic search algorithm that uses optimal pruning. In Proceedings of the 10th International Conference on Machine Learning. Morgan Kaufmann, Amherst, MA, 284--290.
 
25
van der Heyden, J. 2001. Global statistics, http://www.geohive.com/. GeoHive.
 
26
van der Laag, P. R. J. and Nienhuys-Cheng, S.-H. 1998. Completeness and properness of refinement operators in inductive logic programming. J. Logic Program. 34, 3, 201--225.
27
28
 
29
30
 
31


Collaborative Colleagues:
Toon Calders: colleagues
Raymond T. Ng: colleagues
Jef Wijsen: colleagues