| Mining all frequent projection-selection queries from a relational table |
| Full text |
Pdf
(262 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 261
archive
Proceedings of the 11th international conference on Extending database technology: Advances in database technology
table of contents
Nantes, France
SESSION: Research sessions: Data mining
table of contents
Pages 368-379
Year of Publication: 2008
ISBN:978-1-59593-926-5
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 2, Downloads (12 Months): 31, Citation Count: 0
|
|
|
ABSTRACT
In this paper we study the problem of mining all frequent queries in a given database table, a problem known to be intractable even for conjunctive queries. We restrict our attention to projection-selection queries, and we assume that the table to be mined satisfies a set of functional dependencies. Under these assumptions we define a pre-ordering &cupre; over queries and we show the following: (a) the support measure is anti-monotonic (with respect to &cupre;), and (b) if we define q &cupre; q' iff q &cupre; q' and q' &cupre; q then all queries of an equivalence class have the same support. With these results at hand, we further restrict our attention to star schemas of data warehouses. In those schemas, the set of functional dependencies satisfies an important property, namely, the union of keys of all dimension tables is a key for the fact table. The main contribution of this paper is the proposal of a level-wise algorithm for mining all frequent projection-selection queries in a data warehouse over a star schema. Moreover, we show that, in the case of a star schema, the complexity in the number of scans of our algorithm is similar to that of the well known Apriori algorithm, i.e., linear with respect to the number of attributes.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Rakesh Agrawal , Heikki Mannila , Ramakrishnan Srikant , Hannu Toivonen , A. Inkeri Verkamo, Fast discovery of association rules, Advances in knowledge discovery and data mining, American Association for Artificial Intelligence, Menlo Park, CA, 1996
|
| |
2
|
W. Armstrong. Dependency structures of data base relationships. In IFIP Congress, pages 580--583. North Holland, 1974.
|
| |
3
|
|
| |
4
|
|
| |
5
|
A. Faye, A. Giacometti, D. Laurent, and N. Spyratos. Mining rules in databases with multiple tables: Problems and perspectives. In 3rd International Conference on Computing Anticipatory Systems (CASYS), 1999.
|
| |
6
|
B. Goethals. Mining queries. In Workshop on inductive databases and constraint based mining. http://www.informatik.unifreiburg.de/ml/IDB/talks/Goethals_slides.pdf, 2004.
|
| |
7
|
|
 |
8
|
|
| |
9
|
J. Han, Y. Fu, W. Wang, K. Koperski, and O. Zaiane. Dmql: A data mining query language for relational databases. In SIGMOD-DMKD'96, pages 27--34, 1996.
|
| |
10
|
|
| |
11
|
T. Y. Jen, D. Laurent, N. Spyratos, and O. Sy. Towards mining frequent queries in star schemes. In International Workshop on Knowledge Discovery in Databases (KDID), LNCS 3933, pages 104--123. Springer Verlag, 2005.
|
| |
12
|
|
| |
13
|
T. Turmeaux, A. Salleb, C. Vrain, and D. Cassard. Learning caracteristic rules relying on quantified paths. In PKDD, LNCS 2838, pages 471--482. Springer Verlag, 2003.
|
| |
14
|
|
|