ACM Home Page
Please provide us with feedback. Feedback
Mining all frequent projection-selection queries from a relational table
Full text PdfPdf (262 KB)
Source ACM International Conference Proceeding Series; Vol. 261 archive
Proceedings of the 11th international conference on Extending database technology: Advances in database technology table of contents
Nantes, France
SESSION: Research sessions: Data mining table of contents
Pages 368-379  
Year of Publication: 2008
ISBN:978-1-59593-926-5
Authors
Tao-Yuan Jen  Université de Cergy-Pontoise, Cergy-Pontoise, France
Dominique Laurent  Université de Cergy-Pontoise, Cergy-Pontoise, France
Nicolas Spyratos  Université Paris-Sud, Orsay, France
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 39,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1353343.1353390
What is a DOI?

ABSTRACT

In this paper we study the problem of mining all frequent queries in a given database table, a problem known to be intractable even for conjunctive queries. We restrict our attention to projection-selection queries, and we assume that the table to be mined satisfies a set of functional dependencies. Under these assumptions we define a pre-ordering &cupre; over queries and we show the following: (a) the support measure is anti-monotonic (with respect to &cupre;), and (b) if we define q &cupre; q' iff q &cupre; q' and q' &cupre; q then all queries of an equivalence class have the same support.

With these results at hand, we further restrict our attention to star schemas of data warehouses. In those schemas, the set of functional dependencies satisfies an important property, namely, the union of keys of all dimension tables is a key for the fact table. The main contribution of this paper is the proposal of a level-wise algorithm for mining all frequent projection-selection queries in a data warehouse over a star schema. Moreover, we show that, in the case of a star schema, the complexity in the number of scans of our algorithm is similar to that of the well known Apriori algorithm, i.e., linear with respect to the number of attributes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
W. Armstrong. Dependency structures of data base relationships. In IFIP Congress, pages 580--583. North Holland, 1974.
 
3
 
4
 
5
A. Faye, A. Giacometti, D. Laurent, and N. Spyratos. Mining rules in databases with multiple tables: Problems and perspectives. In 3rd International Conference on Computing Anticipatory Systems (CASYS), 1999.
 
6
B. Goethals. Mining queries. In Workshop on inductive databases and constraint based mining. http://www.informatik.unifreiburg.de/ml/IDB/talks/Goethals_slides.pdf, 2004.
 
7
8
 
9
J. Han, Y. Fu, W. Wang, K. Koperski, and O. Zaiane. Dmql: A data mining query language for relational databases. In SIGMOD-DMKD'96, pages 27--34, 1996.
 
10
 
11
T. Y. Jen, D. Laurent, N. Spyratos, and O. Sy. Towards mining frequent queries in star schemes. In International Workshop on Knowledge Discovery in Databases (KDID), LNCS 3933, pages 104--123. Springer Verlag, 2005.
 
12
 
13
T. Turmeaux, A. Salleb, C. Vrain, and D. Cassard. Learning caracteristic rules relying on quantified paths. In PKDD, LNCS 2838, pages 471--482. Springer Verlag, 2003.
 
14

Collaborative Colleagues:
Tao-Yuan Jen: colleagues
Dominique Laurent: colleagues
Nicolas Spyratos: colleagues