ACM Home Page
Please provide us with feedback. Feedback
A survey on wavelet applications in data mining
Full text PdfPdf (330 KB)
Source ACM SIGKDD Explorations Newsletter archive
Volume 4 ,  Issue 2  (December 2002) table of contents
Pages: 49 - 68  
Year of Publication: 2002
ISSN:1931-0145
Authors
Tao Li  Univ. of Rochester, Rochester, NY
Qi Li  Univ. of Delaware, Newark, DE
Shenghuo Zhu  Univ. of Rochester, Rochester, NY
Mitsunori Ogihara  Univ. of Rochester, Rochester, NY
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 75,   Downloads (12 Months): 575,   Citation Count: 18
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/772862.772870
What is a DOI?

ABSTRACT

Recently there has been significant development in the use of wavelet methods in various data mining processes. However, there has been written no comprehensive survey available on the topic. The goal of this is paper to fill the void. First, the paper presents a high-level data-mining framework that reduces the overall process into smaller components. Then applications of wavelets for each component are reviewd. The paper concludes by discussing the impact of wavelets on data mining research and outlining potential future research directions and applications.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
F. Abramovich, T. Bailey, and T. Sapatinas. Wavelet analysis and its statistical applications. JRSSD, (48):1--30, 2000.
 
2
F. Abramovich and Y. Benjamini. Thresholding of wavelet coecients as multiple hypotheses testing procedure. In A. Antoniadis and G. Oppenheim, editors, Wavelets and Statistics, Lecture Notes in Statistics 103, pages 5--14. Springer-Verlag, New York, 1995.
 
3
F. Abramovich and T. Sapatinas. Bayesian approach to wavelet decomposition and shrinkage. In P. Muller and B. Vidakovic, editors, Lecture Notes in Statistics. Springer-Verlag, New York, 1999.
 
4
F. Abramovich, T. Sapatinas, and B. Silverman. Wavelet thresholding via a Bayesian approach. Journal of the Royal Statistical Society, Series B, (58), 1997.
 
5
C. C. Aggarwal, J. L. Wolf, P. S. Yu, C. Procopiuc, and J. S. Park. Fast algorithms for projected clustering. pages 61--72, 1999.
6
7
 
8
H. Akaike. Information theory and an extension of the maximum likelihood principle. In In Proc. 2nd Int. Symp. Info. Theory, pages 267--281, 1973.
 
9
A. Aldroubi and M. Unser, editors. Wavelets in Medicine and Biology. CRC Press, Boca Raton, 1996.
 
10
A. Antoniadis. Wavelets in statistics: a review. J. It. Statist. Soc., 1999.
 
11
A. Antoniadis, G. Grégoire, and I. W. McKeague. Wavelet methods for curve estimation. Journal of the American Statistical Association, 89(428):1340--1353, 1994.
 
12
A. Antoniadis and G. Oppenhiem, editors. Wavelets and Statistics, Lecture Notes in Statistics. Springer-Verlag, 1995.
 
13
 
14
A. Arning, R. Agrawal, and P. Raghavan. A linear method for deviation detection in large databases. In Knowledge Discovery and Data Mining, pages 164--169, 1996.
 
15
T. C. Bailey, T. Sapatinas, K. J. Powell, and W. J. Krzanowski. Signal detection in underwater sounds using wavelets. Journal of the American Statistical Association, 93(441):73--83, 1998.
 
16
B. Bakshi. Multiscale pca with application to multivariate statistical process monitoring. AIChE Journal, 44(7):1596--1610, 1998.
 
17
B. Bakshi. Multiscale analysis and modeling using wavelets. Journal of Chemometrics, (13):415--434, 1999.
 
18
B. R. Bakshi, A. Koulouris, and G. Stephanopoulos. Learning at multiple resolutions: Wavelets as basis functions in artificial neural networks and inductive decision trees. In R. Motard and B. Joseph, editors, Wavelet Applications in Chemical Engineering. Kluwer Inc., Boston, 1994.
 
19
 
20
A. Bell and T. Sejnowski. Fast blind separation based on information theory. In Proc. Intern. Symp. on Nonlinear Theory and Applications, Las Vegas, 1995.
 
21
 
22
M. Berger and I. Rigoutsos. An algorithm for point clustering and grid generation. IEEE Trans. on Systems, Man and Cybernetics, 21(5):1278--1286, 1991.
 
23
M. Blume and D. Ballard. Image annotation based on learning vector quantization and localized haar wavelet transform features. In Proc. SPIE 3077, pages 181--190, 1997.
 
24
C. Brambilla, A. D. Ventura, I. Gagliardi, and R. Schettini. Multiresolution wavelet transform and supervised learning for content-based image retrieval. In Proceedings of the IEEE International Conference on Multimedia Computing and Systems, volume I, 1999.
25
 
26
M. Brito, E. Chavez, A. Quiroz, and J. Yukich. Connectivity of the mutual K-Nearest-Neighbor graph for clustering and outlier detection. Statistics and Probability Letters, 35:33--42, 1997.
 
27
A. Bruce and H.-Y. Gao. Waveshrink with firm shrinkage. Statistica Sinica, (4):855--874, 1996.
28
 
29
R. W. Buccigrossi and E. P. Simoncelli. Image compression via joint statistical characterization in the wavelet domain. In Proceedings ICASSP-97 (IEEE International Conference on Acoustics, Speech and Signal Processing), number 414, Munich, Germany, 1997.
 
30
T. Cai. Adaptive wavelet estimation: a block thresholding and oracle inequality approach. Technical Report 98--07, Department of Statistics, Purdue University, 1998.
 
31
J. P. Campbell. Speaker recognition: A tutorial. In Proceedings of the IEEE, volume 85, pages 1437--1461, Sept. 1997.
 
32
G. Carpenter. Wsom: building adaptive wavelets with self-organizing maps. In Proc. of 1998 IEEE International Joint Conference on Neural Networks, volume 1, pages 763--767, 1998.
 
33
V. Castelli and I. Kontoyiannis. Wavelet-based classification: Theoretical analysis. Technical Report RC-20475, IBM Watson Research Center, 1996.
 
34
V. Castelli and I. Kontoyiannis. An efficient recursive partitioning algorithm for classification, using wavelets. Technical Report RC-21039, IBM Watson Research Center, 1997.
 
35
V. Castelli, C. Li, J. Turek, and I. Kontoyiannis. Progressive classification in the compressed domain for large eos satellite databases, April 1996.
 
36
 
37
A. Chambolle, R. DeVore, N. Lee, and B. Lucier. Nonlinear wavelet image processing: Variational problems, compression, and noise removal through wavelet shrinkage. IEEE Tran. Image Proc., 7(3):319--333, 1998.
 
38
P.-R. Chang and B.-F. Yeh. Nonlinear communication channel equalization using wavelet neural networks. In Proc. of 1994 IEEE International Conference Joint on Neural Networks, volume 6, pages 3605--3610, 1994.
 
39
S. Chang, B. Yu, and M. Vetterli. Spatially adaptive wavelet thresholding with context modeling for image denoising. In ICIP, volume 1, pages 535--539, 1998.
 
40
T. Chang and C. Kuo. Texture analysis and classification with tree-structured wavelet transform. IEEE Trans. on Image Processing, 2(4):429--441, 1993.
 
41
P. Cheeseman, J. Kelly, and M. Self. AutoClass: A bayesian classification system. In ICML'88, 1988.
 
42
B. Chen, X.Z.Wang, S. Yang, and C. McGreavy. Application of wavelets and neural networks to diagnostic system development, 1, feature extraction. Computers and Chemical Engineering, (23):899--906, 1999.
 
43
B. Chen, X.Z.Wang, S. Yang, and C. McGreavy. Application of wavelets and neural networks to diagnostic system development, 2, an integrated framework and its application. Computers and Chemical Engineering, (23):945--954, 1999.
 
44
C. Chiann and P. A. Morettin. A wavelet analysis for time series. Journal of Nonparametric Statistics, 10(1):1--46, 1999.
 
45
 
46
 
47
 
48
 
49
P. Cristea, R. Tuduce, and A. Cristea. Time series prediction with wavelet neural networks. In Proc. of the 5th Seminar on Neural Network Applications in Electrical Engineering, pages 5--10, 2000.
50
 
51
I. Daubechies. Orthonormal bases of compactly support wavelets. Comm. Pure Applied Mathematics, 41:909--996, 1988.
 
52
 
53
I. Daubechies, B. Han, A. Ron, and Z. Shen. Framelets: Mrabased constructions of wavelet frames, 2000. preprint.
 
54
 
55
R. A. DeVore, B. Jawerth, and B. J. Lucier. image compression through wavelet transform coding. IEEE Transactions on Information Theory, 38(2):719--746, 1992.
 
56
P. Q. Dinh, C. Dorai, and S. Venkatesh. Video genre categorization using audio wavelet coefficients. In ACCV 2002, 2002.
 
57
D. Donoho, I. Johnstone, G. Kerkyacharian, and D. Picard. Density estimation by wavelet thresholding. Ann. Statist., (24):508--539, 1996.
 
58
D. L. Donoho and I. M. Johnstone. Ideal spatial adaptation by wavelet shrinkage. Biometrika, 81(3):425--455, 1994.
 
59
D. L. Donoho and I. M. Johnstone. Adapting to unknown smoothness via wavelet shrinkage. Journal of the American Statistical Association, 90(432):1200--1224, 1995.
 
60
D. L. Donoho and I. M. Johnstone. Minimax estimation via wavelet shrinkage. Annals of Statistics, 26(3):879--921, 1998.
 
61
D. L. Donoho, I. M. Johnstone, G. Kerkyacharian, and D. Picard. Wavelet shrinkage: Asymptopia? J. R. Statist. Soc. B., 57(2):301--337, 1995.
 
62
 
63
G. Fan and X. Xia. Wavelet-based statistical image processing using hidden markov tree model. In Proc. 34th Annual Conference on Information Sciences and Systems, Princeton, NJ, USA, 2000.
 
64
 
65
G. C. Feng, P. C. Yuen, and D. Q. Dai. Human face recognition using PCA on wavelet subband. SPIE Journal of Electronic Imaging, 9(2), 2000.
 
66
D. H. Fisher. Iterative optimization and simplification of hierarchical clusterings. Technical Report CS-95-01, Vanderbilt U., Dept. of Comp. Sci., 1995.
 
67
P. Flandrin. Wavelet analysis and synthesis of fractional Brownian motion. IEEE Transactions on Information Theory, 38(2):910--917, 1992.
68
 
69
H.-Y. Gao. Threshold selection in WaveShrink, 1997. theory for matlab wavelet toolbox on denoising.
70
 
71
72
 
73
 
74
D. Gunopulos. Tutorial slides: Dimensionality reduction techniques. In DIMACS Summer School on Data Mining, August 2001.
 
75
A. Haar. Zur theorie der orthogonalen funktionensysteme. Mathematics Annal., 69:331--371, 1910.
 
76
P. Hall, G. Kerkyacharian, and D. Picard. Block threshold rules for curve estimation using kernel and wavelet methods. Ann. Statist., (26):922--942, 1998.
 
77
 
78
 
79
 
80
S. Haykin. Neural Networks. Prentice Hall, 1999.
 
81
 
82
P. Hornby, F. Boschetti, and F. Horowitz. Analysis of potential field data in the wavelet domain. In Proceedings of the 59th EAGE Conference, May 1997.
 
83
Y. Huhtala, J. Karkkainen, and H. Toivonen. Mining for similarities in aligned time series using wavelets. In Data Mining and Knowledge Discovery: Theory, Tools, and Technology. SPIE Proc., 1999.
 
84
O. F. J. Z. Wang, G. Wiederhold and S. X. Wei. Wavelet-based image indexing techniques with partial sketch retrieval capability. IEEE Advances in Digital Libraries, 1(4):311--328, May 1997.
85
 
86
M. Jansen, M. Malfait, and A. Bultheel. Generalized cross validation for wavelet thresholding, 1995. preprint, Dec. 1995.
87
88
 
89
 
90
G. K. An Introduction to Neural Networks. UCL Press, 1997.
 
91
L. Kaplan and C. Kuo. Fractal estimation from noisy data via discrete fractional Gaussian noise (DFGN) and the Harr basis. IEEE Transactions on Information Theory, 41(12):3554--3562, 1993.
 
92
H. Kargupta and B. Park. The collective data mining: A technology for ubiquitous data analysis from distributed heterogeneous sites, 1998. Submitted to IEEE Computer Special Issue on Data Mining.
 
93
D. Keim and M. Heczko. Wavelets and their applications in databases. Tutorial Notes of ICDE 2001, 2001.
 
94
 
95
K. Kobayashi and T. Torioka. A wavelet neural network for function approximation and network optimization. In Proceedings of ANNIE'94, AMSE Press., pages 505--510, 1994.
 
96
K. Kobayashi and T. Torioka. Designing wavelet networks using genetic algorithms. In Proceedings of EUFIT'97, volume 1, 1997.
 
97
P. Kostka, E. Tkacz, Z. Nawrat, and Z. Malota. An application of wavelet neural networks (wnn) for heart valve prostheses characteristic. In Proc. of the 22nd Annual International Conference of IEEE, Engineering in Medicine and Biology Society, volume 4, pages 2463--2465, 2000.
 
98
A. Koulouris, B. R. Bakshi, and G. Stephanopoulos. Empirical learning through neural networks: The wave-net solution. Intelligent Systems in Process Engineering, pages 437--484, 1996.
 
99
V. Kreinovich, O. Sirisaengtaksin, and S. Cabrera. Wavelet neural networks are optimal approximators for functions of one variable. Technical Report 29, University of Texas at El Paso/University of Houston, 1992.
 
100
A. Laine and J. Fan. texture classification by wavelet packet signatures. Technical report, University of Florida, 1992.
 
101
T. Lambrou, P. Kudumakis, R. Speller, M. Sandler, and A. Linney. Classification of audio signals using statistical features on time and wavelet tranform domains. In Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP-98), volume 6, pages 3621--3624, 1998.
 
102
P. C. Lemarié and Y. Meyer. Ondelettes et bases hilbertiennes. Rev. Mat. Ibero-Amer, pages 1--18, 1986.
 
103
G. Li and A. A. Khokhar. Content-based indexing and retrieval of audio data using wavelets. In IEEE International Conference on Multimedia and Expo (II), pages 885--888, 2000.
 
104
 
105
S.-T. Li and S.-W. Chou. Multi-resolution spatio-temporal data mining for the study of air pollutant regionalization. In Proceedings of the 33rd Hawaii International Conference on System Sciences, 2000.
 
106
S. Ma and C. Ji. Modeling heterogeneous network traffic in wavelet domain: Part I-temporal correlation. Technical report, 1999.
 
107
S. Ma and C. Ji. Modeling heterogeneous network traffic in wavelet domain: Part II-non-gaussian traffic. Technical report, IBM Waston Research Center, 1999.
 
108
 
109
 
110
S. Mallat. A Wavelet Tour of Signal Processing. Academic Press, San Diego, 1998.
 
111
S. G. Mallat and W. L. Hwang. Singularity detection and processing with wavelets. IEEE Transactions on Information Theory, 38(2):617--643, 1992.
 
112
113
 
114
 
115
P. Meerwald and A. Uhl. A survey of wavelet-domain watermarking algorithms. In Proceedings of SPIE, Electronic Imaging, Security and Watermarking of Multimedia Contents III, volume 4314, San Jose, CA, USA, jan 2001. SPIE.
 
116
Y. Meyer. Wavelets and Operatiors. Cambridge University Press, 1992.
 
117
Y. Meyer. Wavlets--Algorithms and Application. SIAM, 1993.
 
118
 
119
 
120
A. Mojsilovic and M. v. Popovic. Wavelet image extension for analysis and classification of infarcted myocardial tissue. IEEE Transactions on Biomedical Engineering, 44(9), September 1997.
 
121
P. Morettin. Wavelets in statistics. (3):211--272, 1997.
 
122
P. A. Morettin. From fourier to wavelet analysis of time series. In A. Prat, editor, Proceedings in Computational Statistics, pages 111--122, 1996.
 
123
G. P. Nason. Wavelet shrinkage by cross-validation. Journal of the Royal Statistical Society B, 58:463--479, 1996.
 
124
G. P. Nason and R. von Sachs. Wavelets in time series analysis. Philosophical Transactions of the Royal Society of London A, 357(1760):2511--2526, 1999.
125
 
126
 
127
 
128
 
129
D. Percival and A. T. Walden. Wavelet Methods for Time Series Analysis. Cambridge University Press, 2000.
 
130
R. Polikar. The wavelet tutorial. Internet Resources: http://engineering.rowan.edu/polikar/WAVELETS/WTtutorial.html.
 
131
 
132
I. Popivanov and R. J. Miller. Similarity search over time series data using wavelets. In ICDE 2002, 2002.
 
133
L. Prasad, S. S. Iyengar, and S. S. Ayengar. Wavelet Analysis with Applications to Image Processing. CRC Press, 1997.
 
134
 
135
S. Ramaswamy, R. Rastogi, and K. Shim. Efficient algorithms for mining outliers from large data sets. pages 427--438, 2000.
 
136
R. H. Riedi, M. S. Crouse, V. J. Ribeiro, and R. G. Baraniuk. A multifractal wavelet model with application to network traffic. IEEE Transactions on Information Theory, 45(4):992--1018, 1999.
 
137
J. B. T. M. Roerdink and M. A. Westenberg. Wavelet-based volume visualization. Nieuw Archief voor Wiskunde, 17(2):149--158, 1999.
 
138
A. Ron. Frames and stable bases for shift invariant subspaces of 1. Canad. J. Math., (47):1051--1094, 1995.
 
139
D. Roverso. Multivariate temporal classification by windowed wavelet decomposition and recurrent neural networks. In In International Topical Meeting on Nuclear Plant Instrumentation, Controls, and Human-Machine Interface Technologies (NPIC&HMIT 2000), Washington, DC, November 2000.
 
140
S. Santini and A. Gupta. A data model for querying wavelet features in image databases. In Multimedia Information Systems, pages 21--30, 2001.
 
141
S. Santini and A. Gupta. Wavelet data model for image databases. In IEEE Intl. Conf. on Multimedia and Expo, Tokyo, Japan, August 2001.
 
142
P. Scheunders, S. Livens, G. V. de Wouwer, P. Vautrot, and D. V. Dyck. Wavelet-based texture analysis. International Journal on Computer Science and Information Management, ijcsim, 1(2):22--34, 1998.
 
143
Cyrus Shahabi , Seokkyung Chung , Maytham Safar , George Hajj, 2D TSA-tree: A Wavelet-Based Approach to Improve the Efficiency of Multi-Level Spatial Data Mining, Proceedings of the 13th International Conference on Scientific and Statistical Database Management, p.59-68, July 18-20, 2001
 
144
 
145
 
146
 
147
P. Smyth. Probabilistic model-based clustering of multivariate and sequential data. In Proceedings of Artificial Intelligence and Statistics, 1999.
 
148
 
149
 
150
 
151
G. Strang. Wavelet transforms versus fourier transforms. Bull. Amer. Math. Soc., (new series 28):288--305, 1990.
 
152
V. Strela. Denoising via block wiener filtering in wavelet domain. In 3rd European Congress of Mathematics, Barcelona. Birkhauser Verlag., July 2000.
 
153
 
154
 
155
S. R. Subramanya and A. Youssef. Wavelet-based indexing of audio data in audio/multimedia databases. In IWMMDBMS, pages 46--53, 1998.
 
156
G. Tzanetakis and P. Cook. Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing, 10(5), July 2002.
 
157
G. Tzanetakis, G. Essl, and P. Cook. Automatic musical genre classification of audio signals. pages 205--210, Bloomington, IN, USA, October 2001.
 
158
P. Vaidyanathan. Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial. In Proc. IEEE, number 78, pages 56--93, 1990.
 
159
M. Vannucci and F. Corradi. Covariance structure of wavelet coefficients: theory and models in a bayesian perspective. J. R. Statist. Soc. B, (61):971--986, 1999.
 
160
R. Venkatesan, S. Koon, M. Jakubowski, and P. Moulin. Robust image hashing. In Current proceedings, 2000.
 
161
B. Vidakovic. Nonlinear wavelet shrinkage with Bayes rules and Bayes factors. Journal of the American Statistical Association, 93(441):173--179, 1998.
 
162
B. Vidakovic. Wavelet-based nonparametric bayes methods. Technical report, ISDS, Duke University, 1998.
 
163
B. Vidakovic. Statistical Modeling by Wavelets. John Wiley & Sons, New york, 1999.
 
164
J. S. Vitter and M. Wang. Approximate computation of multidimensional aggregates of sparse data using wavelets. pages 193--204, 1999.
 
165
J. S. Walker. A Primer on Wavelets for Their Scientific Applications (Studies in Advanced Mathematics). CRC Press, 1999.
 
166
 
167
J. Z. Wang, G. Wiederhold, O. Firschein, and S. X. Wei. Content-based image indexing and searching using daubechies' wavelets. International Journal on Digital Libraries, 1(4):311--328, 1997.
 
168
Y. Wang. Jump and sharp cusp detection by wavelets. Biometrika, 82(2):385--397, 1995.
 
169
P. Wojtaszczyj. A Mathematical Introduction to Wavelets. Cambridge University Press, Cambridge, 1997.
 
170
 
171
B. J. Woodford and N. K. Kasabov. A wavelet-based neural network classifier for temporal data. In Presented at the 5th Australasia-Japan Joint Workshop, University of Otago, Dunedin, New Zealand, November 2001.
172
 
173
T. Yamakawa, E. Uchino, and T. Samatsu. Wavelet neural network employing over-complete number of compactly supported non-orthogonal wavelets and their applciations. In Proc. of 1994 IEEE International Conference Joint on Neural Networks, volume 3, pages 1391--1396, 1994.
 
174
R. Young. Wavelet Theory and its Application. Kluwer Academic Publishers, Bonston, 1993.
 
175
A. Zhai, P. Huang, and T. J. yu Pan. A study on web-log using wavelet. In Research and Development in Information Retrieval, 2001.
176
 
177

CITED BY  18
Collaborative Colleagues:
Tao Li: colleagues
Qi Li: colleagues
Shenghuo Zhu: colleagues
Mitsunori Ogihara: colleagues