|
ABSTRACT
Using visualization techniques to explore and understand high-dimensional data is an efficient way to combine human intelligence with the immense brute force computation power available nowadays. Several visualization techniques have been developed to study the cluster structure of data, i.e., the existence of distinctive groups in the data and how these clusters are related to each other. However, only few of these techniques lend themselves to studying how this structure changes if the features describing the data are changed. Understanding this relationship between the features and the cluster structure means understanding the features themselves and is thus a useful tool in the feature extraction phase.In this paper we present a novel approach to visualizing how modification of the features with respect to weighting or normalization changes the cluster structure. We demonstrate the application of our approach in two music related data mining projects.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
|
| |
5
|
J.-J. Aucouturier and F. Pachet. Music Similarity Measures: What's the Use? In Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR'02), Paris, France, 2002.
|
| |
6
|
|
| |
7
|
P. Cano, M. Kaltenbrunner, F. Gouyon, and E. Batlle. On the Use of Fastmap for Audio Retrieval and Browsing. In Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR'02), Paris, France, 2002.
|
| |
8
|
|
 |
9
|
|
 |
10
|
Venkatesh Ganti , Johannes Gehrke , Raghu Ramakrishnan, A framework for measuring changes in data characteristics, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.126-137, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
[doi> 10.1145/303976.303989]
|
| |
11
|
H. Hotelling. Analysis of a Complex of Statistical Variables into Principal Components. Journal of Educational Psychology, 24:417--441 and 498--520, 1933.
|
 |
12
|
|
| |
13
|
S. Kaski. Data Exploration Using Self-Organizing Maps. PhD thesis, Helsinki University of Technology, Department of Computer Science and Engineering, 1997.
|
| |
14
|
S. Kaski. Fast Winner Search for SOM-Based Monitoring and Retrieval of High-Dimensional Data. In Proceedings of the 9th International Conference on Artificial Neural Networks (ICANN'99), London, UK, 1999.
|
| |
15
|
S. Kaski, T. Honkela, K. Lagus, and T. Kohonen. WEBSOM -- Self-Organizing Maps of Document Collections. Neurocomputing, 21:101--117, 1998.
|
| |
16
|
|
| |
17
|
T. Kohonen. Self-Organizing Formation of Topologically Correct Feature Maps. Biological Cybernetics, 43:59--69, 1982.
|
| |
18
|
T. Kohonen. The Adaptive-Subspace SOM (ASSOM) and its use for the Implementation of Invariant Feature Detection. In Proceedings of the International Conference on Artificial Neural Networks (ICANN'95), Paris, France, 1995.
|
| |
19
|
|
| |
20
|
T. Kohonen, S. Kaski, K. Lagus, J. Salojärvi, J. Honkela, V. Paatero, and A. Saarela. Self Organization of a Massive Document Collection. IEEE Transactions on Neural Networks, Special Issue on Neural Networks for Data Mining and Knowledge Discovery, 11(3):574--585, 2000.
|
| |
21
|
J. B. Kruskal and M. Wish. Multidimensional Scaling. Number 07--011 in Paper Series on Quantitative Applications in the Social Sciences. Sage Publications, Newbury Park, CA, 1978.
|
| |
22
|
K. Lagus and S. Kaski. Keyword Selection Method for Characterizing Text Document Maps. In Proceedings of the 9th International Conference on Artificial Neural Networks (ICANN'99), London, UK, 1999.
|
| |
23
|
|
| |
24
|
B. Logan. Mel Frequency Cepstral Coefficients for Music Modeling. In Proceedings of the International Symposium on Music Information Retrieval (ISMIR), Plymouth, MA, 2000.
|
| |
25
|
J. MacQueen. Some Methods for Classification and Analysis of Multivariate Observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley and Los Angeles, CA, 1967.
|
| |
26
|
E. Pampalk. Islands of Music: Analysis, Organization, and Visualization of Music Archives. Master's thesis, Vienna University of Technology, 2001. http://www.oefai.at/~elias/music/thesis.html.
|
 |
27
|
|
| |
28
|
|
| |
29
|
A. Rauber. LabelSOM: On the Labeling of Self-Organizing Maps. In Proceedings of the International Joint Conference on Neural Networks (IJCNN'99), Washington, DC, 1999.
|
| |
30
|
J. W. Sammon. A Nonlinear Mapping for Data Structure Analysis. IEEE Transactions on Computers, 18:401--409, 1969.
|
| |
31
|
A. Ultsch and H. P. Siemon. Kohonen's Self-Organizing Feature Maps for Exploratory Data Analysis. In Proceedings of the International Neural Network Conference (INNC'90), Dordrecht, Netherlands, 1990.
|
| |
32
|
J. Vesanto and J. Ahola. Hunting for Correlations in Data Using the Self-Organizing Map. In Proceedings of the International ICSC Congress on Computational Intelligence Methods and Applications (CIMA'99), Rochester, NY, 1999.
|
| |
33
|
J. Vesanto and E. Alhoniemi. Clustering of the Self-Organizing Map. IEEE Transactions on Neural Networks, 11(3):586--600, 2000.
|
 |
34
|
|
| |
35
|
Gerhard Widmer, Using AI and machine learning to study expressive music performance: project survey and first report, AI Communications, v.14 n.3, p.149-162, September 2001
|
| |
36
|
|
| |
37
|
E. Zwicker and H. Fastl. Psychoacoustics, Facts and Models. Springer, Berlin, Germany 2nd edition, 1999.
|
|