|
ABSTRACT
Music similarity query based on acoustic content is becoming important with the ever-increasing growth of the music information from emerging applications such as digital libraries and WWW. However, relative techniques are still in their infancy and much less than satisfactory. In this paper, we present a novel index structure, called Composite Feature tree, CF-tree, to facilitate efficient content-based music search adopting multiple musical features. Before constructing the tree structure, we use PCA to transform the extracted features into a new space sorted by the importance of acoustic features. The CF-tree is a balanced multi-way tree structure where each level represents the data space at different dimensionalities. The PCA transformed data and reduced dimensions in the upper levels can alleviate suffering from dimensionality curse. To accurately mimic human perception, an extension, named CF+-tree, is proposed, which further applies multivariable regression to determine the weight of each individual feature. We conduct extensive experiments to evaluate the proposed structures against state-of-art techniques. The experimental results demonstrate superiority of our technique.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
|
| |
5
|
George Casella and Roger L. Berger. Statistical Inference. Duxbury Press, 2nd edition, 2001.
|
| |
6
|
|
| |
7
|
|
| |
8
|
B. Cui, J. Hu, H. T. Shen, and C. Yu. Adaptive quantization of the high-dimensional data for efficient knn processing. In Proc. 9th DASFAA Conference, 2004.
|
 |
9
|
|
| |
10
|
G.H. Golub and C.F. Van Loan. Matrix Computations. The Johns Hopkins University Press, 1989.
|
| |
11
|
D. Huron. Perceptual and cognitive applications in music information retrieval. In Proc. Internationl Symposim of Music Information Retrieval, 2000.
|
| |
12
|
I. T. Jolliffe. Principle Component Analysis. Springer-Verlag, 1986.
|
| |
13
|
G. Li and A. A. Khokhar. Content-based indexing and retrieval of audio data using wavelets. In IEEE Internal Conference on Multimedia and Expo, 2000.
|
 |
14
|
|
 |
15
|
|
| |
16
|
B. Logan. Mel frequency cepstral coefficients for music modeling. In Proc. Multimedia Information Retrieval, 2000.
|
| |
17
|
U. Nam and J. Berger. Addressing the same but different - different but similar problem in automatic music classification. In Proc. Multimedia Information Retrieval, 2001.
|
| |
18
|
|
 |
19
|
|
| |
20
|
T. Tolonen and M. Karjalainen. A computationally efficient multipitch analysis model. In IEEE Transactions on Speech and Audio Processing, 2000.
|
| |
21
|
G. Tzanetakis and P. Cook. Musical genre classification of audio signals. In IEEE Transactions on Speech and Audio Processing, pages 293--302, 2002.
|
| |
22
|
|
| |
23
|
|
CITED BY 4
|
|
|
|
|
Yi Yu , J. Stephen Downie , Lei Chen , Vincent Oria , Kazuki Joe, Searching musical audio datasets by a batch of multi-variant tracks, Proceeding of the 1st ACM international conference on Multimedia information retrieval, October 30-31, 2008, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|