ABSTRACT
As the amount of MP3 music data available on the Internet grows, music classification and content-based music retrieval are attracting increasing attention. In this paper, we propose an approach to automatically classify MP3 music objects according to their singers. First, the coefficients extracted from the output of the polyphase filters are used to compute the MP3 features for segmentation. Based on these features, an MP3 music object can be decomposed into a sequence of notes (or phonemes). Then, for each MP3 phoneme in the training set, its MP3 feature is extracted and used to train an MP3 classifier that can identify the singer of an unknown MP3 music object. Experiments are performed and analyzed to show the effectiveness of the proposed method.
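The pipeline summarized above (subband coefficients, segmentation into phonemes, per-phoneme features, classifier) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the segmentation rule, the feature, or the classifier, so the energy-rise onset test, the mean-magnitude feature, the k-nearest-neighbour vote, and every function and parameter name below are assumptions made for illustration. Each frame is assumed to be a list of already decoded subband coefficients (MPEG-1 audio uses 32 subbands; the toy usage below uses 4 for brevity).

```python
import math
from collections import Counter


def frame_energy(frame):
    """Sum of squared subband coefficients for one frame."""
    return sum(c * c for c in frame)


def segment(frames, ratio=2.0):
    """Split frames at sharp energy rises (assumed stand-in for phoneme onsets).

    Returns (start, end) index pairs with end exclusive.
    """
    boundaries = [0]
    for i in range(1, len(frames)):
        prev, cur = frame_energy(frames[i - 1]), frame_energy(frames[i])
        if cur > 0 and cur > ratio * prev:
            boundaries.append(i)
    boundaries.append(len(frames))
    return [(s, e) for s, e in zip(boundaries, boundaries[1:]) if e > s]


def segment_feature(frames, start, end):
    """Assumed feature: mean coefficient magnitude per subband over a segment."""
    n = end - start
    bands = len(frames[start])
    return [sum(abs(frames[t][b]) for t in range(start, end)) / n
            for b in range(bands)]


def classify(feature, training, k=3):
    """k-nearest-neighbour vote; training is a list of (feature, singer) pairs."""
    nearest = sorted(training, key=lambda item: math.dist(item[0], feature))[:k]
    return Counter(singer for _, singer in nearest).most_common(1)[0][0]


def identify_singer(frames, training, k=3):
    """Classify every segment, then take a majority vote over the whole object."""
    votes = Counter(
        classify(segment_feature(frames, s, e), training, k)
        for s, e in segment(frames)
    )
    return votes.most_common(1)[0][0]


# Toy usage with hypothetical 4-band frames and two made-up singers.
training = [
    ([1.0, 1.0, 0.0, 0.0], "A"), ([0.9, 1.1, 0.0, 0.0], "A"),
    ([1.1, 0.9, 0.1, 0.0], "A"), ([0.0, 0.0, 1.0, 1.0], "B"),
    ([0.0, 0.1, 0.9, 1.1], "B"), ([0.0, 0.0, 1.1, 0.9], "B"),
]
frames = [
    [0.1, 0.1, 0, 0], [1, 1, 0, 0], [1, 1, 0, 0],
    [0.1, 0.1, 0, 0], [1, -1, 0, 0], [1, 1, 0, 0],
]
print(segment(frames))
print(identify_singer(frames, training))
```

Voting over segments rather than classifying the whole object at once mirrors the abstract's per-phoneme training; a real system would read the coefficients from an MP3 decoder rather than from hand-written lists.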
CITED BY 4
Jialie Shen , Bin Cui , John Shepherd , Kian-Lee Tan, Towards efficient automated singer identification in large music databases, Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, August 06-11, 2006, Seattle, Washington, USA
INDEX TERMS
Primary Classification: H. Information Systems > H.3 INFORMATION STORAGE AND RETRIEVAL > H.3.3 Information Search and Retrieval
Subjects: Retrieval models
Additional Classification: H. Information Systems > H.3 INFORMATION STORAGE AND RETRIEVAL > H.3.3 Information Search and Retrieval
Subjects: Selection process; Search process
General Terms: Algorithms, Design, Documentation, Experimentation, Measurement, Performance
Keywords: MP3, MP3 classification, MP3 databases, content-based music classification, music classification, music databases, music feature extraction, singer identification