| Towards efficient automated singer identification in large music databases |
| Full text |
Pdf
(263 KB)
|
| Source
|
Annual ACM Conference on Research and Development in Information Retrieval
archive
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
table of contents
Seattle, Washington, USA
SESSION: Speech and music
table of contents
Pages: 59 - 66
Year of Publication: 2006
ISBN:1-59593-369-7
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 89, Citation Count: 0
|
|
|
ABSTRACT
Automated singer identification is important in organising, browsing and retrieving data in large music databases. In this paper, we propose a novel scheme, called Hybrid Singer Identifier (HSI), for automated singer recognition. HSI can effectively use multiple low-level features extracted from both vocal and non-vocal music segments to enhance the identification process with a hybrid architecture and build profiles of individual singer characteristics based on statistical mixture models. Extensive experimental results conducted on a large music database demonstrate the superiority of our method over state-of-the-art approaches.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
M. Bartsch and G. Wakefield. Singing voice identification using spectral envelop estimation. IEEE Trans. on Speech and Audio Processing, 12(2):100--109, 2004.
|
| |
2
|
C. Becchetti, L. Ricotti, and L. Ricotti. Speech Recognition. John Wiley, 1999.
|
| |
3
|
A. Berenzweig, D. P. W. Ellis, and S. Lawrence. Using voice segments to improve artist classification of music. In AES 22nd International Conference on Virtual, Synthetic and Entertainment Audio, pages 119--122, 2002.
|
| |
4
|
A. L. Berenzweig and D. P. W. Ellis. Locating singing voice segments within music signals. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2001.
|
| |
5
|
Chih-Chung Chang and Chih-Jen Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~jlin/libsvm.
|
| |
6
|
|
| |
7
|
|
| |
8
|
T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer Verlag, 2001.
|
| |
9
|
Michael I. Jordan. Why the logistic function? a tutorial discussion on probabilities and neural networks. Technical Report 9503, MIT, 1995.
|
| |
10
|
G. Lebanon and J. Lafferty. Boosting and maximum likelihood for exponential model and bregman distances. In Proc. of the 14th Conf. Neural Information Processing Systems (NIPS), pages 110--121, 2002.
|
 |
11
|
|
 |
12
|
|
| |
13
|
A. Livshin and X. Rodet. Musical instrument identification in continuous recordings. In Proc. of the 7th Int. Conference on Digital Audio Effects (DAFX-04), pages 222--226, 2004.
|
 |
14
|
|
| |
15
|
|
| |
16
|
|
| |
17
|
J. Sundberg. The Science of the Singing Voice. Illinios University Press, 1987.
|
| |
18
|
T. Tolonen and M. Karjalainen. A computationally efficient multipitch analysis model. IEEE Trans. on Speech and Audio Processing, 8(4):708--716, 2000.
|
| |
19
|
W. H. Tsai and H. M. Wang. Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals. IEEE Trans. on Speech, Audio and Language Processing, 14(1):330--341, 2006.
|
| |
20
|
V. Vapnik. Statistical Learning Theory. John Wiley&Sons, 1998.
|
| |
21
|
B. Whitman, G. Flake, and S. Lawrence. Artist detection in music with minnowmatch. In IEEE Workshop on Neural Networks for Signal Processing, 2001.
|
| |
22
|
C. Xu, N.C. Maddage, and X. Shao. Automatic music classification and summarization. IEEE Trans. on Speech, Audio and Language Processing, 13(3):441--450, 2005.
|
| |
23
|
T. Zhang. Automatic singer identification. In Proc. of IEEE Int. Conference on Multimedia and Expo (ICME), pages 33--36, 2003.
|
|