ACM Home Page
Please provide us with feedback. Feedback
On the extraction of vocal-related information to facilitate the management of popular music collections
Full text PdfPdf (243 KB)
Source International Conference on Digital Libraries archive
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries table of contents
Denver, CO, USA
SESSION: Tools & techniques track: automatically managing media table of contents
Pages: 197 - 206  
Year of Publication: 2005
ISBN:1-58113-876-8
Authors
Wei-Ho Tsai  Academia Sinica, Taipei, Taiwan, Republic of China
Hsin-Min Wang  Academia Sinica, Taipei, Taiwan, Republic of China
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 59,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1065385.1065432
What is a DOI?

ABSTRACT

With the explosive growth of networked collections of musical material, there is a need to establish a mechanism like a digital library to manage music data. This paper presents a content-based processing paradigm of popular song collections to facilitate the realization of a music digital library. The paradigm is built on the automatic extraction of information of interest from music audio signals. Because the vocal part is often the heart of a popular song, we focus on developing techniques to exploit the solo vocal signals underlying an accompanied performance. This supports the necessary functions of a music digital library, namely, music data organization, music information retrieval/recommendation, and copyright protection.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Akeroyd, M. A., Moore, B. C. J., and Moore, G. A. Melody Recognition Using Three Types of Dichotic-pitch Stimulus. Journal of the Acoustical Society of America, 110 (3), 2001, 1498-1504.
2
 
3
 
4
Davis, S. B., and Mermelstein, P. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, speech, and Signal Processing, 28, 1980, 357-366.
 
5
Dempster, A., Laird, N., and Rubin, D. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, 39, 1977, 1-38.
 
6
Durey, A. S., and Clements, M. A. Features for Melody Spotting Using Hidden Markov Models. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
 
7
Eronen, A. Musical Instrument Recognition Using ICA-based Transform of Features and Discriminatively Trained HMMS. In Proceedings of the International Symposium on Signal Processing and Its Applications, 2003.
 
8
 
9
Haitsma, J., and Kalker, T. A Highly Robust Audio Fingerprinting System, In Proceedings of the International Conference on Music Information Retrieval, 2002.
 
10
Herrera, P., Amatriain, X., Batlle, E., and Serra. X. Towards Instrument Segmentation for Music Content Description: A Critical Review of Instrument Classification Techniques. In Proceedings of the International Symposium on Music Information Retrieval, 2000.
 
11
 
12
ISO-IEC/JTC1 SC29 WG11 Moving Pictures Expert Group. Information technology - multimedia content description interface - part 4: Audio. Committee Draft 15938-4, ISO/IEC, 2000.
 
13
Kim, Y. E., and Whitman, B. Singer Identification in Popular Music Recordings Using Voice Coding Features. In Proceedings of the International Conference on Music Information Retrieval, 2002.
14
15
 
16
Liu, D., Lu, L., and Zhang, H. J. Automatic Mood Detection from Acoustic Music Data. In Proceedings of the International Conference on Music Information Retrieval, 2003.
 
17
Martin, A., Doddington, G., Kamm, T., Ordowski, M., and Przybocki, M. The DET Curve in Assessment of Detection Task Performance. In Proceedings of the European Conference on Speech Communication and Technology, 1997.
18
 
19
Oppenheim, A. V., and Schafer, R. W. Homomorphic Analysis of Speech. IEEE Transactions on Audio and Electroacoustics, 16, 1968, 221-226.
 
20
Reynolds, D. A., and Rose, R. C. Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing, 3 (1), 1995, 72-83.
 
21
Reynolds, D. A., Quatieri, T. F., and Dunn, R. B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 10, 2000, 19-41.
 
22
 
23
Solomonoff, A., Mielke, A., Schmidt, M., and Gish, H. Clustering Speakers by Their Voices. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1998.
 
24
 
25
Tsai, W. H., and Wang, H. M. A Query-by-example Framework to Retrieve Music Documents by Singer. In Proceedings of the IEEE International Conference on Multimedia and Expo, 2004.
 
26
Tzanetakis, G., and Cook, P. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, 10 (5), 2002, 293-302.
 
27
Tzanetakis, G., Gao, J., and Steenkiste, P. A Scalable Peer-to-Peer System for Music Content and Information Retrieval. In Proceedings of the International Conference on Music Information Retrieval, 2003.
 
28
Venkatachalam, V., Cazzanti, L., Dhillon, N., and Wells, M. Automatic Identification of Sound Recordings. IEEE Signal Processing Magazine, March 2004, 92-99.
 
29
Wang, C. K., Lyu, R. Y., and Chiang, Y. C. An Automatic Singing Transcription System with Multilingual Singing Lyric Recognizer and Robust Melody Tracker. In Proceedings of the European Conference on Speech Communication and Technology, 2003.
 
30
Whitman, B., Flake, G., and Lawrence, S. Artist Detection in Music with Minnowmatch. In Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, 2001.
 
31
Yang, D., and Lee, W. Disambiguating Music Emotion Using Software Agents. In Proceedings of the International Conference on Music Information Retrieval, 2004.


Collaborative Colleagues:
Wei-Ho Tsai: colleagues
Hsin-Min Wang: colleagues