|
ABSTRACT
With the explosive growth of networked collections of musical material, there is a need to establish a mechanism like a digital library to manage music data. This paper presents a content-based processing paradigm of popular song collections to facilitate the realization of a music digital library. The paradigm is built on the automatic extraction of information of interest from music audio signals. Because the vocal part is often the heart of a popular song, we focus on developing techniques to exploit the solo vocal signals underlying an accompanied performance. This supports the necessary functions of a music digital library, namely, music data organization, music information retrieval/recommendation, and copyright protection.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Akeroyd, M. A., Moore, B. C. J., and Moore, G. A. Melody Recognition Using Three Types of Dichotic-pitch Stimulus. Journal of the Acoustical Society of America, 110 (3), 2001, 1498-1504.
|
 |
2
|
David Bainbridge , Craig G. Nevill-Manning , Ian H. Witten , Lloyd A. Smith , Rodger J. McNab, Towards a digital library of popular music, Proceedings of the fourth ACM conference on Digital libraries, p.161-169, August 11-14, 1999, Berkeley, California, United States
[doi> 10.1145/313238.313295]
|
| |
3
|
|
| |
4
|
Davis, S. B., and Mermelstein, P. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, speech, and Signal Processing, 28, 1980, 357-366.
|
| |
5
|
Dempster, A., Laird, N., and Rubin, D. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, 39, 1977, 1-38.
|
| |
6
|
Durey, A. S., and Clements, M. A. Features for Melody Spotting Using Hidden Markov Models. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
|
| |
7
|
Eronen, A. Musical Instrument Recognition Using ICA-based Transform of Features and Discriminatively Trained HMMS. In Proceedings of the International Symposium on Signal Processing and Its Applications, 2003.
|
| |
8
|
|
| |
9
|
Haitsma, J., and Kalker, T. A Highly Robust Audio Fingerprinting System, In Proceedings of the International Conference on Music Information Retrieval, 2002.
|
| |
10
|
Herrera, P., Amatriain, X., Batlle, E., and Serra. X. Towards Instrument Segmentation for Music Content Description: A Critical Review of Instrument Classification Techniques. In Proceedings of the International Symposium on Music Information Retrieval, 2000.
|
| |
11
|
|
| |
12
|
ISO-IEC/JTC1 SC29 WG11 Moving Pictures Expert Group. Information technology - multimedia content description interface - part 4: Audio. Committee Draft 15938-4, ISO/IEC, 2000.
|
| |
13
|
Kim, Y. E., and Whitman, B. Singer Identification in Popular Music Recordings Using Voice Coding Features. In Proceedings of the International Conference on Music Information Retrieval, 2002.
|
 |
14
|
|
 |
15
|
|
| |
16
|
Liu, D., Lu, L., and Zhang, H. J. Automatic Mood Detection from Acoustic Music Data. In Proceedings of the International Conference on Music Information Retrieval, 2003.
|
| |
17
|
Martin, A., Doddington, G., Kamm, T., Ordowski, M., and Przybocki, M. The DET Curve in Assessment of Detection Task Performance. In Proceedings of the European Conference on Speech Communication and Technology, 1997.
|
 |
18
|
Rodger J. McNab , Lloyd A. Smith , Ian H. Witten , Clare L. Henderson , Sally Jo Cunningham, Towards the digital music library: tune retrieval from acoustic input, Proceedings of the first ACM international conference on Digital libraries, p.11-18, March 20-23, 1996, Bethesda, Maryland, United States
[doi> 10.1145/226931.226934]
|
| |
19
|
Oppenheim, A. V., and Schafer, R. W. Homomorphic Analysis of Speech. IEEE Transactions on Audio and Electroacoustics, 16, 1968, 221-226.
|
| |
20
|
Reynolds, D. A., and Rose, R. C. Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing, 3 (1), 1995, 72-83.
|
| |
21
|
Reynolds, D. A., Quatieri, T. F., and Dunn, R. B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 10, 2000, 19-41.
|
| |
22
|
|
| |
23
|
Solomonoff, A., Mielke, A., Schmidt, M., and Gish, H. Clustering Speakers by Their Voices. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1998.
|
| |
24
|
|
| |
25
|
Tsai, W. H., and Wang, H. M. A Query-by-example Framework to Retrieve Music Documents by Singer. In Proceedings of the IEEE International Conference on Multimedia and Expo, 2004.
|
| |
26
|
Tzanetakis, G., and Cook, P. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, 10 (5), 2002, 293-302.
|
| |
27
|
Tzanetakis, G., Gao, J., and Steenkiste, P. A Scalable Peer-to-Peer System for Music Content and Information Retrieval. In Proceedings of the International Conference on Music Information Retrieval, 2003.
|
| |
28
|
Venkatachalam, V., Cazzanti, L., Dhillon, N., and Wells, M. Automatic Identification of Sound Recordings. IEEE Signal Processing Magazine, March 2004, 92-99.
|
| |
29
|
Wang, C. K., Lyu, R. Y., and Chiang, Y. C. An Automatic Singing Transcription System with Multilingual Singing Lyric Recognizer and Robust Melody Tracker. In Proceedings of the European Conference on Speech Communication and Technology, 2003.
|
| |
30
|
Whitman, B., Flake, G., and Lawrence, S. Artist Detection in Music with Minnowmatch. In Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, 2001.
|
| |
31
|
Yang, D., and Lee, W. Disambiguating Music Emotion Using Software Agents. In Proceedings of the International Conference on Music Information Retrieval, 2004.
|
CITED BY
|
|
Xiaonan Lu , Prasenjit Mitra , James Z. Wang , C. Lee Giles, Automatic categorization of figures in scientific documents, Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, June 11-15, 2006, Chapel Hill, NC, USA
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.8
Database applications
Subjects:
Data mining
Additional Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.1
Content Analysis and Indexing
Subjects:
Indexing methods
H.3.3
Information Search and Retrieval
Subjects:
Retrieval models;
Clustering;
Relevance feedback
H.3.7
Digital Libraries
Subjects:
Systems issues
General Terms:
Algorithms,
Design,
Documentation,
Experimentation,
Legal Aspects,
Management,
Measurement,
Verification
Keywords:
music digital library,
music information retrieval,
query-by-example,
solo voice modeling,
vocal/non-vocal segmentation
|