|
ABSTRACT
This paper proposes a novel framework for music content indexing and retrieval. The music structure information, i.e., timing, harmony and music region content, is represented by the layers of the music structure pyramid. We begin by extracting this layered structure information. We analyze the rhythm of the music and then segment the signal proportional to the inter-beat intervals. Thus, the timing information is incorporated in the segmentation process, which we call Beat Space Segmentation. To describe Harmony Events, we propose a two-layer hierarchical approach to model the music chords. We also model the progression of instrumental and vocal content as Acoustic Events. After information extraction, we propose a vector space modeling approach which uses these events as the indexing terms. In query-by-example music retrieval, a query is represented by a vector of the statistics of the n-gram events. We then propose two effective retrieval models, a hard-indexing scheme and a soft-indexing scheme. Experiments show that the vector space modeling is effective in representing the layered music information, achieving 82.5% top-5 retrieval accuracy using 15-sec music clips as the queries. The soft-indexing outperforms hard-indexing in general.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Cavnar, W.B., and Trenkle, J.M. N-Gram-Based Text Categorization. In Proc. of 3rd Annual Symposium on Document Analysis and Information Retrieval, 1994.
|
| |
3
|
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
| |
7
|
Duxburg. C, Sandler. M., and Davies. M. A Hybrid Approach to Musical Note Onset Detection. In Proc. Int. Conf. DAFx. Hamburg, Germany, Sept, 2002.
|
 |
8
|
|
| |
9
|
Fujishima, T. Real Time Chord Recognition of Musical Sound: A System Using Lisp Music. In Proc. ICMC, Oct. 1999.
|
 |
10
|
Asif Ghias , Jonathan Logan , David Chamberlin , Brian C. Smith, Query by humming: musical information retrieval in an audio database, Proceedings of the third ACM international conference on Multimedia, p.231-236, November 05-09, 1995, San Francisco, California, United States
[doi> 10.1145/217279.215273]
|
| |
11
|
Goldstein, J. L. An Optimum Processor Theory for the Central Formation of the Pitch of Complex Tones. In JASA, Vol. 54, 1973.
|
| |
12
|
Kageyama, T., Mochizuki, K., and Takashima, Y. Melody Retrieval with Humming. In Proc. ICMC, Sept, 1993.
|
| |
13
|
Lemström, K., and Laine, P. Music Information Retrieval using Musical Parameters. In Proc. of the ICMC, Oct, 1998.
|
 |
14
|
|
 |
15
|
Namunu C. Maddage , Changsheng Xu , Mohan S. Kankanhalli , Xi Shao, Content-based music structure analysis with applications to music semantics understanding, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027549]
|
 |
16
|
Rodger J. McNab , Lloyd A. Smith , Ian H. Witten , Clare L. Henderson , Sally Jo Cunningham, Towards the digital music library: tune retrieval from acoustic input, Proceedings of the first ACM international conference on Digital libraries, p.11-18, March 20-23, 1996, Bethesda, Maryland, United States
[doi> 10.1145/226931.226934]
|
 |
17
|
|
| |
18
|
Pickens, J. A Survey of Feature Selection Techniques for Music Information Retrieval. Technical report, Center of Intelligent Information Retrieval, Dept. of Computer Science, University of Massachusetts, 2001.
|
| |
19
|
Pickens, J. and Iliopoulos, C. Markov Random Fields and Maximum Entropy Modeling for Music Information Retrieval. In Proc. of ISMIR, Sept, 2005.
|
| |
20
|
Salton, G. The SMART retrieval system. Prentice-Hall, Englewood Cliffs, NJ, 1971.
|
| |
21
|
Shih, H.-H., Narayanan, S. S., and Kuo, C.-C. J. An HMM-Based Approach to Humming Transcription. In Proc. of ICME, Aug, 2002.
|
| |
22
|
Song, J., Bae, S. Y., and Yoon, K. Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System. In Proc. of ISMIR, Oct, 2002.
|
| |
23
|
Terhardt, E. Pitch, Consonance and Harmony. In JASA, Vol. 55, No. 5, 1974.
|
| |
24
|
Typke, R., Wiering, F., and Veltkamp, R. A Survey of Music Information Retrieval Systems. In Proc. of the ISMIR, Sept. 2005.
|
| |
25
|
|
| |
26
|
Ward, W. Subjective Music Pitch. In JASA, Vol. 26, 195
|
|