ACM Home Page
Please provide us with feedback. Feedback
Music structure based vector space retrieval
Full text PdfPdf (1.10 MB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Seattle, Washington, USA
SESSION: Speech and music table of contents
Pages: 67 - 74  
Year of Publication: 2006
ISBN:1-59593-369-7
Authors
Namunu C. Maddage  Institute for Infocomm Research (I2R), Singapore
Haizhou Li  Institute for Infocomm Research (I2R), Singapore
Mohan S. Kankanhalli  National University of Singapore
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 167,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1148170.1148185
What is a DOI?

ABSTRACT

This paper proposes a novel framework for music content indexing and retrieval. The music structure information, i.e., timing, harmony and music region content, is represented by the layers of the music structure pyramid. We begin by extracting this layered structure information. We analyze the rhythm of the music and then segment the signal proportional to the inter-beat intervals. Thus, the timing information is incorporated in the segmentation process, which we call Beat Space Segmentation. To describe Harmony Events, we propose a two-layer hierarchical approach to model the music chords. We also model the progression of instrumental and vocal content as Acoustic Events. After information extraction, we propose a vector space modeling approach which uses these events as the indexing terms. In query-by-example music retrieval, a query is represented by a vector of the statistics of the n-gram events. We then propose two effective retrieval models, a hard-indexing scheme and a soft-indexing scheme. Experiments show that the vector space modeling is effective in representing the layered music information, achieving 82.5% top-5 retrieval accuracy using 15-sec music clips as the queries. The soft-indexing outperforms hard-indexing in general.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Cavnar, W.B., and Trenkle, J.M. N-Gram-Based Text Categorization. In Proc. of 3rd Annual Symposium on Document Analysis and Information Retrieval, 1994.
 
3
 
4
 
5
6
 
7
Duxburg. C, Sandler. M., and Davies. M. A Hybrid Approach to Musical Note Onset Detection. In Proc. Int. Conf. DAFx. Hamburg, Germany, Sept, 2002.
8
 
9
Fujishima, T. Real Time Chord Recognition of Musical Sound: A System Using Lisp Music. In Proc. ICMC, Oct. 1999.
10
 
11
Goldstein, J. L. An Optimum Processor Theory for the Central Formation of the Pitch of Complex Tones. In JASA, Vol. 54, 1973.
 
12
Kageyama, T., Mochizuki, K., and Takashima, Y. Melody Retrieval with Humming. In Proc. ICMC, Sept, 1993.
 
13
Lemström, K., and Laine, P. Music Information Retrieval using Musical Parameters. In Proc. of the ICMC, Oct, 1998.
14
15
16
17
 
18
Pickens, J. A Survey of Feature Selection Techniques for Music Information Retrieval. Technical report, Center of Intelligent Information Retrieval, Dept. of Computer Science, University of Massachusetts, 2001.
 
19
Pickens, J. and Iliopoulos, C. Markov Random Fields and Maximum Entropy Modeling for Music Information Retrieval. In Proc. of ISMIR, Sept, 2005.
 
20
Salton, G. The SMART retrieval system. Prentice-Hall, Englewood Cliffs, NJ, 1971.
 
21
Shih, H.-H., Narayanan, S. S., and Kuo, C.-C. J. An HMM-Based Approach to Humming Transcription. In Proc. of ICME, Aug, 2002.
 
22
Song, J., Bae, S. Y., and Yoon, K. Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System. In Proc. of ISMIR, Oct, 2002.
 
23
Terhardt, E. Pitch, Consonance and Harmony. In JASA, Vol. 55, No. 5, 1974.
 
24
Typke, R., Wiering, F., and Veltkamp, R. A Survey of Music Information Retrieval Systems. In Proc. of the ISMIR, Sept. 2005.
 
25
 
26
Ward, W. Subjective Music Pitch. In JASA, Vol. 26, 195


Collaborative Colleagues:
Namunu C. Maddage: colleagues
Haizhou Li: colleagues
Mohan S. Kankanhalli: colleagues