ACM Home Page
Please provide us with feedback. Feedback
Combination of audio and lyrics features for genre classification in digital audio collections
Full text PdfPdf (521 KB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C5: multimedia content analysis and applications table of contents
Pages 159-168  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Rudolf Mayer  Vienna University of Technology, Vienna, Austria
Robert Neumayer  Vienna University of Technology, Vienna, Austria, and Norwegian University of Science and Technology, Trondheim, Norway
Andreas Rauber  Vienna University of Technology, Vienna, Austria
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 22,   Downloads (12 Months): 147,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459382
What is a DOI?

ABSTRACT

In many areas multimedia technology has made its way into mainstream. In the case of digital audio this is manifested in numerous online music stores having turned into profitable businesses. The widespread user adaption of digital audio both on home computers and mobile players show the size of this market. Thus, ways to automatically process and handle the growing size of private and commercial collections become increasingly important; along goes a need to make music interpretable by computers. The most obvious representation of audio files is their sound - there are, however, more ways of describing a song, for instance its lyrics, which describe songs in terms of content words. Lyrics of music may be orthogonal to its sound, and differ greatly from other texts regarding their (rhyme) structure. Consequently, the exploitation of these properties has potential for typical music information retrieval tasks such as musical genre classification; so far, there is a lack of means to efficiently combine these modalities. In this paper, we present findings from investigating advanced lyrics features such as the frequency of certain rhyme patterns, several parts-of-speech features, and statistic features such as words per minute (WPM). We further analyse in how far a combination of these features with existing acoustic feature sets can be exploited for genre classification and provide experiments on two test collections.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Baumann, T. Pohle, and S. Vembu. Towards a socio-cultural compatibility of mir systems. In Proceedings of the 5th International Conference of Music Information Retrieval (ISMIR'04), pages 460--465, Barcelona, Spain, October 10-14 2004.
 
2
E. Brochu, N. de Freitas, and K. Bao. The sound of an album cover: Probabilistic multimedia and IR. In C. M. Bishop and B. J. Frey, editors, Proceedings of the 9th International Workshop on Artificial Intelligence and Statistics, Key West, FL, USA, January 3-6 2003.
 
3
W. B. Cavnar and J. M. Trenkle. N-gram-based text categorization. In Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval (SDAIR'94), pages 161--175, Las Vegas, USA, 1994.
 
4
J. Downie. Annual Review of Information Science and Technology, volume 37, chapter Music Information Retrieval, pages 295--340. Information Today, Medford, NJ, 2003.
 
5
6
7
 
8
P. Knees, M. Schedl, and G. Widmer. Multiple lyrics alignment: Automatic retrieval of song lyrics. In Proceedings of 6th International Conference on Music Information Retrieval (ISMIR'05), pages 564--569, London, UK, September 11-15 2005.
 
9
T. Lidy and A. Rauber. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), pages 34--41, London, UK, September 11-15 2005.
 
10
B. Logan, A. Kositsky, and P. Moreno. Semantic analysis of song lyrics. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'04), pages 827--830, Taipei, Taiwan, June 27-30 2004.
11
 
12
R. Mayer, R. Neumayer, and A. Rauber. Rhyme and style features for musical genre classification by song lyrics. In Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR'08), Philadelphia, PA, USA, September 14-18 2008. Accepted for publication.
 
13
R. Neumayer and A. Rauber. Integration of text and audio features for genre classification in music information retrieval. In Proceedings of the 29th European Conference on Information Retrieval (ECIR'07), pages 724--727, Rome, Italy, April 2-5 2007.
 
14
R. Neumayer and A. Rauber. Multi-modal music information retrieval - visualisation and evaluation of clusterings by both audio and lyrics. In Proceedings of the 8th Conference Recherche d'Information Assistée par Ordinateur (RIAO'07), Pittsburgh, PA, USA, May 29th - June 1 2007.
 
15
 
16
E. Pampalk, A. Flexer, and G. Widmer. Hierarchical organization and description of music collections at the artist level. In Research and Advanced Technology for Digital Libraries ECDL'05, pages 37--48, 2005.
17
 
18
A. Rauber, E. Pampalk, and D. Merkl. Using psycho-acoustic models and self-organizing maps to create a hierarchical structuring of music by musical styles. In Proceedings of the 3rd International Symposium on Music Information Retrieval (ISMIR'02), pages 71--80, Paris, France, October 13-17 2002.
 
19
 
20
 
21
G. Tzanetakis and P. Cook. Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing, 10(5):293--302, July 2002.
22
 
23

Collaborative Colleagues:
Rudolf Mayer: colleagues
Robert Neumayer: colleagues
Andreas Rauber: colleagues