ACM Home Page
Please provide us with feedback. Feedback
Automatic and instant ring tone generation based on music structure analysis
Full text PdfPdf (730 KB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Short papers session 1: content analysis table of contents
Pages 593-596  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Tong Zhang  Hewlett-Packard Company, Palo Alto, CA, USA
Chee Keat Fong  Hewlett-Packard Company, Palo Alto, CA, USA
Linxing Xiao  Tsinghua University, Beijing, China
Jie Zhou  Tsinghua University, Beijing, China
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 7,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631364
What is a DOI?

ABSTRACT

Real tones, which are often excerpts from pop songs, have become popular as ring tones. This paper describes how a ring tone can be produced by analyzing the structure of music and selecting the most appropriate portion of the music. With audio feature analysis and pattern recognition methods, the structure of a song can be estimated by deploying both singing voice detection and repetition detection. Then, one or more ring tones can be automatically selected from the song according to heuristic rules. The entire process takes only a few seconds. It is greatly superior in efficiency and ease-of-use than currently available ring tone generation approaches, and can be used in handheld devices, desktop or laptop PCs and web services. Moreover, this unique music structure analysis technology we developed may be used in many other applications as well, such as for browsing, searching and shopping digital music.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
W. Chai and B. Vercoe, "Music thumbnailing via structural analysis," ACM Conf. on Multimedia, pp.223--226, 2003.
 
5
N. Maddage, C. Xu, M. Kankanhalli, et al. "Content-based music structure analysis with applications to music semantics understanding," Proc. of ACM Conference on Multimedia, pp.112--119, New York, Oct. 2004.
 
6
A. Berenzweig, D. Ellis, "Locating singing voice segments within music signals," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.119--122, 2001.
 
7
T. Nwe, Y. Wang, "Automatic detection of vocal segments in popular songs," International Conf. on Music Information Retrieval, 2004.
 
8
H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. on Acoustics, Speech and Signal Processing, vol.ASSP-26, no.1, pp.43--49, Feb. 1978.
 
9
M. A. Bartsch, G. H. Wakefield, "To catch a chorus: using chroma-based representations for audio thumbnailing," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp.15--18, Oct. 2001.
 
10
D. Ellis, G. Poliner, "Identifying cover songs with chroma features and dynamic programming beat tracking," IEEE Conf. on Acoustics, Speech and Signal Processing, 2007.
 
11
X. Liu, D. Wang, J. Li, ea al., "The feature and spatial covariant kernel: Adding implicit spatial constraints to histogram," ACM Conf. on Image and Video Retrieval, 2007.
 
12
T. Zhang and R. Samadani, "Automatic generation of music thumbnails," IEEE International Conference on Multimedia and Expo, pp.228--231, Beijing, July 2007.
 
13