| Automatic and instant ring tone generation based on music structure analysis |
| Full text |
Pdf
(730 KB)
|
Source
|
International Multimedia Conference
archive
Proceedings of the seventeen ACM international conference on Multimedia
table of contents
Beijing, China
SESSION: Short papers session 1: content analysis
table of contents
Pages 593-596
Year of Publication: 2009
ISBN:978-1-60558-608-3
|
|
Authors
|
|
Tong Zhang
|
Hewlett-Packard Company, Palo Alto, CA, USA
|
|
Chee Keat Fong
|
Hewlett-Packard Company, Palo Alto, CA, USA
|
|
Linxing Xiao
|
Tsinghua University, Beijing, China
|
|
Jie Zhou
|
Tsinghua University, Beijing, China
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 7, Downloads (12 Months): 7, Citation Count: 0
|
|
|
ABSTRACT
Real tones, which are often excerpts from pop songs, have become popular as ring tones. This paper describes how a ring tone can be produced by analyzing the structure of music and selecting the most appropriate portion of the music. With audio feature analysis and pattern recognition methods, the structure of a song can be estimated by deploying both singing voice detection and repetition detection. Then, one or more ring tones can be automatically selected from the song according to heuristic rules. The entire process takes only a few seconds. It is greatly superior in efficiency and ease-of-use than currently available ring tone generation approaches, and can be used in handheld devices, desktop or laptop PCs and web services. Moreover, this unique music structure analysis technology we developed may be used in many other applications as well, such as for browsing, searching and shopping digital music.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
W. Chai and B. Vercoe, "Music thumbnailing via structural analysis," ACM Conf. on Multimedia, pp.223--226, 2003.
|
| |
5
|
N. Maddage, C. Xu, M. Kankanhalli, et al. "Content-based music structure analysis with applications to music semantics understanding," Proc. of ACM Conference on Multimedia, pp.112--119, New York, Oct. 2004.
|
| |
6
|
A. Berenzweig, D. Ellis, "Locating singing voice segments within music signals," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.119--122, 2001.
|
| |
7
|
T. Nwe, Y. Wang, "Automatic detection of vocal segments in popular songs," International Conf. on Music Information Retrieval, 2004.
|
| |
8
|
H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. on Acoustics, Speech and Signal Processing, vol.ASSP-26, no.1, pp.43--49, Feb. 1978.
|
| |
9
|
M. A. Bartsch, G. H. Wakefield, "To catch a chorus: using chroma-based representations for audio thumbnailing," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp.15--18, Oct. 2001.
|
| |
10
|
D. Ellis, G. Poliner, "Identifying cover songs with chroma features and dynamic programming beat tracking," IEEE Conf. on Acoustics, Speech and Signal Processing, 2007.
|
| |
11
|
X. Liu, D. Wang, J. Li, ea al., "The feature and spatial covariant kernel: Adding implicit spatial constraints to histogram," ACM Conf. on Image and Video Retrieval, 2007.
|
| |
12
|
T. Zhang and R. Samadani, "Automatic generation of music thumbnails," IEEE International Conference on Multimedia and Expo, pp.228--231, Beijing, July 2007.
|
| |
13
|
|
|