ACM Home Page
Please provide us with feedback. Feedback
A compressed domain beat detector using MP3 audio bitstreams
Full text PdfPdf (260 KB)
Source International Multimedia Conference; Vol. 9 archive
Proceedings of the ninth ACM international conference on Multimedia table of contents
Ottawa, Canada
Session: Audio Processing table of contents
Pages: 194 - 202  
Year of Publication: 2001
ISBN:1-58113-394-4
Authors
Ye Wang  Nokia Research Center, Tampere, Finland
Miikka Vilermo  Nokia Research Center, Tampere, Finland
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 63,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/500141.500172
What is a DOI?

ABSTRACT

This paper presents a novel beat detector that processes MPEG-1 Layer III (known as MP3) encoded audio bitstreams directly in the compressed domain. Most previous beat detection or tracking systems dealing with MIDI or PCM signals are not directly applicable to compressed audio bitstreams, such as MP3 bitstreams. We have developed the beat detector as a part of a beat-pattern based error concealment scheme for streaming music over error prone channels. Special effort was used to obtain a tailored trade-off between performance, complexity and memory consumption for this specific application. A comparison between the machine-detected results to the human annotation has shown that the proposed method correctly tracked beats in 4 out of 6 popular music test signals. The results were analyzed.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Patel, N.V., Sethi, I.K. "Audio Characterization for Video Indexing", Proc. SPIE Vol. 2670, Storage and Retrieval for Image and Video Databases IV, Jan/Feb 1996, San Jose, CA, USA, pp. 373-384.
 
2
Nakajima, Y., Lu, Y., Sugano, M., Yoneyama, A., Yanagihara, H., Kurematsu, A. "A Fast Classification from MPEG Coded Data", Proc. of International Conference on Acoustic, Speech and Signal Processing (ICASSP), 1999, Phoenix, Arizona, USA, pp.3005 3008.
 
3
 
4
Pfeiffer, S., Robert-Ribes, J.; Kim, D. "Audio Content Extraction from MPEG-encoded sequences", First International Workshop on Intelligent Multimedia Computing and Networking (IMMCN2000), Feb./March 2000, Atlantic City, New Jersey, pp. 513- 516.
 
5
Wang, Y., "A Beat-Pattern based Error Concealment Scheme for Music Delivery with Burst Packet Loss", accepted by IEEE International Conference on Multimedia and Expo (ICME2001), August, 2001, Tokyo, Japan.
 
6
Wang, Y., Vilermo, M., Isherwood, D. "The Impact of the Relationship Between MDCT and DFT on Audio Compression: A Step Towards Solving the Mismatch", The First IEEE Pacific-Rim Conference on Multimedia (IEEE-PCM2000), December 13-15, 2000, Sydney, Australia, pp. 130- 138.
 
7
Wang, Y., Ojanpera, J., Vilermo, M., Vaananen, M. "Schemes for Re-compressing MP3 Audio Bitstreams", accepted by the Audio Engineering Society (AES) 11 llh International Convention, September 2 l-24,200 I, New York, USA.
 
8
ISO/IEC I I 172-3, 'Information Technology - Coding of moving pictures and associated audio for digital storage media at up to about I ,5 Mbit/s', 1993.
 
9
Dannenberg, R.B., Mont-Reynaud, B., 'Following an improvisation in real time,' Proc. Int. Comp. Music Conf., 1987, pp.24 I-248.
 
10
Desain, P., Honing, H., "Advanced issues in beat induction modcling: syncopation, tempo and timing," Proc. Int. Comp. Music Conf., 1994, pp. 92-94.
 
11
 
12
Scheirer, E.D., "Tempo and beat analysis of acoustic musical signals,' J. Acousti. Sot. Am., 1998, vol. 103, no. 1, pp. 588-601.
 
13
 
14
Todd, N.P.M., "The auditory 'primal sketch': A multiscale model of rhythmic grouping,' J. New Music Research, 1994, vol. 23, no. 1, pp.25-70.
 
15
Smith, L.M., "A multi-resolution time-frequency analysis and interpretation of musical rhythm," PhD thesis, University of Western Australia, 1999.
 
16
Dixon, S.E., "A beat tracking system for audio signal," Proc. Conf. Computat. And Mathemat. Methods in Music, Vienna, Austria, 1999, pp.lOl-l 10.
 
17
Klapuri, A., "Sound onset detection by applying psychoacoustic knowledge," Proc. IEEE Int. Conf. Acoust., Speech, and Signal Proc., 1999, vol. 6, pp. 3089-3092.