|
ABSTRACT
This paper presents a novel beat detector that processes MPEG-1 Layer III (known as MP3) encoded audio bitstreams directly in the compressed domain. Most previous beat detection or tracking systems dealing with MIDI or PCM signals are not directly applicable to compressed audio bitstreams, such as MP3 bitstreams. We have developed the beat detector as a part of a beat-pattern based error concealment scheme for streaming music over error prone channels. Special effort was used to obtain a tailored trade-off between performance, complexity and memory consumption for this specific application. A comparison between the machine-detected results to the human annotation has shown that the proposed method correctly tracked beats in 4 out of 6 popular music test signals. The results were analyzed.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Patel, N.V., Sethi, I.K. "Audio Characterization for Video Indexing", Proc. SPIE Vol. 2670, Storage and Retrieval for Image and Video Databases IV, Jan/Feb 1996, San Jose, CA, USA, pp. 373-384.
|
| |
2
|
Nakajima, Y., Lu, Y., Sugano, M., Yoneyama, A., Yanagihara, H., Kurematsu, A. "A Fast Classification from MPEG Coded Data", Proc. of International Conference on Acoustic, Speech and Signal Processing (ICASSP), 1999, Phoenix, Arizona, USA, pp.3005 3008.
|
| |
3
|
|
| |
4
|
Pfeiffer, S., Robert-Ribes, J.; Kim, D. "Audio Content Extraction from MPEG-encoded sequences", First International Workshop on Intelligent Multimedia Computing and Networking (IMMCN2000), Feb./March 2000, Atlantic City, New Jersey, pp. 513- 516.
|
| |
5
|
Wang, Y., "A Beat-Pattern based Error Concealment Scheme for Music Delivery with Burst Packet Loss", accepted by IEEE International Conference on Multimedia and Expo (ICME2001), August, 2001, Tokyo, Japan.
|
| |
6
|
Wang, Y., Vilermo, M., Isherwood, D. "The Impact of the Relationship Between MDCT and DFT on Audio Compression: A Step Towards Solving the Mismatch", The First IEEE Pacific-Rim Conference on Multimedia (IEEE-PCM2000), December 13-15, 2000, Sydney, Australia, pp. 130- 138.
|
| |
7
|
Wang, Y., Ojanpera, J., Vilermo, M., Vaananen, M. "Schemes for Re-compressing MP3 Audio Bitstreams", accepted by the Audio Engineering Society (AES) 11 llh International Convention, September 2 l-24,200 I, New York, USA.
|
| |
8
|
ISO/IEC I I 172-3, 'Information Technology - Coding of moving pictures and associated audio for digital storage media at up to about I ,5 Mbit/s', 1993.
|
| |
9
|
Dannenberg, R.B., Mont-Reynaud, B., 'Following an improvisation in real time,' Proc. Int. Comp. Music Conf., 1987, pp.24 I-248.
|
| |
10
|
Desain, P., Honing, H., "Advanced issues in beat induction modcling: syncopation, tempo and timing," Proc. Int. Comp. Music Conf., 1994, pp. 92-94.
|
| |
11
|
|
| |
12
|
Scheirer, E.D., "Tempo and beat analysis of acoustic musical signals,' J. Acousti. Sot. Am., 1998, vol. 103, no. 1, pp. 588-601.
|
| |
13
|
|
| |
14
|
Todd, N.P.M., "The auditory 'primal sketch': A multiscale model of rhythmic grouping,' J. New Music Research, 1994, vol. 23, no. 1, pp.25-70.
|
| |
15
|
Smith, L.M., "A multi-resolution time-frequency analysis and interpretation of musical rhythm," PhD thesis, University of Western Australia, 1999.
|
| |
16
|
Dixon, S.E., "A beat tracking system for audio signal," Proc. Conf. Computat. And Mathemat. Methods in Music, Vienna, Austria, 1999, pp.lOl-l 10.
|
| |
17
|
Klapuri, A., "Sound onset detection by applying psychoacoustic knowledge," Proc. IEEE Int. Conf. Acoust., Speech, and Signal Proc., 1999, vol. 6, pp. 3089-3092.
|
CITED BY 6
|
|
Ye Wang , Ali Ahmaniemi , David Isherwood , Wendong Huang, Content-based UEP: a new scheme for packet loss recovery in music streaming, Proceedings of the eleventh ACM international conference on Multimedia, November 02-08, 2003, Berkeley, CA, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Uma Srinivasan , Silvia Pfeiffer , Surya Nepal , Michael Lee , Lifang Gu , Stephen Barrass, A Survey of MPEG-1 Audio, Video and Semantic Analysis Techniques, Multimedia Tools and Applications, v.27 n.1, p.105-141, September 2005
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.5
Sound and Music Computing
Subjects:
Signal analysis, synthesis, and processing
Additional Classification:
E.
Data
E.4
CODING AND INFORMATION THEORY
Subjects:
Data compaction and compression
I.
Computing Methodologies
I.5
PATTERN RECOGNITION
I.5.2
Design Methodology
Subjects:
Pattern analysis
General Terms:
Design,
Experimentation,
Performance
Keywords:
MP3,
MPEG audio,
beat detection,
beat tracking,
bitstream processing,
compressed domain processing,
error concealment
|