Identifying audio clips with RARE
Source: International Multimedia Conference
Proceedings of the eleventh ACM international conference on Multimedia
Berkeley, CA, USA
DEMONSTRATION SESSION: Demonstration session 2
Pages: 444-445
Year of Publication: 2003
ISBN: 1-58113-722-2
Authors
Chris J. C. Burges, Microsoft Research, Redmond, WA
John C. Platt, Microsoft Research, Redmond, WA
Jonathan Goldstein, Microsoft Research, Redmond, WA
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM, New York, NY, USA
DOI: http://doi.acm.org/10.1145/957013.957104

ABSTRACT

In this paper, we describe RARE (Robust Audio Recognition Engine), a system for identifying audio streams and files. RARE can be used in a variety of applications, from enhancing the consumer listening experience to cleaning large audio databases. RARE was designed with two key qualities in mind: robustness to distortion of the audio, and lookup speed. RARE identifies audio clips in a stream against a database of a quarter of a million songs in real time, using approximately 10% CPU on an 850 MHz P3, with a measured false positive rate of 1.5 × 10^-8 per clip, per database entry, at a false negative rate of 0.2% per clip. We demonstrate RARE in real time on a stream and on distorted files.
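
To put the quoted error rates in context, the short Python sketch below scales the per-entry false positive rate up to the full database. The rates and the database size come from the abstract; the function names are hypothetical, and the calculation assumes that false positives against different database entries are independent, which the abstract does not state. It is a back-of-the-envelope illustration, not part of RARE.

```python
import math

# Figures quoted in the abstract.
FP_RATE_PER_CLIP_PER_ENTRY = 1.5e-8   # false positive rate per clip, per database entry
FN_RATE_PER_CLIP = 0.002              # false negative rate per clip (0.2%)
DATABASE_SIZE = 250_000               # a quarter of a million songs


def expected_false_positives_per_clip(rate: float, entries: int) -> float:
    """Expected number of false matches for one clip over the whole database."""
    return rate * entries


def prob_any_false_positive(rate: float, entries: int) -> float:
    """Probability that one clip falsely matches at least one entry.

    ASSUMPTION (not from the source): false matches against different
    entries are independent, so the probability is 1 - (1 - rate)**entries.
    log1p/expm1 keep the result accurate for very small rates.
    """
    return -math.expm1(entries * math.log1p(-rate))


if __name__ == "__main__":
    expected = expected_false_positives_per_clip(
        FP_RATE_PER_CLIP_PER_ENTRY, DATABASE_SIZE
    )
    p_any = prob_any_false_positive(FP_RATE_PER_CLIP_PER_ENTRY, DATABASE_SIZE)
    print(f"Expected false positives per clip: {expected:.6f}")
    print(f"P(at least one false positive per clip): {p_any:.6f}")
    print(f"False negative rate per clip: {FN_RATE_PER_CLIP:.3%}")
```

Under the independence assumption, the two per-clip figures are nearly identical because the per-entry rate is so small. Both land in the same range as the quoted 0.2% false negative rate.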


