ACM Home Page
Please provide us with feedback. Feedback
Visualizing music and audio using self-similarity
Full text PdfPdf (1.27 MB)
Source International Multimedia Conference archive
Proceedings of the seventh ACM international conference on Multimedia (Part 1) table of contents
Orlando, Florida, United States
Pages: 77 - 80  
Year of Publication: 1999
ISBN:1-58113-151-8
Author
Jonathan Foote  FX Palo Alto Laboratory, Inc., 3400 Hillview Ave., Building 4, Palo Alto, CA
Sponsors
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGCOMM: ACM Special Interest Group on Data Communication
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 120,   Citation Count: 17
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319463.319472
What is a DOI?

ABSTRACT

This paper presents a novel approach to visualizing the time structure of music and audio. The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics. Examples are presented for classical and popular music. Applications include content-based analysis and segmentation, as well as tempo and structure extraction.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Potter Ralph K., George A. Kopp, Harriet C. Green, Visible Speech, D. Van Nostrand Co., NY, 1947
 
2
Koenig, Walter K., H.K. Dunn, L.Y. Lacey, "The Sound Spectrograph," in JASA, Vol. 18, p. 19~49.
 
3
 
4
 
5
Foote, Jonathan. "Content-Based Retrieval of Music and Audio," in C.-C. J. Kuo et al., editor, Multimedia Storage and Archiving Systems II, Proc. of SPIE, Vol. 3229, pp. 138-147, 1997.
 
6
Carey, M. J., et al., "A Comparision of Features for Speech and Music Discrimination," in Proc. ICASSP '99, vol. 1, pp. 149-152, IEEE, Phoenix AZ 1999
 
7
Johnson, P., "sci.skeptic FAQ," Section 0.6.2, http:// www. faqs.org/faqs/skeptic-faq/1, 1999

CITED BY  17
 
 
 
 
 
 
 


Peer to Peer - Readers of this Article have also read: