ACM Home Page
Please provide us with feedback. Feedback
Large data methods for multimedia
Full text PdfPdf (85 KB)
Source
International Multimedia Conference archive
Proceedings of the 15th international conference on Multimedia table of contents
Augsburg, Germany
TUTORIAL SESSION: Tutorials table of contents
Pages: 6 - 7  
Year of Publication: 2007
ISBN:978-1-59593-702-5
Authors
Michael A. Casey  University of London, London, United Kingdom
Frank Kurth  University of Bonn, Bonn, Germany
Sponsors
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 101,   Citation Count: 0
Additional Information:

abstract   references   index terms   review   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1291233.1291238
What is a DOI?

ABSTRACT

This tutorial describes techniques essential for searching the large multimedia databases that are now common on the Internet. There are up to 10 million songs in commercial music catalogues and over 300 million images stored in online photo services such as Flickr. How can we find the music, videos or images we want? How can we organize such large collections: find duplicates, create links between similar documents, extract and annotate semantic structures from complex audiovisual documents? Conventional methods for handling large data sets, such as hashing, get us part of the way, but those methods may not straightforwardly be used for similarity-based matching and retrieval in audiovisual document collections. On the other hand, several elaborate methods from multimedia retrieval are available for semantic document analysis. Unfortunately, those methods generally do not scale for large data sets. Instead, new classes of algorithms combining the best of the two worlds of large data methods and semantic analysis are needed to handle large multimedia databases. Innovative methods such as locality sensitive hashing, which are based on randomized probes, are the new workhorses. This tutorial covers methods for multimedia retrieval on large document collections. Starting with audio retrieval, we describe both the theory (i.e., randomized algorithms for hashing) and the implementation details (how do you store hash values for millions of songs?). A special focus is on how to combine large data methods with semantically meaningful descriptors in order to facilitate efficient similarity-based retrieval. Besides audio, the tutorial also covers image, 3d motion and video retrieval.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Allamanche, J. Herre, B. Fröba, and M. Cremer. AudioID: Towards Content-Based Identification of Audio Material. In Proc. 110th AES Convention, Amsterdam, NL, 2001.
 
2
 
3
M. Casey and M. Slaney. The importance of sequences for music similarity. In Proc. ICASSP, 2006.
 
4
M. Casey and M. Slaney. Song intersection by approximate nearest neighbours. In Proc. ISMIR, 2006.
 
5
M. Casey and M. Slaney. Fast recognition of remixed music audio. In Proc. ICASSP, 2007.
 
6
M. Clausen, R. Engelbrecht, D. Meyer, and J. Schmitz. Proms: A web-based tool for searching in polyphonic music. In Proc. ISMIR, 2000.
 
7
M. Clausen, H. Kࣆrner, and F. Kurth. An Efficient Indexing and Search Technique for Multimedia Databases. In SIGIR Workshop on Multimedia Retrieval, Toronto, Canada, 2003.
 
8
M. Clausen and F. Kurth. Content-Based Information Retrieval by Group Theoretical Methods. In NATO (ASI), Lucca, Italy, 2003.
 
9
M. Clausen and F. Kurth. A unified approach to content-based and fault tolerant music recognition. IEEE Transactions on Multimedia, 6(5):717--731, October 2004.
10
 
11
J. Haitsma and T. Kalker. A Highly Robust Audio Fingerprinting System. In Proc. ISMIR, 2002.
 
12
J. Herre, E. Allamanche, O. Hellmuth, and T. Kastner. Robust identification/fingerprinting of audio signals using spectral flatness features. In Journal of the Acoustical Society of America, volume 111, 2002.
 
13
F. Kurth, M. Müller, D. Damm, C. Fremerey, A. Ribbrock, and M. Clausen. Syncplayer - an advanced system for content-based audio access. In Proc. ISMIR, London, GB, 2005.
 
14
M. Müller, F. Kurth, and M. Clausen. Audio matching via chroma-based statistical features. In Proc. ISMIR, London, GB, 2005.
15
 
16
T. Röder, F. Kurth, A. Mosig, and M. Clausen. A Group Theoretical Approach to Content-based Image Retrieval. NATO Advanced Study Institute (ASI), Lucca, Italy, 2003. Available via email to the authors.
17
 
18
A. Wang. An industrial strength audio search algorithm. In Proc. ISMIR, 2003.


REVIEW

"Simon Berkovich : Reviewer"

An extended abstract of a tutorial that describes techniques for searching audio-visual documents, this paper consists of two parts: hashing, so-called locality sensitive hashing (LSH), and indexing with algebraic structures, especially for music   more...

Collaborative Colleagues:
Michael A. Casey: colleagues
Frank Kurth: colleagues