ACM Home Page
Please provide us with feedback. Feedback
Searching musical audio datasets by a batch of multi-variant tracks
Full text PdfPdf (410 KB)
Source
International Multimedia Conference archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval table of contents
Vancouver, British Columbia, Canada
SESSION: Audio retrieval table of contents
Pages 121-127  
Year of Publication: 2008
ISBN:978-1-60558-312-9
Authors
Yi Yu  Nara Women's University, Nara, Japan
J. Stephen Downie  University of Illinois at Urbana-Champaign, Champaign, IL, USA
Lei Chen  Hong Kong University of Science and Technology, Hong Kong, China
Vincent Oria  New Jersey Institute of Technology, Newark, NJ, USA
Kazuki Joe  Nara Women's University, Nara, Japan
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 95,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1460096.1460117
What is a DOI?

ABSTRACT

Multi-variant music tracks are those audio tracks of a particular song which are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people like to upload music recordings onto such music social sites to share their own home-produced albums and participate in Internet singing contests. Therefore it is very important to explore a computer-assisted evaluation tool to detect these audio-based multi-variant tracks. In this paper we investigate such a task: the original track of a song is embedded in datasets, with a batch of multi-variant audio tracks of this song as input, our retrieval system returns an ordered list by similarity and indicates the position of relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate the semantic summarization of audio feature sequences. The conjunction of federated features with three typical similarity searching schemes, K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidian LSH (E2LSH), is evaluated. From these findings, a computer-assisted evaluation tool for searching multi-variant audio tracks was developed to search over large musical audio datasets.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
J. S. Downie. The Music Information Retrieval Evaluation eXchange (MIREX). In D-Lib Magazine 12, 2006. http://dlib.org/dlib/december06/downie/12downie.html.
 
2
J. P. Bello. Audio-based Cover Song Retrieval Using Approximate Chord Sequences: Testing Shifts, Gaps, Swaps and Beats. ISMIR'07, pp.239--244, 2007.
 
3
D. Ellis and G. Poliner. Identifying cover songs with chroma features and dynamic programming beat tracking. ICASSP'07, 2007.
 
4
Y. Yu, K. Joe, and J. S. Downie. Efficient Query-by- Content Audio Retrieval by Locality Sensitive Hashing and Partial Sequence Comparison. IEICE Transaction on Information and System, Vol.E91-D, No.6, pp. 1730--1739, 2008.
 
5
 
6
Y. Yu, M. Takata, and K. Joe. Index-Based Similarity Searching with Partial Sequence Comparison for Query-by-Content Audio Retrieval. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.76--86, 2006.
7
8
9
 
10
T. Pohle, M. Schedl, P. Knees, and G. Widmer. Automatically Adapting the Structure of Audio Similarity Spaces. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp. 66--75, 2006.
 
11
LSH Algorithm and Implementation (E2LSH) http://web.mit.edu/andoni/www/LSH/index.html.
 
12
P. Indyk and N. Thaper. Fast color image retrieval via embeddings. Workshop on Statistical and Computational Theories of Vision (ICCV), 2003.
 
13
S. Y. Hu. Efficient Video Retrieval by Locality Sensitive Hashing. ICASSP'05, pp.449--452, 2005.
 
14
J. Reiss, J. J. Aucouturier, and M. Sandler. Efficient multi dimensional searching routines for music information retrieval. ISMIR'01, 2001.
 
15
 
16
M. Casey and M. Slaney. Song Intersection by Approximate Nearest Neighbor Search. ISMIR'06, pp. 144--149, 2006.
 
17
M. Lesaffre and M. Leman. Using Fuzzy to Handle Semantic Descriptions of Music in a Content-based Retrieval System. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.43--5, 2006.
 
18
G. Tzanetakis and P. Cook. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, Vol.10, No.5, pp. 293--302, 2002.
 
19
R. Miotto and N. Orio. A Methodology for the Segmentation and Identification of Music Works. ISMIR'07, pp.239--244, 2007.
 
20

Collaborative Colleagues:
Yi Yu: colleagues
J. Stephen Downie: colleagues
Lei Chen: colleagues
Vincent Oria: colleagues
Kazuki Joe: colleagues