| Searching musical audio datasets by a batch of multi-variant tracks |
| Full text |
Pdf
(410 KB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval
table of contents
Vancouver, British Columbia, Canada
SESSION: Audio retrieval
table of contents
Pages 121-127
Year of Publication: 2008
ISBN:978-1-60558-312-9
|
|
Authors
|
|
Yi Yu
|
Nara Women's University, Nara, Japan
|
|
J. Stephen Downie
|
University of Illinois at Urbana-Champaign, Champaign, IL, USA
|
|
Lei Chen
|
Hong Kong University of Science and Technology, Hong Kong, China
|
|
Vincent Oria
|
New Jersey Institute of Technology, Newark, NJ, USA
|
|
Kazuki Joe
|
Nara Women's University, Nara, Japan
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 95, Citation Count: 0
|
|
|
ABSTRACT
Multi-variant music tracks are those audio tracks of a particular song which are sung and recorded by different people (i.e., cover songs). As music social clubs grow on the Internet, more and more people like to upload music recordings onto such music social sites to share their own home-produced albums and participate in Internet singing contests. Therefore it is very important to explore a computer-assisted evaluation tool to detect these audio-based multi-variant tracks. In this paper we investigate such a task: the original track of a song is embedded in datasets, with a batch of multi-variant audio tracks of this song as input, our retrieval system returns an ordered list by similarity and indicates the position of relevant audio track. To help process multi-variant audio tracks, we suggest a semantic indexing framework and propose the Federated Features (FF) scheme to generate the semantic summarization of audio feature sequences. The conjunction of federated features with three typical similarity searching schemes, K-Nearest Neighbor (KNN), Locality Sensitive Hashing (LSH), and Exact Euclidian LSH (E2LSH), is evaluated. From these findings, a computer-assisted evaluation tool for searching multi-variant audio tracks was developed to search over large musical audio datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. S. Downie. The Music Information Retrieval Evaluation eXchange (MIREX). In D-Lib Magazine 12, 2006. http://dlib.org/dlib/december06/downie/12downie.html.
|
| |
2
|
J. P. Bello. Audio-based Cover Song Retrieval Using Approximate Chord Sequences: Testing Shifts, Gaps, Swaps and Beats. ISMIR'07, pp.239--244, 2007.
|
| |
3
|
D. Ellis and G. Poliner. Identifying cover songs with chroma features and dynamic programming beat tracking. ICASSP'07, 2007.
|
| |
4
|
Y. Yu, K. Joe, and J. S. Downie. Efficient Query-by- Content Audio Retrieval by Locality Sensitive Hashing and Partial Sequence Comparison. IEICE Transaction on Information and System, Vol.E91-D, No.6, pp. 1730--1739, 2008.
|
| |
5
|
|
| |
6
|
Y. Yu, M. Takata, and K. Joe. Index-Based Similarity Searching with Partial Sequence Comparison for Query-by-Content Audio Retrieval. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.76--86, 2006.
|
 |
7
|
|
 |
8
|
|
 |
9
|
Bin Cui , Jialie Shen , Gao Cong , Heng Tao Shen , Cui Yu, Exploring composite acoustic features for efficient music similarity query, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
[doi> 10.1145/1180639.1180725]
|
| |
10
|
T. Pohle, M. Schedl, P. Knees, and G. Widmer. Automatically Adapting the Structure of Audio Similarity Spaces. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp. 66--75, 2006.
|
| |
11
|
LSH Algorithm and Implementation (E2LSH) http://web.mit.edu/andoni/www/LSH/index.html.
|
| |
12
|
P. Indyk and N. Thaper. Fast color image retrieval via embeddings. Workshop on Statistical and Computational Theories of Vision (ICCV), 2003.
|
| |
13
|
S. Y. Hu. Efficient Video Retrieval by Locality Sensitive Hashing. ICASSP'05, pp.449--452, 2005.
|
| |
14
|
J. Reiss, J. J. Aucouturier, and M. Sandler. Efficient multi dimensional searching routines for music information retrieval. ISMIR'01, 2001.
|
| |
15
|
|
| |
16
|
M. Casey and M. Slaney. Song Intersection by Approximate Nearest Neighbor Search. ISMIR'06, pp. 144--149, 2006.
|
| |
17
|
M. Lesaffre and M. Leman. Using Fuzzy to Handle Semantic Descriptions of Music in a Content-based Retrieval System. Workshop on Learning Semantics of Audio Signals (LSAS'06), pp.43--5, 2006.
|
| |
18
|
G. Tzanetakis and P. Cook. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, Vol.10, No.5, pp. 293--302, 2002.
|
| |
19
|
R. Miotto and N. Orio. A Methodology for the Segmentation and Identification of Music Works. ISMIR'07, pp.239--244, 2007.
|
| |
20
|
|
|