ACM Home Page
Please provide us with feedback. Feedback
Approximate matching algorithms for music information retrieval using vocal input
Full text PdfPdf (165 KB)
Source International Multimedia Conference archive
Proceedings of the eleventh ACM international conference on Multimedia table of contents
Berkeley, CA, USA
SESSION: Music table of contents
Pages: 130 - 139  
Year of Publication: 2003
ISBN:1-58113-722-2
Authors
Richard L. Kline  Pace University, New York, NY
Ephraim P. Glinert  Rensselaer Polytechnic Institute, Troy, NY
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 49,   Citation Count: 5
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/957013.957042
What is a DOI?

ABSTRACT

Effective use of multimedia collections requires efficient and intuitive methods of searching and browsing. This work considers databases which store music and explores how these may best be searched by providing input queries in some musical form. For the average person, humming several notes of the desired melody is the most straightforward method for providing this input, but such input is very likely to contain several errors. Previously proposed implementations of so-called query-by-humming systems are effective only when the number of input errors is small. We conducted experiments which revealed that the expected error rate for user queries is much higher than existing algorithms can tolerate. We then developed algorithms based on approximate matching techniques which deliver much improved results when comparing error-filled vocal user queries against a music collection.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. Basic Local Alignment Search Tool. J. Mol. Biol., 215:403--410, 1990.
 
2
S. F. Altschul, T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research, 25(17):3389--3402, 1997.
 
3
R. B. Dannenberg. An on-line algorithm for real-time accompaniment. In Proc. 1984 International Computer Music Conference (ICMC '84), pages 193--198, 1984.
 
4
W. J. Dowling. Scale and contour: Two components of a theory of memory for melodies. Psychological Review, 85(4):341--354, 1978.
5
6
 
7
D. Greenhaus et al. Digital tradition folksong database. Available (as of this writing) at http://www.mudcat.org/folksearch.html.
 
8
A. R. Halpern. Memory for the absolute pitch of familiar songs. Memory and Cognition, 17(5):572--581, 1989.
 
9
J.-S. R. Jang, H.-R. Lee, and M.-Y. Kao. Content-based music retrieval using linear scaling and branch-and-bound tree search. In IEEE Int'l Conf. on Multimedia and Expo, Tokyo, JP, August 2001.
 
10
T. Kageyama, K. Mochizuki, and Y. Takashima. Melody retrieval with humming. In Proc. 1993 International Computer Music Conference (ICMC'93), pages 349--351, Tokyo, 1993.
 
11
N. Kosugi. Personal correspondence.
12
 
13
D. J. Levitin. Absolute memory for musical pitch: Evidence from the production of learned melodies. Perception and Psychophysics, 56(4):414--423, 1994.
 
14
D. J. Levitin and P. R. Cook. Memory for musical tempo: Additional evidence that auditory memory is absolute. Perception and Psychophysics, 58(6):927--935, 1996.
 
15
G. Li. Robust regression. In D. C. Hoaglin, F. Mosteller, and J. W. Tukey, editors, Exploring Data Tables, Trends, and Shapes, pages 281--343. Wiley and Sons, New York, 1985.
 
16
A. T. Lindsay. Using contour as a mid-level representation of melody. Master's thesis, Mass. Institute of Technology, 1996.
 
17
R. J. McNab, L. A. Smith, D. Bainbridge, and I. H. Witten. The New Zealand Digital Library MELody inDEX. D-Lib Magazine, May 1997. http://www.dlib.org/dlib/may97/meldex/05witten.html.
18
 
19
M. Mongeau and D. Sankoff. Comparison of musical sequences. Computers and the Humanities, 24:161--175, 1990.
 
20
S. Pauws. CubyHum: a fully operational query by humming system. In Proc. 3rd Int'l Symposium on Music Information Retrieval (ISMIR'02), pages 187--196, Paris, 2002.
 
21
E. D. Scheirer. Tempo and beat analysis of acoustic musical signals. J. Acoust. Soc. Am., 103(1):588--601, Jan 1998.
 
22
T. F. Smith and M. S. Waterman. Identification of common molecular subsequences. J. Mol. Biol., 147:195--197, 1981.
23
 
24
Wildcat Canyon Software. Autoscore Pro 2.0 computer software. Available (as of this writing) at http://www.wildcat.com.
25


Collaborative Colleagues:
Richard L. Kline: colleagues
Ephraim P. Glinert: colleagues