Towards the digital music library: tune retrieval from acoustic input
Source
International Conference on Digital Libraries
Proceedings of the first ACM international conference on Digital libraries
Bethesda, Maryland, United States
Pages: 11-18
Year of Publication: 1996
ISBN: 0-89791-830-4
Authors
Rodger J. McNab, Department of Computer Science, University of Waikato, Hamilton, New Zealand
Lloyd A. Smith, Department of Computer Science, University of Waikato, Hamilton, New Zealand
Ian H. Witten, Department of Computer Science, University of Waikato, Hamilton, New Zealand
Clare L. Henderson, School of Education, University of Waikato, Hamilton, New Zealand
Sally Jo Cunningham, Department of Computer Science, University of Waikato, Hamilton, New Zealand
ABSTRACT
Music is traditionally retrieved by title, composer or subject
classification. It is possible, with current technology, to
retrieve music from a database on the basis of a few notes sung or
hummed into a microphone. This paper describes the implementation
of such a system, and discusses several issues pertaining to music
retrieval. We first describe an interface that transcribes acoustic
input into standard music notation. We then analyze string-matching
requirements for ranked retrieval of music and present the results
of an experiment that tests how accurately people sing well-known
melodies. The performance of several string-matching criteria is
analyzed using two folk song databases. Finally, we describe a
prototype system that has been developed for retrieval of tunes
from acoustic input.
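
The idea of ranked tune retrieval by approximate string matching can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the matching criteria, so the choices here are assumptions made for illustration only. Melodies are given as MIDI pitch numbers, each melody is reduced to its sequence of semitone intervals so that the key the user sings in does not matter, and tunes are ranked by unit-cost edit distance. The names intervals, edit_distance, rank_tunes, TOY_DATABASE and QUERY are hypothetical, and the three-tune database is toy data.

```python
from typing import Dict, List, Tuple

# Hypothetical toy database: tune openings as MIDI pitch numbers.
TOY_DATABASE: Dict[str, List[int]] = {
    "Twinkle, Twinkle, Little Star": [60, 60, 67, 67, 69, 69, 67],
    "Frere Jacques": [60, 62, 64, 60, 60, 62, 64, 60],
    "Mary Had a Little Lamb": [64, 62, 60, 62, 64, 64, 64],
}

# Hypothetical query: "Twinkle" sung a tone higher with one wrong note.
QUERY: List[int] = [62, 62, 69, 69, 71, 70, 69]


def intervals(pitches: List[int]) -> List[int]:
    """Semitone steps between successive notes (transposition-invariant)."""
    return [b - a for a, b in zip(pitches, pitches[1:])]


def edit_distance(a: List[int], b: List[int]) -> int:
    """Unit-cost edit distance (insert, delete, substitute) between sequences."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x from a
                curr[j - 1] + 1,         # insert y into a
                prev[j - 1] + (x != y),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def rank_tunes(query: List[int],
               database: Dict[str, List[int]]) -> List[Tuple[int, str]]:
    """Return (distance, title) pairs, best match first."""
    q = intervals(query)
    return sorted(
        (edit_distance(q, intervals(notes)), title)
        for title, notes in database.items()
    )


if __name__ == "__main__":
    for distance, title in rank_tunes(QUERY, TOY_DATABASE):
        print(distance, title)
```

This sketch compares the query against each stored opening as a whole. A practical system would also need to match a sung fragment anywhere within a tune and to tolerate rhythm errors, neither of which is attempted here.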