ACM Home Page
Please provide us with feedback. Feedback
Towards the digital music library: tune retrieval from acoustic input
Full text PdfPdf (998 KB)
Source International Conference on Digital Libraries archive
Proceedings of the first ACM international conference on Digital libraries table of contents
Bethesda, Maryland, United States
Pages: 11 - 18  
Year of Publication: 1996
ISBN:0-89791-830-4
Authors
Rodger J. McNab  Department of Computer Science, University of Waikato, Hamilton, New Zealand
Lloyd A. Smith  Department of Computer Science, University of Waikato, Hamilton, New Zealand
Ian H. Witten  Department of Computer Science, University of Waikato, Hamilton, New Zealand
Clare L. Henderson  School of Education, University of Waikato, Hamilton, New Zealand
Sally Jo Cunningham  Department of Computer Science, University of Waikato, Hamilton, New Zealand
Sponsors
SIGBIO: ACM Special Interest Group on Biomedical Computing
SIGCAPH: ACM SIGCAPH Computers and the Physically Handicapped
SIGGROUP: ACM Special Interest Group on Supporting Group Work
SIGMIS: ACM Special Interest Group on Management Information Systems
SIGADA: ACM Special Interest Group on Ada Programming Language
SIGART: ACM Special Interest Group on Artificial Intelligence
SIGCUE: ACM Special Interest Group on Computer Uses In Education
SIGCOMM: ACM Special Interest Group on Data Communication
SIGIR: ACM Special Interest Group on Information Retrieval
SIGLINK: Hypertext, Hypermedia, and Web
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 13,   Downloads (12 Months): 122,   Citation Count: 48
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/226931.226934
What is a DOI?

ABSTRACT

Music is traditionally retrieved by title, composer or subject classification. It is possible, with current technology, to retrieve music from a database on the basis of a few notes sung or hummed into a microphone. This paper describes the implementation of such a system, and discusses several issues pertaining to music retrieval. We first describe an interface that transcribes acoustic input into standard music notation. We then analyze string matching requirements for ranked retrieval of music and present the results of an experiment which tests how accurately people sing well known melodies. The performance of several string matching criteria are analyzed using two folk song databases. Finally, we describe a prototype system which has been developed for retrieval of tunes from acoustic input.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Bainbridge, D. and Bell, T.C. (1996) "An extensible optical music recognition system." Proc Australian Conference on Computer Science. Melbourne; January.
 
2
Bauer, B. (1988) The new Real Book. Sher Music Co, Petaluma, CA.
 
3
Bishop, M. J. and Thompson, E. A. (1986) "Maximum likelihood alignment of DNA sequences." J. Molecular Biology 190: 159-165.
 
4
Carter, N.P. (1989) Automatic recognition of printed music in the context of electronic publishing. Ph.D. thesis, University of Surrey, UK; February.
 
5
Cohen, A. and Cohen, N. (1973) "Tune evolution as an indicator of traditional musical norms." J. American Folklore 86, 339: 37-47.
 
6
Deutsch, D. (1972) "Octave generalization and tune recognition." Perception and Psychophysics 11(6): 411-412.
 
7
Dowling, W. J. (1978) "Scale and contour: Two components of a theory of memory for melodies." Psychological Review 85(4) 341-354.
8
 
9
Gold, B. and Rabmer, L. (1969) "Parallel processing techniques for estimating pitch periods of speech in the time domain." J. Acoust. Soc. Am. 46(2) 442-448.
 
10
Goodrum, C. A. and Dalrymple, H. W. (1982) Guide to the Library of Congress. Library of Congress, Washington, D. C.
 
11
Hess, W. (1983) Pitch Determination of Speech Signals. Springer- Verlag, New York.
 
12
Hawley, M. (1990) "The personal orchestra." Computing Systems 3(2): 289-329.
13
14
 
15
McNab, R. J., Smith, L. A. and Witten, I.H. (1995) "Signal processing for melody transcriptlon." Working Paper 95/22. University of Waikato, Hamilton, New Zealand.
 
16
Mongeau, M. and Sankoff, D. (1990) "Comparison of musical sequences." Computers and the Humanities 24:161-I75.
 
17
 
18
Selfridge-Field, E. (1994) "Optical recognition of music notation: a survey of current work." Computing in Musicology, 9: 109-145.
 
19
Sloboda, J. (I982) "Music performance." in The Psychology of Music, edited by D. Deutsch, Academic Press, pp. 479- 496.
 
20
Sundberg, j. and B. Lindblom (1976) "Generative theories in language and music descriptions." Cognition 4: 99-122.
21
 
22
Waibel, A. and Yegnanarayana, B. (1983) "Comparative study of nonlinear warping techniques in isolated word speech recognition systems." IEEE Trans Acoustics, Speech, and Signal Proc. 31(6): 1582-1586.
23

CITED BY  48

Collaborative Colleagues:
Rodger J. McNab: colleagues
Lloyd A. Smith: colleagues
Ian H. Witten: colleagues
Clare L. Henderson: colleagues
Sally Jo Cunningham: colleagues