ABSTRACT
This paper describes a control-by-humming interface in which a Bluetooth-connected insertion earphone/microphone remotely controls a small portable system, such as a modern assistive device or cell phone. A pitch detection algorithm converts a subvocal hum input signal into pitch contours, which are segmented into discrete "notes" and then grouped to form control commands. These commands cause transitions among the system's operational states. An example application is given for hands-free control of a simplified (six-state) cell phone and music player system. Performance of the interface is discussed and future improvements are outlined.
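
The pipeline summarized above (pitch contour, then notes, then commands, then state transitions) can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes a pitch detector has already produced a per-frame fundamental-frequency track in Hz, with 0.0 marking unvoiced frames. The one-semitone tolerance, the three-frame minimum note length, the U/D/S contour alphabet, the `COMMANDS` table, and the three states in `TRANSITIONS` are all hypothetical choices made for illustration; the abstract does not specify the actual commands or the six states.

```python
import math

def hz_to_semitone(f0_hz):
    """Convert a frequency in Hz to a fractional MIDI note number."""
    return 69.0 + 12.0 * math.log2(f0_hz / 440.0)

def segment_notes(f0_track, tol=1.0, min_frames=3):
    """Group consecutive voiced frames of similar pitch into notes.

    f0_track: per-frame pitch estimates in Hz, 0.0 meaning unvoiced.
    Returns the mean pitch (in semitones) of each detected note.
    tol and min_frames are assumed values, not taken from the paper.
    """
    notes, current = [], []
    for f0 in list(f0_track) + [0.0]:  # trailing 0.0 flushes the last note
        st = hz_to_semitone(f0) if f0 > 0 else None
        if st is not None and (
            not current or abs(st - sum(current) / len(current)) <= tol
        ):
            current.append(st)  # frame continues the current note
            continue
        if len(current) >= min_frames:  # drop blips shorter than min_frames
            notes.append(sum(current) / len(current))
        current = [st] if st is not None else []
    return notes

def contour(notes, tol=1.0):
    """Encode note-to-note movement as U (up), D (down), or S (same)."""
    symbols = []
    for prev, cur in zip(notes, notes[1:]):
        diff = cur - prev
        symbols.append("S" if abs(diff) <= tol else ("U" if diff > 0 else "D"))
    return "".join(symbols)

# Hypothetical command vocabulary and state table (illustration only).
COMMANDS = {"U": "next", "D": "back", "UD": "select"}
TRANSITIONS = {
    ("idle", "next"): "music",
    ("music", "next"): "phone",
    ("music", "back"): "idle",
    ("phone", "back"): "music",
}

def step(state, command):
    """Return the next state; unknown commands leave the state unchanged."""
    return TRANSITIONS.get((state, command), state)

# Two hummed notes, the second higher than the first, separated by a pause.
track = [220.0] * 5 + [0.0] * 2 + [330.0] * 5
command = COMMANDS.get(contour(segment_notes(track)))
state = step("idle", command)
```

Encoding relative pitch movement rather than absolute pitch means a command does not depend on the key in which the user hums, which is one plausible reason to group notes by contour; the paper itself may group notes differently.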