ABSTRACT
This article outlines some of the characteristics of speech technology that distinguish it from traditional interaction techniques. It then discusses a number of human factors issues relating to speech in the context of a proposed research agenda for speech-based interfaces.