| Toward open-microphone engagement for multiparty interactions |
| Full text |
Pdf
(902 KB)
|
| Source
|
International Conference on Multimodal Interfaces
archive
Proceedings of the 8th international conference on Multimodal interfaces
table of contents
Banff, Alberta, Canada
POSTER SESSION: Poster session 2
table of contents
Pages: 273 - 280
Year of Publication: 2006
ISBN:1-59593-541-X
|
|
Authors
|
|
Rebecca Lunsford
|
Oregon Health & Science University, Beaverton, OR and Natural Interaction Systems, LLC, Seattle, WA
|
|
Sharon Oviatt
|
Oregon Health & Science University, Beaverton, OR and University of Washington, Seattle, WA and Natural Interaction Systems, LLC, Seattle, WA
|
|
Alexander M. Arthur
|
Natural Interaction Systems, LLC, Seattle, WA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 4, Downloads (12 Months): 22, Citation Count: 3
|
|
|
ABSTRACT
There currently is considerable interest in developing new open-microphone engagement techniques for speech and multimodal interfaces that perform robustly in complex mobile and multiparty field environments. State-of-the-art audio-visual open-microphone engagement systems aim to eliminate the need for explicit user engagement by processing more implicit cues that a user is addressing the system, which results in lower cognitive load for the user. This is an especially important consideration for mobile and educational interfaces due to the higher load required by explicit system engagement. In the present research, longitudinal data were collected with six triads of high-school students who engaged in peer tutoring on math problems with the aid of a simulated computer assistant. Results revealed that amplitude was 3.25dB higher when users addressed a computer rather than human peer when no lexical marker of intended interlocutor was present, and 2.4dB higher for all data. These basic results were replicated for both matched and adjacent utterances to computer versus human partners. With respect to dialogue style, speakers did not direct a higher ratio of commands to the computer, although such dialogue differences have been assumed in prior work. Results of this research reveal that amplitude is a powerful cue marking a speaker's intended addressee, which should be leveraged to design more effective microphone engagement during computer-assisted multiparty interactions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Alexander M. Arthur , Rebecca Lunsford , Matt Wesson , Sharon Oviatt, Prototyping novel collaborative multimodal systems: simulation, data collection and analysis tools for the next decade, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada
[doi> 10.1145/1180995.1181039]
|
| |
2
|
Boersma, P. & Weenink, D., Praat: Doing phonetics by computer (version 4.2). 2005. (URL:www.praat.org).
|
| |
3
|
Coulston, R., Oviatt, S. L., & Darves, C. Amplitude convergence in children's conversational speech with animated personas. In Proceedings of the International Conference on Spoken Language Processing (ICSLP'2002), 2002, Casual Prod. Ltd., Denver, CO: 2689--2692.
|
| |
4
|
Escera, C., Corral, M.-J., & Yago, E., An electrophysiological and behavioral investigation of involuntary attention towards auditory frequency, duration and intensity changes. Cognitive Brain Research, 14, 3: 325--332.
|
| |
5
|
|
| |
6
|
|
 |
7
|
|
| |
8
|
Levow, G.-A. Prosodic cues to discourse segment boundaries in human-computer dialogue. In Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue 2004: 93--96.
|
 |
9
|
|
| |
10
|
Messer, D. J., The identification of names in maternal speech to infants. Journal of Psycholinguistic Research, 10, 1 (January 1981): 69--77.
|
| |
11
|
Neti, C., Iyengar, G., Potamianos, G., Senior, A., & Maison, B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. In Proceedings of the International Conference on Spoken Language Processing, 2000 (Beijing). 3, Chinese Friendship Publishers: 11--14.
|
 |
12
|
|
| |
13
|
|
| |
14
|
Paek, T., Horvitz, E., & Ringger, E. Continuous listening for unconstrained spoken dialog. In Proceedings of the International Conference on Spoken Language Processing, 2000 (Beijing, China). Chinese Freindship Publishers: 138--141.
|
| |
15
|
Schroger, E., A neural mechanism for involuntary attention shifts to changes in auditory stimulation. Journal of Cognitive Neuroscience, 8, 6 (November 1996): 527--539.
|
 |
16
|
|
| |
17
|
Welkowitz, J., Feldstein, S., Finklestein, M., & Aylesworth, L., Changes in vocal intensity as a function of interspeaker influence. Perceptual and Motor Skills, 35: 715--718.
|
CITED BY 3
|
|
Alexander M. Arthur , Rebecca Lunsford , Matt Wesson , Sharon Oviatt, Prototyping novel collaborative multimodal systems: simulation, data collection and analysis tools for the next decade, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
User-centered design
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Interaction styles (e.g., commands, menus, forms, direct manipulation);
Voice I/O;
Natural language;
Input devices and strategies (e.g., mouse, touchscreen);
Evaluation/methodology;
Theory and methods;
Prototyping
General Terms:
Design,
Human Factors,
Performance,
Reliability
Keywords:
collaborative peer tutoring,
computer-supported collaborative work,
dialogue style,
intended addressee,
multimodal interaction,
open-microphone engagement,
spoken amplitude,
user communication modeling
|