ACM Home Page
Please provide us with feedback. Feedback
Toward open-microphone engagement for multiparty interactions
Full text PdfPdf (902 KB)
Source International Conference on Multimodal Interfaces archive
Proceedings of the 8th international conference on Multimodal interfaces table of contents
Banff, Alberta, Canada
POSTER SESSION: Poster session 2 table of contents
Pages: 273 - 280  
Year of Publication: 2006
ISBN:1-59593-541-X
Authors
Rebecca Lunsford  Oregon Health & Science University, Beaverton, OR and Natural Interaction Systems, LLC, Seattle, WA
Sharon Oviatt  Oregon Health & Science University, Beaverton, OR and University of Washington, Seattle, WA and Natural Interaction Systems, LLC, Seattle, WA
Alexander M. Arthur  Natural Interaction Systems, LLC, Seattle, WA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 4,   Downloads (12 Months): 22,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1180995.1181049
What is a DOI?

ABSTRACT

There currently is considerable interest in developing new open-microphone engagement techniques for speech and multimodal interfaces that perform robustly in complex mobile and multiparty field environments. State-of-the-art audio-visual open-microphone engagement systems aim to eliminate the need for explicit user engagement by processing more implicit cues that a user is addressing the system, which results in lower cognitive load for the user. This is an especially important consideration for mobile and educational interfaces due to the higher load required by explicit system engagement. In the present research, longitudinal data were collected with six triads of high-school students who engaged in peer tutoring on math problems with the aid of a simulated computer assistant. Results revealed that amplitude was 3.25dB higher when users addressed a computer rather than human peer when no lexical marker of intended interlocutor was present, and 2.4dB higher for all data. These basic results were replicated for both matched and adjacent utterances to computer versus human partners. With respect to dialogue style, speakers did not direct a higher ratio of commands to the computer, although such dialogue differences have been assumed in prior work. Results of this research reveal that amplitude is a powerful cue marking a speaker's intended addressee, which should be leveraged to design more effective microphone engagement during computer-assisted multiparty interactions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Boersma, P. & Weenink, D., Praat: Doing phonetics by computer (version 4.2). 2005. (URL:www.praat.org).
 
3
Coulston, R., Oviatt, S. L., & Darves, C. Amplitude convergence in children's conversational speech with animated personas. In Proceedings of the International Conference on Spoken Language Processing (ICSLP'2002), 2002, Casual Prod. Ltd., Denver, CO: 2689--2692.
 
4
Escera, C., Corral, M.-J., & Yago, E., An electrophysiological and behavioral investigation of involuntary attention towards auditory frequency, duration and intensity changes. Cognitive Brain Research, 14, 3: 325--332.
 
5
 
6
7
 
8
Levow, G.-A. Prosodic cues to discourse segment boundaries in human-computer dialogue. In Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue 2004: 93--96.
9
 
10
Messer, D. J., The identification of names in maternal speech to infants. Journal of Psycholinguistic Research, 10, 1 (January 1981): 69--77.
 
11
Neti, C., Iyengar, G., Potamianos, G., Senior, A., & Maison, B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. In Proceedings of the International Conference on Spoken Language Processing, 2000 (Beijing). 3, Chinese Friendship Publishers: 11--14.
12
 
13
 
14
Paek, T., Horvitz, E., & Ringger, E. Continuous listening for unconstrained spoken dialog. In Proceedings of the International Conference on Spoken Language Processing, 2000 (Beijing, China). Chinese Freindship Publishers: 138--141.
 
15
Schroger, E., A neural mechanism for involuntary attention shifts to changes in auditory stimulation. Journal of Cognitive Neuroscience, 8, 6 (November 1996): 527--539.
16
 
17
Welkowitz, J., Feldstein, S., Finklestein, M., & Aylesworth, L., Changes in vocal intensity as a function of interspeaker influence. Perceptual and Motor Skills, 35: 715--718.


Collaborative Colleagues:
Rebecca Lunsford: colleagues
Sharon Oviatt: colleagues
Alexander M. Arthur: colleagues