|
ABSTRACT
Mobile usage patterns often entail high and fluctuating levels of difficulty as well as dual tasking. One major theme explored in this research is whether a flexible multimodal interface supports users in managing cognitive load. Findings from this study reveal that multimodal interface users spontaneously respond to dynamic changes in their own cognitive load by shifting to multimodal communication as load increases with task difficulty and communicative complexity. Given a flexible multimodal interface, users' ratio of multimodal (versus unimodal) interaction increased substantially from 18.6% when referring to established dialogue context to 77.1% when required to establish a new context, a +315% relative increase. Likewise, the ratio of users' multimodal interaction increased significantly as the tasks became more difficult, from 59.2% during low difficulty tasks, to 65.5% at moderate difficulty, 68.2% at high and 75.0% at very high difficulty, an overall relative increase of +27%. Analysis of users' task-critical errors and response latencies across task difficulty levels increased systematically and significantly as well, corroborating the manipulation of cognitive processing load. The adaptations seen in this study reflect users' efforts to self-manage limitations on working memory when task complexity increases. This is accomplished by distributing communicative information across multiple modalities, which is compatible with a cognitive load theory of multimodal interaction. The long-term goal of this research is the development of an empirical foundation for proactively guiding flexible and adaptive multimodal system design.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Almor, A., Noun-phrase anaphora and focus: The informational load hypothesis. Psychological Review, 1999. 106: 748--765.
|
| |
2
|
Baddeley, A., Working Memory. Science, 1992. 255: 556--559.
|
| |
3
|
Benoit, C., J.-C. Martin, C. Pelachaud, L. Schomaker, & B. Suhm, Audio-visual and multimodal speech-based systems, Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation, R. Moore, ed. 2000, Kluwer Academic Publishers: Boston, MA. 102--203.
|
| |
4
|
Calvert, G., C. Spence, & B.E. Stein, eds. The handbook of multisensory processing. 2004, MIT Press: Cambridge, MA.
|
| |
5
|
Chandler, P. & J. Sweller, Cognitive load theory and the format of instruction. Cognition and Instruction, 1991. 8: 293--332.
|
| |
6
|
Grant, K.W. & S. Greenberg. Speech intelligibility derived from asynchronous processing of auditory-visual information. Workshop on Audio-Visual Speech Processing (AVSP-2001). 2001. Scheelsminde, Denmark
|
| |
7
|
Grice, H.P., Logic and conversation, Syntax and Semantics: Speech Acts, J.L. Morgan, ed. 1975, Acad Press: NY. 41--58.
|
 |
8
|
|
 |
9
|
Julie A. Jacko , Leon Barnard , Thitima Kongnakorn , Kevin P. Moloney , Paula J. Edwards , V. Kathlene Emery , Francois Sainfort, Isolating the effects of visual impairment: exploring the effect of AMD on the utility of multimodal feedback, Proceedings of the SIGCHI conference on Human factors in computing systems, p.311-318, April 24-29, 2004, Vienna, Austria
[doi> 10.1145/985692.985732]
|
| |
10
|
|
| |
11
|
Mayer, R.E. & R. Moreno, A split-attention effect in multimedia learning: evidence for dual processing systems in working memory. Journal of Educational Psychology, 1998. 90(2): 312--320.
|
| |
12
|
Mousavi, S.Y., R. Low, & J. Sweller, Reducing cognitive load by mixing auditory and visual presentation modes. Journal of Educational Psychology, 1995. 87(2): 319--334.
|
| |
13
|
|
| |
14
|
Oviatt, S.L., Predicting spoken disfluencies during human-computer interaction. Computer Speech and Language, 1995. 9: 19--35.
|
| |
15
|
Oviatt, S.L., Multimodal interactive maps: Designing for human performance. Human Computer Interaction, 1997. 12(1-2): 93--129.
|
 |
16
|
|
 |
17
|
|
 |
18
|
|
| |
19
|
Oviatt, S.L., P.R. Cohen, L. Wu, J. Vergo, L. Duncan, B. Suhm, J. Bers, T.G. Holzman, T. Winograd, J. Landay, J. Larson, & D. Ferro, Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions. Human Computer Interaction, 2000. 15(4): 263--322.
|
 |
20
|
Sharon Oviatt , Rachel Coulston , Stefanie Tomko , Benfang Xiao , Rebecca Lunsford , Matt Wesson , Lesley Carmichael, Toward a theory of organized multimodal integration patterns during human-computer interaction, Proceedings of the 5th international conference on Multimodal interfaces, November 05-07, 2003, Vancouver, British Columbia, Canada
[doi> 10.1145/958432.958443]
|
| |
21
|
Oviatt, S.L., T. Darrell, & M. Flickner, Multimodal Interfaces that flex, adapt, and persist, Comm. of the ACM. 2004. 30--33
|
| |
22
|
Penney, C.G., Modality effects and the structure of short-term verbal memory. Memory and Cognition, 1989. 17: 398--422.
|
| |
23
|
Prince, E., Toward a taxonomy of given-new information, Radical Pragmatics, P. Cole, ed. 1986, Academic: NY. 223--255.
|
| |
24
|
Sweller, J., Cognitive load during problem solving: Effects on learning. Cognitive Science, 1988. 12: 257--285.
|
| |
25
|
Technology for adaptive aging: Reports and papers. 2003, Nat. Acad. of Sci. Workshop: Nat. Acad. Press. http://www.nap.edu/books/0309091160/html/
|
| |
26
|
Teder-Sälejärvi, W.A., J.J. McDonald, F. Di Russo, & S.A. Hillyard, An analysis of audio-visual crossmodal integration by means of event-related potential (ERP) recordings. Cognitive Brain Research, 2002. 14: 106--114.
|
| |
27
|
Tindall-Ford, S., P. Chandler, & J. Sweller, When two sensory modes are better than one. Journal of Experimental Psychology: Applied, 1997. 3(3): 257--287.
|
| |
28
|
Wickens, C., Sandry, D., and Vidulich, M., Compatibility and resource competition between modalities of input, central processing, and output. Human Factors, 1983. 25(2): 227--248.
|
 |
29
|
Benfang Xiao , Rebecca Lunsford , Rachel Coulston , Matt Wesson , Sharon Oviatt, Modeling multimodal integration patterns and performance in seniors: toward adaptive processing of individual differences, Proceedings of the 5th international conference on Multimodal interfaces, November 05-07, 2003, Vancouver, British Columbia, Canada
[doi> 10.1145/958432.958480]
|
CITED BY 23
|
|
Fang Chen , Eric Choi , Julien Epps , Serge Lichman , Natalie Ruiz , Yu Shi , Ronnie Taib , Mike Wu, A study of manual gesture-based selection for the PEMMI multimodal transport management interface, Proceedings of the 7th international conference on Multimodal interfaces, October 04-06, 2005, Torento, Italy
|
|
|
Antti Oulasvirta , Sakari Tamminen , Virpi Roto , Jaana Kuorelahti, Interaction in 4-second bursts: the fragmented nature of attentional resources in mobile HCI, Proceedings of the SIGCHI conference on Human factors in computing systems, April 02-07, 2005, Portland, Oregon, USA
|
|
|
|
|
|
Matthias Jöst , Jochen Häußler , Matthias Merdes , Rainer Malaka, Multimodal interaction for pedestrians: an evaluation study, Proceedings of the 10th international conference on Intelligent user interfaces, January 10-13, 2005, San Diego, California, USA
|
|
|
|
|
|
Paulo Barthelmess , Edward Kaiser , Rebecca Lunsford , David McGee , Philip Cohen , Sharon Oviatt, Human-centered collaborative interaction, Proceedings of the 1st ACM international workshop on Human-centered multimedia, October 27-27, 2006, Santa Barbara, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Yu Shi , Natalie Ruiz , Ronnie Taib , Eric Choi , Fang Chen, Galvanic skin response (GSR) as an index of cognitive load, CHI '07 extended abstracts on Human factors in computing systems, April 28-May 03, 2007, San Jose, CA, USA
|
|
|
|
|
|
|
|
|
Rock Leung , Karon MacLean , Martin Bue Bertelsen , Mayukh Saubhasik, Evaluation of haptically augmented touchscreen gui elements under cognitive load, Proceedings of the 9th international conference on Multimodal interfaces, November 12-15, 2007, Nagoya, Aichi, Japan
|
|
|
Pilar Manchón , Carmen del Solar , Gabriel Amores , Guillermo Pérez, Multimodal interaction analysis in a smart house, Proceedings of the 9th international conference on Multimodal interfaces, November 12-15, 2007, Nagoya, Aichi, Japan
|
|
|
Natalie Ruiz , Ronnie Taib , Yu (David) Shi , Eric Choi , Fang Chen, Using pen input features as indices of cognitive load, Proceedings of the 9th international conference on Multimodal interfaces, November 12-15, 2007, Nagoya, Aichi, Japan
|
|
|
Bo Yin , Natalie Ruiz , Fang Chen , M. Asif Khawaja, Automatic cognitive load detection from speech features, Proceedings of the 2007 conference of the computer-human interaction special interest group (CHISIG) of Australia on Computer-human interaction: design: activities, artifacts and environments, November 28-30, 2007, Adelaide, Australia
|
|
|
|
|
|
Madoka Miki , Chiyomi Miyajima , Takanori Nishino , Norihide Kitaoka , Kazuya Takeda, An integrative recognition method for speech and gestures, Proceedings of the 10th international conference on Multimodal interfaces, October 20-22, 2008, Chania, Crete, Greece
|
|
|
Saija Lemmelä , Akos Vetek , Kaj Mäkelä , Dari Trendafilov, Designing and evaluating multimodal interaction for mobile contexts, Proceedings of the 10th international conference on Multimodal interfaces, October 20-22, 2008, Chania, Crete, Greece
|
|
|
|
|
|
Meriam Horchani , Dominique Fréard , Benjamin Caron , Éric Jamet , Laurence Nigay , Franck Panaget, Stratégies de dialogue et de présentation multimodale: un composant logiciel dédié et son application à des expérimentations en magicien d'Oz, Proceedings of the 19th International Conference of the Association Francophone d'Interaction Homme-Machine, November 12-15, 2007, Paris, France
|
|
|
Sharon Oviatt , Alexander Arthur , Yaro Brock , Julia Cohen, Expressive pen-based interfaces for math education, Proceedings of the 8th iternational conference on Computer supported collaborative learning, p.573-582, July 16-21, 2007, New Brunswick, New Jersey, USA
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
User-centered design
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Evaluation/methodology;
Theory and methods;
Input devices and strategies (e.g., mouse, touchscreen);
Prototyping;
Voice I/O;
Natural language;
Interaction styles (e.g., commands, menus, forms, direct manipulation)
General Terms:
Design,
Human Factors,
Performance,
Reliability
Keywords:
cognitive load,
dialogue context,
human performance,
individual differences,
multimodal integration,
multimodal interaction,
speech and pen input,
system adaptation,
task difficulty,
unimodal interaction
|