|
ABSTRACT
This paper describes an intelligent system that we developed to support affective multimodal human-computer interaction (AMM-HCI) where the user's actions and emotions are modeled and then used to adapt the HCI and support the user in his or her activity. The proposed system, which we named Gaze-X, is based on sensing and interpretation of the human part of the computer's context, known as W5+ (who, where, what, when, why, how). It integrates a number of natural human communicative modalities including speech, eye gaze direction, face and facial expression, and a number of standard HCI modalities like keystrokes, mouse movements, and active software identification, which, in turn, are fed into processes that provide decision making and adapt the HCI to support the user in his or her activity according to his or her preferences. To attain a system that can be educated, that can improve its knowledge and decision making through experience, we use case-based reasoning as the inference engine of Gaze-X. The utilized case base is a dynamic, incrementally self-organizing event-content-addressable memory that allows fact retrieval and evaluation of encountered events based upon the user preferences and the generalizations formed from prior input. To support concepts of concurrency, modularity/scalability, persistency, and mobility, Gaze-X has been built as an agent-based system where different agents are responsible for different parts of the processing. A usability study conducted in an office scenario with a number of users indicates that Gaze-X is perceived as effective, easy to use, useful, and affectively qualitative.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
|
| |
4
|
Browne, D., Norman, M. and Riches, D. Why Build Adaptive Interfaces? In Adaptive User Interfaces. Browne, D., Totterdell, P. and Norman, M., Eds. Academic Press, London, UK, 1990, 15--57.
|
| |
5
|
Conati, C. Probabilistic assessment of user's emotions in educational games. Applied Artificial Intelligence, 16, 7-8 (Aug. 2002), 555--575.
|
| |
6
|
|
 |
7
|
|
| |
8
|
Dey, A. K., Abowd, G. D. and Salber, D. A conceptual framework and a toolkit for supporting the rapid prototyping of context-aware applications. J. Human-Computer Interaction, 16, 2-4 (Dec. 2001), 97--166.
|
| |
9
|
Duric, Z., Gray, W. D., Heishman, R., Li, F., Rosenfeld, A., Schoelles, M. J., Schunn, C. and Wechsler, H. Integrating perceptual and cognitive modeling for adaptive and intelligent human-computer interaction. Proceedings of the IEEE, 90, 7 (July 2002), 1272--1289.
|
| |
10
|
|
| |
11
|
Gu, H. and Ji, Q. An automated face reader for fatigue detection. In Proc. Int'l Conf. Face & Gesture Recognition, 111--116, 2004.
|
| |
12
|
Gunes, H. and Piccardi, M. Affect Recognition from Face and Body: Early Fusion vs. Late Fusion, In Proc. Int'l Conf. Systems, Man and Cybernetics, 3437--3443, 2005.
|
 |
13
|
|
| |
14
|
|
| |
15
|
|
| |
16
|
Jaimes, A. and Sebe, N. Multimodal human computer interaction: A survey. In Proc. Int'l Workshop on HCI in conjunction with Int'l Conf. Computer Vision, 2005.
|
 |
17
|
|
| |
18
|
Keltner, D. and Ekman, P. Facial expression of emotion. In Handbook of Emotions, Lewis, M., and Haviland-Jones, J. M. Eds. The Guilford Press, New York, 2000, pp. 236--249.
|
| |
19
|
van Kuilenburg, H., Wiering, M. and den Uyl, M. A model-based method for automatic facial expression recognition. Lecture Notes in Artificial Intelligence, 3720, 194--205, 2005.
|
 |
20
|
|
| |
21
|
Lisetti, C. L. and Schiano, D. J. Automatic facial expression interpretation: Where human-computer interaction, AI and cognitive science intersect. Pragmatics and Cognition, 8, 1 (Jan. 2000), 185--235.
|
| |
22
|
Marsic, I., Medl, A. and Flanagan, J. Natural communication with information systems. Proceedings of the IEEE, 88, 8 (Aug. 2000), 1354--1366.
|
| |
23
|
|
 |
24
|
|
| |
25
|
Oviatt, S. User-centred modelling and evaluation of multimodal interfaces. Proceedings of the IEEE, 91, 9 (Sep. 2003), 1457--1468.
|
| |
26
|
Pantic, M., Grootjans, R. J. and Zwitserloot, R. Teaching Ad-hoc Networks using a Simple Agent Framework. In Proc. Int'l Conf. Information Technology Based Higher Education and Training, pp. 6--11, 2005.
|
 |
27
|
Maja Pantic , Alex Pentland , Anton Nijholt , Thomas Huang, Human computing and machine understanding of human behavior: a survey, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada
[doi> 10.1145/1180995.1181044]
|
| |
28
|
Pantic, M. and Rothkrantz, L. J. M. Toward an affect-sensitive multimodal human-computer interaction. Proceedings of the IEEE, 91, 9 (Sep. 2003), 1370--1390.
|
| |
29
|
Pantic, M. and Rothkrantz, L. J. M. Case-based reasoning for user-profiled recognition of emotions from face images. In Proc. Int'l Conf. Multimedia and Expo, 2004, 391--394.
|
 |
30
|
|
 |
31
|
|
| |
32
|
|
| |
33
|
Porta, M. Vision-based user interfaces: methods and applications. Int'l J. Human-Computer Studies, 57, 1 (July 2002), 27--73.
|
| |
34
|
Prendinger, H. and Ishizuka, M. The empathic companion: A character-based interface that addresses users' affective states. Applied Artificial Intelligence, 19, 3-4 (Mar. 2005), 267--285.
|
 |
35
|
Leah M. Reeves , Jennifer Lai , James A. Larson , Sharon Oviatt , T. S. Balaji , Stéphanie Buisine , Penny Collings , Phil Cohen , Ben Kraal , Jean-Claude Martin , Michael McTear , TV Raman , Kay M. Stanney , Hui Su , Qian Ying Wang, Guidelines for multimodal user interface design, Communications of the ACM, v.47 n.1, January 2004
[doi> 10.1145/962081.962106]
|
| |
36
|
|
| |
37
|
Schank, R. C. Memory based expert systems. AFOSR.TR. 84-0814, Comp. Science Dept., Yale University, 1984.
|
 |
38
|
|
| |
39
|
Sharma, R., Yeasin, M., Krahnstoever, N., Rauschert, I., Cai, G., Maceachren, A. M. and Sengupta, K. Speech-gesture driven multimodal interfaces for crisis management. Proceedings of the IEEE, 91, 9 (Sep. 2003), 1327--1354.
|
 |
40
|
|
 |
41
|
|
| |
42
|
Vo, M. T. and Waibel, A. Multimodal human-computer interaction. In Proc. Int'l Symposium on Spoken Dialogue, 1993.
|
| |
43
|
|
 |
44
|
|
CITED BY 2
|
Maja Pantic , Alex Pentland , Anton Nijholt , Thomas Huang, Human computing and machine understanding of human behavior: a survey, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada
|
|
Zhihong Zeng , Maja Pantic , Glenn I. Roisman , Thomas S. Huang, A survey of affect recognition methods: audio, visual and spontaneous expressions, Proceedings of the 9th international conference on Multimodal interfaces, November 12-15, 2007, Nagoya, Aichi, Japan
|
|