|
ABSTRACT
Classification of natural hand gestures is usually approached by applying pattern recognition to the movements of the hand. However, the gesture categories most frequently cited in the psychology literature are fundamentally multimodal; the definitions make reference to the surrounding linguistic context. We address the question of whether gestures are naturally multimodal, or whether they can be classified from hand-movement data alone. First, we describe an empirical study showing that the removal of auditory information significantly impairs the ability of human raters to classify gestures. Then we present an automatic gesture classification system based solely on an n-gram model of linguistic context; the system is intended to supplement a visual classifier, but achieves 66% accuracy on a three-class classification problem on its own. This represents higher accuracy than human raters achieve when presented with the same information.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Cassell, J. A framework for gesture generation and interpretation. Computer Vision in Human-Machine Interaction. Cambridge University Press (1998), 191--215.
|
 |
3
|
|
 |
4
|
Philip R. Cohen , Michael Johnston , David McGee , Sharon Oviatt , Jay Pittman , Ira Smith , Liang Chen , Josh Clow, QuickSet: multimodal interaction for distributed applications, Proceedings of the fifth ACM international conference on Multimedia, p.31-40, November 09-13, 1997, Seattle, Washington, United States
[doi> 10.1145/266180.266328]
|
| |
5
|
Eisenstein, J., and Davis, R. Natural gesture in descriptive monologues. UIST'03 Supplemental Proceedings. ACM Press (2003), 69--70.
|
| |
6
|
Freeman, W. T., and Weissman, C. Television control by hand gestures. International Workshop on Automatic Face- and Gesture- Recognition. IEEE Press (1995), M. Bichsel, Ed., 179--183.
|
 |
7
|
Ed Kaiser , Alex Olwal , David McGee , Hrvoje Benko , Andrea Corradini , Xiaoguang Li , Phil Cohen , Steven Feiner, Mutual disambiguation of 3D multimodal interaction in augmented and virtual reality, Proceedings of the 5th international conference on Multimodal interfaces, November 05-07, 2003, Vancouver, British Columbia, Canada
[doi> 10.1145/958432.958438]
|
| |
8
|
Kendon, A. Conducting Interaction. Cambridge University Press, 1990.
|
| |
9
|
|
| |
10
|
|
| |
11
|
David B. Koons , Carlton J. Sparrell , Kristinn R. Thorisson, Integrating simultaneous input from speech, gaze, and hand gestures, Intelligent multimedia interfaces, American Association for Artificial Intelligence, Menlo Park, CA, 1993
|
| |
12
|
|
| |
13
|
McNeill, D. Hand and Mind. The University of Chicago Press, 1992.
|
| |
14
|
McNeill, D., Quek, F., McCullough, K.-E., Duncan, S., Furuyama, N., Bryll, R., Ma, X.-F., and Ansari, R. Catchments, prosody, and discourse. Gesture 1 (2001), 9--33.
|
| |
15
|
Nakano, Y. I., Okamoto, M., Kawahara, D., Li, Q., and Nishida, T. Converting a text into agent animations: Assigning gestures to a text. HLT-NAACL 2004: Companion Volume. ACL Press (2004), 153--156.
|
 |
16
|
|
| |
17
|
Rennie, J. D. M., Shih, L., Teevan, J., and Karger, D. R. Tackling the poor assumptions of naive bayes text classifiers. Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003). AAAI Press (2003).
|
| |
18
|
Sparrell, C. Coverbal iconic gesture in human-computer interaction. Master's thesis, Massachusetts Institute of Technology, 1993.
|
| |
19
|
|
 |
20
|
|
CITED BY 4
|
|
|
|
|
Christine Alvarado , Ned Burns , Howard Chen , Jason Fennell , Sarah Harris , Max Pfleuger , Devin Smith , Paul Wais , Matt Weiner , Aaron Wolin, Sketch understanding systems, ACM SIGGRAPH 2007 courses, August 05-09, 2007, San Diego, California
|
|
|
|
|
|
Gregory Gay , Tim Menzies , Bojan Cukic , Burak Turhan, How to build repeatable experiments, Proceedings of the 5th International Conference on Predictor Models in Software Engineering, May 18-19, 2009, Vancouver, British Columbia, Canada
|
|