ACM Home Page
Please provide us with feedback. Feedback
Visual and linguistic information in gesture classification
Full text PdfPdf (319 KB)
Source International Conference on Multimodal Interfaces archive
Proceedings of the 6th international conference on Multimodal interfaces table of contents
State College, PA, USA
SESSION: Multimodal communication table of contents
Pages: 113 - 120  
Year of Publication: 2004
ISBN:1-58113-995-0
Authors
Jacob Eisenstein  Massachusetts Institute of Technology, Cambridge, MA
Randall Davis  Massachusetts Institute of Technology, Cambridge, MA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 12,   Downloads (12 Months): 74,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027933.1027954
What is a DOI?

ABSTRACT

Classification of natural hand gestures is usually approached by applying pattern recognition to the movements of the hand. However, the gesture categories most frequently cited in the psychology literature are fundamentally multimodal; the definitions make reference to the surrounding linguistic context. We address the question of whether gestures are naturally multimodal, or whether they can be classified from hand-movement data alone. First, we describe an empirical study showing that the removal of auditory information significantly impairs the ability of human raters to classify gestures. Then we present an automatic gesture classification system based solely on an n-gram model of linguistic context; the system is intended to supplement a visual classifier, but achieves 66% accuracy on a three-class classification problem on its own. This represents higher accuracy than human raters achieve when presented with the same information.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Cassell, J. A framework for gesture generation and interpretation. Computer Vision in Human-Machine Interaction. Cambridge University Press (1998), 191--215.
3
4
 
5
Eisenstein, J., and Davis, R. Natural gesture in descriptive monologues. UIST'03 Supplemental Proceedings. ACM Press (2003), 69--70.
 
6
Freeman, W. T., and Weissman, C. Television control by hand gestures. International Workshop on Automatic Face- and Gesture- Recognition. IEEE Press (1995), M. Bichsel, Ed., 179--183.
7
 
8
Kendon, A. Conducting Interaction. Cambridge University Press, 1990.
 
9
 
10
 
11
 
12
 
13
McNeill, D. Hand and Mind. The University of Chicago Press, 1992.
 
14
McNeill, D., Quek, F., McCullough, K.-E., Duncan, S., Furuyama, N., Bryll, R., Ma, X.-F., and Ansari, R. Catchments, prosody, and discourse. Gesture 1 (2001), 9--33.
 
15
Nakano, Y. I., Okamoto, M., Kawahara, D., Li, Q., and Nishida, T. Converting a text into agent animations: Assigning gestures to a text. HLT-NAACL 2004: Companion Volume. ACL Press (2004), 153--156.
16
 
17
Rennie, J. D. M., Shih, L., Teevan, J., and Karger, D. R. Tackling the poor assumptions of naive bayes text classifiers. Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003). AAAI Press (2003).
 
18
Sparrell, C. Coverbal iconic gesture in human-computer interaction. Master's thesis, Massachusetts Institute of Technology, 1993.
 
19
20


Collaborative Colleagues:
Jacob Eisenstein: colleagues
Randall Davis: colleagues