ACM Home Page
Please provide us with feedback. Feedback
Estimating focus of attention based on gaze and sound
Full text PdfPdf (357 KB)
Source ACM International Conference Proceeding Series; Vol. 15 archive
Proceedings of the 2001 workshop on Perceptive user interfaces table of contents
Orlando, Florida
SESSION: Paper session #3 table of contents
Pages: 1 - 9  
Year of Publication: 2001
Authors
Rainer Stiefelhagen  University of Karlsruhe, Germany
Jie Yang  Carnegie Mellon University, Pittsburgh, PA
Alex Waibel  Carnegie Mellon University, Pittsburgh, PA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 36,   Citation Count: 9
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/971478.971505
What is a DOI?

ABSTRACT

Estimating a person's focus of attention is useful for various human-computer interaction applications, such as smart meeting rooms, where a user's goals and intent have to be monitored. In work presented here, we are interested in modeling focus of attention in a meeting situation. We have developed a system capable of estimating participants' focus of attention from multiple cues. We employ an omnidirectional camera to simultaneously track participants' faces around a meeting table and use neural networks to estimate their head poses. In addition, we use microphones to detect who is speaking. The system predicts participants' focus of attention from acoustic and visual information separately, and then combines the output of the audio- and video-based focus of attention predictors. We have evaluated the system using the data from three recorded meetings. The acoustic information has provided 8% error reduction on average compared to using a single modality.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
M. Argyle and M. Cook. Gaze and Mutual Gaze. Cambridge University Press, 1976.
 
3
 
4
M. Black, F. Brard, A. Jepson, W. Newman, E. Saund, G. Socher, and M. Taylor. The digital office: Overview. In Proceedings of the 1998 AAAI Spring Symposium on Intelligent Environments, volume AAAI Technical Report SS-98-02. AAAI, AAAI Press, March 1998.
 
5
 
6
A. J. Diebold. Animal Communication - Techniques of Study and Results of Research, chapter Anthropology of the comparative psychology of communicative behavior. Bloomington: Indiana University Press, 1968.
 
7
A. H. Gee and R. Cipolla. Non-intrusive gaze tracking for human-computer interaction. In Proc. Mechatronics and Machine Vision in Practise, pages 112--117, 1994.
 
8
D. Gopher. The Blackwell dictionary of Cognitive Psychology, chapter Attention, pages 23--28. Basil Blackwell Inc., 1990.
 
9
 
10
S. R. Langton, R. J. Watt, and V. Bruce. Do the eyes have it? cues to the direction of social attention. Trends in Cognitive Neuroscience, 4(2), 2000.
 
11
M. Mozer. The neural network house: An environment that adapts to its inhibitants. In Intelligent Environments, Papers from the 1998 AAAI Spring Symposium, number Technical Report SS-98-92, pages 110--114. AAAI, AAAI Press, 1998.
 
12
D. Perret and N. Emery. Understanding the intentions of others from visual signals: neurophysiological evidence. Cahiers de Psychologie Cognitive, 13:683-694, 1994.
 
13
14
 
15
A. Waibel, M. Bett, M. Finke, and R. Stiefelhagen. Meeting browser: Tracking and summarizing meetings. In D. E. M. Penrose, editor, Proceedings of the Broadcast News Transcription and Understanding Workshop, pages 281--286, Lansdowne, Virginia, February. 8-11 1998. DARPA, Morgan Kaufmann.
 
16

CITED BY  9

Collaborative Colleagues:
Rainer Stiefelhagen: colleagues
Jie Yang: colleagues
Alex Waibel: colleagues