|
ABSTRACT
Human-Centered Computing (HCC) is a set of methodologies that apply to any field that uses computers, in any form, in applications in which humans directly interact with devices or systems that use computer technologies. This tutorial takes a holistic view on the research issues and applications of Human-Centered Multimedia Systems focusing on three main areas: (1)multimodal interaction: visual (body, gaze, gesture) and audio (emotion)analysis; (2) image databases, indexing, and retrieval: context modeling, cultural issues, and machine learning for user-centric approaches; (3)multimedia data: conceptual analysis at different levels (feature, cognitive, and affective). This paper gives a brief overview of the areas covered in the tutorial.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
A. T. Duchowski, "A Breadth-First Survey of Eye Tracking Applications," Behavior Research Methods, Instruments, and Computing, 34(4):455--70, 2002.
|
| |
2
|
J. Flanagan and T. S. Huang, Special Issue on Human-computer Multimodal Multimodal Interface, Proceedings of the IEEE, 91(9), 2003.
|
| |
3
|
A. Hanjalic and L-Q. Xu, "Affective video content representation and modeling," IEEE Trans. on Multimedia, 7(1):143--154, 2005.
|
| |
4
|
|
 |
5
|
|
| |
6
|
A. Jaimes, D. Gatica-Perez, N. Sebe, and T. S. Huang (eds.) IEEE Computer Magazine special issue on Human-Centered Computing, Vol. 40, No. 5, May 2007.
|
| |
7
|
A. Jaimes and S.-F. Chang, "A Conceptual Framework for Indexing Visual Information at Multiple Levels", SPIE Internet Imaging, Vol. 3964, pp. 2--15, 2000.
|
| |
8
|
T. P. Minka and R. W. Picard, "Interactive Learning using a 'Society of Models'," Pattern Recognition, 30(4), 1997.
|
| |
9
|
M. R. Naphade and T. S. Huang, "Extracting semantics from audio-visual content: the final frontier in multimedia retrieval," IEEE Trans. Neural Networks, 13(4):793--810, 2002.
|
| |
10
|
V. Pavlovic, A. Garg, and J. M. Rehg, "Boosted learning in dynamic Bayesian networks for multimodal speaker detection," Proceedings of the IEEE, 91(9):1355--1369, 2003.
|
 |
11
|
|
| |
12
|
|
| |
13
|
N. Sebe, I. Cohen, and T. S. Huang, "Multimodal emotion recognition," in Handbook of Pattern Recognition and Computer Vision, C. H. Chen and P. S. P. Wang eds, chapter 4.1, pp. 387--419, World Scientific, January 2005.
|
| |
14
|
|
 |
15
|
|
| |
16
|
Eric Brewer , Michael Demmer , Bowei Du , Melissa Ho , Matthew Kam , Sergiu Nedevschi , Joyojeet Pal , Rabin Patra , Sonesh Surana , Kevin Fall, The Case for Technology in Developing Regions, Computer, v.38 n.6, p.25-38, June 2005
[doi> 10.1109/MC.2005.204]
|
 |
17
|
|
| |
18
|
|
| |
19
|
D. Maynes-Aminzade, R. Pausch, and S. Seitz, "Techniques for interactive audience participation," ICMI 2002
|
|