| |
1
|
. Barzelay and Y. Schechner. "Harmony in Motion". In Proc. CVPR, pp.1--8, 2007.
|
| |
2
|
. Darrell and J. Fisher III. "Speaker Association with Signal-Level Audiovisual Fusion". IEEE Trans. on Multimedia, 6(3):406--413, 2004.
|
| |
3
|
. Driver. "Enhancement of Selective Listening by Illusory Mislocation of Speech Sounds due to Lip-Reading", Nature, 381:66--68, 1996.
|
| |
4
|
. Hershey and J. R. Movellan. "Audio Vision: Using Audiovisual Synchrony to Locate Sounds". In Proc. NIPS, pp.813--819, 1999.
|
| |
5
|
. Kidron, Y. Schechner, and M. Elad. "Pixels that Sound". In Proc. CVPR, pp.88--95, 2005.
|
| |
6
|
. Liu and Y. Sato. "Finding Speaker Face Region by Audiovisual Correlation". In Proc. ECCV Workshop, pp.1--12, 2008.
|
| |
7
|
. Liu and Y. Sato. "Recovering Audio-to-Video Synchronization by Audiovisual Correlation Analysis". In Proc. ICPR, pp.1--4, 2008.
|
| |
8
|
. Lucas and T. Kanade. "An Iterative Image Registration Technique with an Application to Stereo Vision". In Proc. Int'l Joint Conf. on Artificial Intelligence, pp.674--679, 1981.
|
| |
9
|
. Monaci and P. Vandergheynst. "Audiovisual Gestalts". In Proc. CVPR Workshop on Perceptual Organization in Computer Vision, pp.1--8, 2006.
|
| |
10
|
. O'Donovan, R. Duraiswami, and J. Neumann. "Microphone Arrays as Generalized Cameras for Integrated Audio Visual Processing". In Proc. CVPR, 1--8, 2007.
|
| |
11
|
. Patterson, S. Gurbuz, Z. Tufekci, and J. Gowdy. "Moving-Talker, Speaker-Independent Feature Study and Baseline Results using the Cuave Multimodal Speech Corpus". EURASIP J. on Applied Signal Processing, 2002(11):1189--1201, 2002.
|
| |
12
|
. Rabiner and B. Juang. Fundamentals of Speech Recognition. Prentice Hall, 1993.
|
| |
13
|
. Shannon. "Prediction and entropy of printed English". The Bell System Technical Journal, 30:50--64, 1951.
|
| |
14
|
. Shechtman and M. Irani. "Space-Time Behaviour-Based Correlation". Trans. on Pattern Analysis and Machine Intelligence, 29(11):2045--2056, 2007.
|