|
ABSTRACT
This paper presents a generic Virtual Content Insertion (VCI) system based on visual attention analysis. VCI is an emerging application of video analysis and has been used in video augmentation and advertisement insertion. There are three critical issues for a VCI system: when (time), where (place) and how (method) to insert the Virtual Content (VC) into the video. Our system selects the insertion time and place by performing temporal and spatial attention analysis, which predicts the attention change along time and the attended region over space. In order to enable the inserted VC to be noticed by audience while not to interrupt the audience's viewing experience to the original content, the VC should be inserted at the time when the video content attracts much audience attention and at the place where attracts less. Dynamic insertion is performed by using Global Motion Estimation (GME) and affine transformation. Our VCI system is able to obtain an optimal balance between the notice of the VC by audience and disruption of viewing experience to the original content. Extensive subjective evaluations based on user study on the VCI result have verified the effectiveness of the system.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Xinguo Yu , Xin Yan , Tran Thi Phuong Chi , Loong Fah Cheong, Inserting 3D projected virtual content into broadcast tennis video, Proceedings of the 14th annual ACM international conference on Multimedia, October 23-27, 2006, Santa Barbara, CA, USA
[doi> 10.1145/1180639.1180767]
|
| |
2
|
K. Wan, C. Xu, "Automatic Content Placement in Sports Highlights", IEEE International Conference on Multimedia & Expo, pp: 1893--1896, 2006.
|
| |
3
|
C. Xu, K. W. Wan, S. H. Bui, Q. Tian, "Implanting Virtual Advertisement into Broadcast Soccer Video", Pacific-Rim Conference on Multimedia, pp: 264--271, 2004.
|
 |
4
|
|
| |
5
|
M. Tamir, et al., "Method and Apparatus for Automatic Electronic Replacement of Billboards in a Video Image", US Patent 6,292,227, 2001.
|
| |
6
|
S. Deshpande, et al., "Method and Apparatus for Including Virtual ADs in Video Presentations", US Patent, 7,158,666, 2007.
|
 |
7
|
|
| |
8
|
H. Liu, S. Jiang, Q. Huang and C. Xu. "Lower Attentive Region Detection for Virtual Content Insertion", IEEE International Conference on Multimedia & Expo, 2008.
|
| |
9
|
S. Das, et al., "Method of Tracking Scene Motion for Live Video Insertion Systems", US Patent 5,808,695, 1998.
|
| |
10
|
Kreitman, et al., "Method and System for Perspectively Distorting an Image and Implanting Same into a Video Stream", US Patent 5,731,846, 1998.
|
| |
11
|
Y.-F. Ma, X.-S. Hua, L. Lu, H.-J. Zhang. "A Generic Framework of User Attention Model and Its Application in Video Summarization", IEEE Trans on Multimedia, Vol. 7, No. 5, pp: 907--919, 2005.
|
 |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
Huiying Liu , Shuqiang Jiang , Qingming Huang , Changsheng Xu , Wen Gao, Region-based visual attention analysis with its application in image browsing on small displays, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291298]
|
 |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
T. N. Topper, "Selection Mechanisms in Human and Machine Vision", Ph.D. Thesis, University of Waterloo, 1991.
|
| |
20
|
N. D. B. Bruce, "Features That Draw Visual Attention: an Information Theoretic Perspective", Neurocomputing, Vol. 65-66, pp: 125--133, 2005.
|
| |
21
|
María T. López , Antonio Fernández-Caballero , Miguel A. Fernández , José Mira , Ana E. Delgado, Visual surveillance by dynamic visual attention method, Pattern Recognition, v.39 n.11, p.2194-2211, November, 2006
[doi> 10.1016/j.patcog.2006.04.018]
|
| |
22
|
|
| |
23
|
L. Itti. "Automatic Foveation for Video Compression Using a Neurobiological Model of Visual Attention", IEEE Transactions on Image Processing, Vol. 13, No. 10, pp: 1304--1318, 2004.
|
| |
24
|
G. Qiu, X. Gu, Z. Chen, Q. Chen and C. Wang, "An Information Theoretic Model of Spatiotemporal Visual Saliency", IEEE International Conference on Multimedia & Expo, pp: 1806--1809, 2007.
|
| |
25
|
J. You, G. Liu, L. Sun, H. Li, "A Multiple Visual Models Based Perceptive Analysis Framework for Multilevel Video Summarization", IEEE Trans. Circuits and Systems for Video Technology, Vol. 17, No. 3, pp: 273--285, 2007.
|
| |
26
|
C. Liu, H. Liu, S. Jiang, Q. Huang, Y. Zheng and W. Zhang: "JDL at Trecvid 2006 Shot Boundary Detection", TRECVID 2006 Workshop (2006).
|
| |
27
|
R. E. Soodak, "Two-dimensional modeling of visual receptive fields using Gaussian subunits", Proc. Antl. Acad, Sci. USA, Vol. 83, pp: 9259--9263, 1986.
|
| |
28
|
E. T. Rolls, N. C. Aggelopoulos, and F. Zheng, "Effective size of receptive fields of inferior temporal visual cortex neurons in natural scenes", The Journal of Neuroscience, Vol. 23, No. 1, pp:339--348, 2003.
|
| |
29
|
L.-Y. Duan, M. Xu, Q. Tian, C.-S. Xu, J. S. Jin, "A Unified Framework for Semantic Shot Classification in Sports Video", IEEE Trans on Multimedia, Vol. 7, No. 6, pp: 1066--1083, 2005.
|
| |
30
|
F. Dufaux and J. Konrad, "Efficient, robust, and fast global motion estimation for video coding," IEEE Trans. Image Processing, vol. 9, no. 3, pp. 497--501, 2000.
|
| |
31
|
X. Chen, J. Yang, J. Zhang, and A. Waibel. "Automatic Detection and Recognition of Signs from Natural Scenes", IEEE Transactions on Image Processing, Vol. 13, No.1, pp: 87--99, 2004.
|
CITED BY
|
|
Jinlian Guo , Tao Mei , Falin Liu , Xian-Sheng Hua, AdOn: an intelligent overlay video advertising system, Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, July 19-23, 2009, Boston, MA, USA
|
|