ACM Home Page
Please provide us with feedback. Feedback
A generic virtual content insertion system based on visual attention analysis
Full text PdfPdf (834 KB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Applications track A2: watch table of contents
Pages 379-388  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Huiying Liu  Chinese Academy of Sciences and China-Singapore Institute of Digital Media, Beijing, China
Shuqiang Jiang  Chinese Academy of Sciences, Beijing, China
Qingming Huang  Graduate University of Chinese Academy of Sciences and China-Singapore Institute of Digital Media, Beijing, China
Changsheng Xu  China-Singapore Institute of Digital Media and Chinese Academy of Sciences, Beijing, China
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 20,   Downloads (12 Months): 166,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459410
What is a DOI?

ABSTRACT

This paper presents a generic Virtual Content Insertion (VCI) system based on visual attention analysis. VCI is an emerging application of video analysis and has been used in video augmentation and advertisement insertion. There are three critical issues for a VCI system: when (time), where (place) and how (method) to insert the Virtual Content (VC) into the video. Our system selects the insertion time and place by performing temporal and spatial attention analysis, which predicts the attention change along time and the attended region over space. In order to enable the inserted VC to be noticed by audience while not to interrupt the audience's viewing experience to the original content, the VC should be inserted at the time when the video content attracts much audience attention and at the place where attracts less. Dynamic insertion is performed by using Global Motion Estimation (GME) and affine transformation. Our VCI system is able to obtain an optimal balance between the notice of the VC by audience and disruption of viewing experience to the original content. Extensive subjective evaluations based on user study on the VCI result have verified the effectiveness of the system.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
K. Wan, C. Xu, "Automatic Content Placement in Sports Highlights", IEEE International Conference on Multimedia & Expo, pp: 1893--1896, 2006.
 
3
C. Xu, K. W. Wan, S. H. Bui, Q. Tian, "Implanting Virtual Advertisement into Broadcast Soccer Video", Pacific-Rim Conference on Multimedia, pp: 264--271, 2004.
4
 
5
M. Tamir, et al., "Method and Apparatus for Automatic Electronic Replacement of Billboards in a Video Image", US Patent 6,292,227, 2001.
 
6
S. Deshpande, et al., "Method and Apparatus for Including Virtual ADs in Video Presentations", US Patent, 7,158,666, 2007.
7
 
8
H. Liu, S. Jiang, Q. Huang and C. Xu. "Lower Attentive Region Detection for Virtual Content Insertion", IEEE International Conference on Multimedia & Expo, 2008.
 
9
S. Das, et al., "Method of Tracking Scene Motion for Live Video Insertion Systems", US Patent 5,808,695, 1998.
 
10
Kreitman, et al., "Method and System for Perspectively Distorting an Image and Implanting Same into a Video Stream", US Patent 5,731,846, 1998.
 
11
Y.-F. Ma, X.-S. Hua, L. Lu, H.-J. Zhang. "A Generic Framework of User Attention Model and Its Application in Video Summarization", IEEE Trans on Multimedia, Vol. 7, No. 5, pp: 907--919, 2005.
12
 
13
 
14
15
16
 
17
 
18
 
19
T. N. Topper, "Selection Mechanisms in Human and Machine Vision", Ph.D. Thesis, University of Waterloo, 1991.
 
20
N. D. B. Bruce, "Features That Draw Visual Attention: an Information Theoretic Perspective", Neurocomputing, Vol. 65-66, pp: 125--133, 2005.
 
21
 
22
 
23
L. Itti. "Automatic Foveation for Video Compression Using a Neurobiological Model of Visual Attention", IEEE Transactions on Image Processing, Vol. 13, No. 10, pp: 1304--1318, 2004.
 
24
G. Qiu, X. Gu, Z. Chen, Q. Chen and C. Wang, "An Information Theoretic Model of Spatiotemporal Visual Saliency", IEEE International Conference on Multimedia & Expo, pp: 1806--1809, 2007.
 
25
J. You, G. Liu, L. Sun, H. Li, "A Multiple Visual Models Based Perceptive Analysis Framework for Multilevel Video Summarization", IEEE Trans. Circuits and Systems for Video Technology, Vol. 17, No. 3, pp: 273--285, 2007.
 
26
C. Liu, H. Liu, S. Jiang, Q. Huang, Y. Zheng and W. Zhang: "JDL at Trecvid 2006 Shot Boundary Detection", TRECVID 2006 Workshop (2006).
 
27
R. E. Soodak, "Two-dimensional modeling of visual receptive fields using Gaussian subunits", Proc. Antl. Acad, Sci. USA, Vol. 83, pp: 9259--9263, 1986.
 
28
E. T. Rolls, N. C. Aggelopoulos, and F. Zheng, "Effective size of receptive fields of inferior temporal visual cortex neurons in natural scenes", The Journal of Neuroscience, Vol. 23, No. 1, pp:339--348, 2003.
 
29
L.-Y. Duan, M. Xu, Q. Tian, C.-S. Xu, J. S. Jin, "A Unified Framework for Semantic Shot Classification in Sports Video", IEEE Trans on Multimedia, Vol. 7, No. 6, pp: 1066--1083, 2005.
 
30
F. Dufaux and J. Konrad, "Efficient, robust, and fast global motion estimation for video coding," IEEE Trans. Image Processing, vol. 9, no. 3, pp. 497--501, 2000.
 
31
X. Chen, J. Yang, J. Zhang, and A. Waibel. "Automatic Detection and Recognition of Signs from Natural Scenes", IEEE Transactions on Image Processing, Vol. 13, No.1, pp: 87--99, 2004.


Collaborative Colleagues:
Huiying Liu: colleagues
Shuqiang Jiang: colleagues
Qingming Huang: colleagues
Changsheng Xu: colleagues