ACM Home Page
Please provide us with feedback. Feedback
Consumer video retargeting: context assisted spatial-temporal grid optimization
Full text PdfPdf (8.78 MB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Application track A2: context awareness table of contents
Pages 301-310  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Liang Shi  Beijing University of Posts and Telecommunications, Beijing, China
Jinqiao Wang  National Lab of Pattern Recognition,Institute of Automation, Chinese Academy of Sciences, Beijing, China
Lingyu Duan  Institute of Digital Media, School of EE&CS, Peking University, Beijing, China
Hanqing Lu  National Lab of Pattern Recognition,Institute of Automation, Chinese Academy of Sciences, Beijing, China
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 34,   Downloads (12 Months): 34,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631315
What is a DOI?

ABSTRACT

Pervasive multimedia devices require accurate video retargeting, especially in connected consumer electronics platforms. In this paper, we present a context assisted spatialtemporal grid scheme for consumer video retargeting. First, we parse consumer videos from low-level features to highlevel visual concepts, combining visual attention into a more accurate importance description. Then, a semantic importance map is built up representing the spatial importance and temporal continuity, which is incorporated with a 3D rectilinear grid scaleplate to map frames to the target display, thereby keeping the aspect ratio of semantically salient objects as well as the perceptual coherency. Extensive evaluations were done on two popular video genres, sports and advertisements. The comparison with state-of-the-art approaches on both images and videos have demonstrated the advantages of the proposed approach.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Technical report, comScore 2008.
 
2
A. Albiol, M. J. C. Full'a, and A. Albiol. Detection of tv commercials. In ICASSP, pages 541--544, 2004.
 
3
A. Albiol, M. J. C. Fulla, A. Albiol, and L. Torres. Commercials detection using hmms. In Int. Workshop on Image Analysis for Multimedia Interactive Services, 2004.
 
4
J. Assfalg, M. Bertini, C. Colombo, A. D. Bimbo, and W. Nunziati. Semantic annotation of soccer videos: automatic highlights identification. Comput. Vis. Image Underst., 2003.
 
5
S. Avidan and A. Shamir. Seam carving for content-aware image resizing. In ACM Trans. Graph, 2007.
 
6
P. Chang, M. Han, and Y. Gong. Extract highlights from baseball game video with hidden markov models. In ICIP, 2002.
 
7
L.-Y. Duan, J. Wang, Y. Zheng, J. S. Jin, H. Lu, and C. Xu. Segmentation, categorization, and identification of commercials from tv streams using multimodal analysis. In ACM Multimedia, 2006.
 
8
L.-Y. Duan, M. Xu, Q. Tian, and C. sheng Xu. Nonparametric color characterization using mean shift. In ACM Multimedia, 2003.
 
9
L.-Y. Duan, M. Xu, Q. Tian, C. Xu, and J. S. Jin. A unified framework for semantic shot classification in sports video. In IEEE Transactions on Multimedia, volume 7(6), 2005.
 
10
A. Ekin, A. M. Tekalp, and R. Mehrotra. Automatic soccer video analysis and summarization. IEEE Transactions on Image Processing, 2003.
 
11
X. Fan, X. Xie, H.-Q. Zhou, and W.-Y. Ma. Looking into video frames on small displays. In ACM Multimedia, 2003.
 
12
X.-S. Hua, L. Lu, and H.-J. Zhang. Robust learning-based tv commercial detection. In ICME, 2005.
 
13
C. Huang, H. Ai, Y. Li, and S. Lao. Vector boosting for rotation invariant multi-view face detection. In ICCV, 2005.
 
14
F. Li. Knowledge transfer in learning to recognize visual objects classes. In ICDL, 2006.
 
15
R. Lienhart, C. Kuhmunch, , and W. Effelsberg. On the detection and recognition of television commercials. In Int. Conf. on Multimedia Computing and Systems, 1997.
 
16
F. Liu and M. Gleicher. Video retargeting automating pan and scan. In Acm Multimedia, 2006.
 
17
T. Liu, J. Sun, N.-N. Zheng, X. Tang, and H.-Y. Shum. Learning to detect a salient object. In CVPR, 2007.
 
18
A. Loui, J. Luo, S.-F. Chang, D. Ellis, W. Jiang, L. Kennedy, K. Lee, and A. Yanagawa. Kodak's consumer video benchmark data set: concept definition and annotation. In ACM MIR, 2007.
 
19
D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 2004.
 
20
F. Megino, J. Sanchez, and V. Lopez. Virtual camera tools for an image2video application. In International Workshop on Image Analysis for Multimedia Interactive Services, 2008.
 
21
T. Mei, X.-S. Hua, and S. Li. Contextual in-image advertising. In ACM Multimedia, 2008.
 
22
T. Mei, X.-S. Hua, L. Yang, and S. Li. Videosense - towards effective online video advertising. In ACM Multimedia, 2007.
 
23
M. Mizutani, S. Ebadollahi, and S.-F. Chang. Commercial detection in heterogeneous video streams using fused multi-modal and temporal features. In ICASSP, 2005.
 
24
M. Rubinstein, A. Shamir, and S. Avidan. Improved seam carving for video retargeting. In Siggraph, 2008.
 
25
V. Setlur and S. Takagi. Automatic image retargetting. In MUM, 2005.
 
26
Y.-P. Tan, D. D. Saur, S. R. Kulkarni, and P. J. Ramadge. Rapid estimation of camera motion from compressed video with application to video annotation. IEEE Trans. on CSVT, 2000.
 
27
C. Tao, J. Jia, and H. Sun. Active window oriented dynamic video retargeting. In Workshop On Dynamical Vision,ICCV, 2007.
 
28
D. Thomas, D. Philippe, and N. Hermann. Pan, zoom, scan time-coherent, trained automatic video cropping. In CVPR, 2008.
 
29
L. Vincent and P. Soille. Watersheds in digital spaces: An efficient algorithm based on immersion simulations. In IEEE PAMI, 1991.
 
30
J.Wang, M. J. Reinders, R. L. Lagendijk, J. Lindenberg, and M. S.Kankanhalli. Video content representation on tiny devices. In ICME, 2004.
 
31
Y.-S. Wang, C.-L. Tai, O. Sorkine, and T.-Y. Lee. Optimized scale-and-stretch for image resizing. In ACM Trans. Graph, 2008.
 
32
L.Wolf, M. Guttmann, and D. Cohen-Or. Non-homogeneous content-driven video retargeting. In ICCV, 2007.
 
33
L. Xie, P. Xu, S.-F. Chang, A. Divakaran, and H. Sun. Structure analysis of soccer video with domain knowledge and hidden markov models. Pattern Recogn. Lett., 2004.
 
34
J. Yang, Y. G. Jiang, A. G. Hauptmann, and C. W. Ngo. Evaluating bag-of-visual-words representations in scene classification. In MIR, 2007.
 
35
H. Zhang, A. Kankanhalli, and S. Smoliar. Automatic partitioning of full-motion video. In ACM Multimedia Systems, 1993.