ACM Home Page
Please provide us with feedback. Feedback
VideoMule: a consensus learning approach to multi-label classification from noisy user-generated videos
Full text PdfPdf (900 KB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Short papers session 2: content analysis and HCM table of contents
Pages 721-724  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Chandrasekar Ramachandran  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Rahul Malik  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Xin Jin  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Jing Gao  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Klara Nahrstedt  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Jiawei Han  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 9,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631397
What is a DOI?

ABSTRACT

With the growing proliferation of conversational media and devices for generating multimedia content, the Internet has seen an expansion in websites catering to user-generated media. Most of the user-generated content is multimodal in nature as it has videos, audio, text (in the form of tags), comments and so on. Content analysis is a challenging problem on this type of media since it is noisy, unstructured and unreliable. In this paper we propose VideoMule, a consensus learning approach for multi-label video classification from noisy user-generated videos. In our scheme, we train classification and clustering algorithms on individual modes of information such as user comments, tags, video features and so on. We then combine the results of trained classifiers and clustering algorithms using a novel heuristic consensus learning algorithm which as a whole performs better than each individual learning model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
S. Baluja, R. Seth, D. Sivakumar, Y. Jing, J. Yagnik, S. Kumar, D. Ravichandran, and M. Aly. Video suggestion and discovery for youtube: taking random walks through the view graph. In WWW, pages 895--904, 2008.
 
2
C. M. Bishop. Chapter 8: Graphical models. In Pattern Recognition and Machine Learning. Springer., pages 359--418, 2006.
 
3
X. Cheng, C. Dale, and J. Liu. Understanding the characteristics of internet short video sharing: Youtube as a case study. In CoRR abs, 2007.
 
4
K. Ganchev, J. Graca, J. Blitzer, and B. Taskar. Multi-view learning over structured and non-identical outputs. In UAI, pages 204--211, 2008.
 
5
J. Gao, W. Fan, Y. Sun, and J. Han. Heterogeneous source consensus learning via decision propagation and negotiation. In KDD, pages 339--348, 2009.
 
6
J. Han and M. Kamber. Data mining: Concepts and techniques. In Morgan Kauffman, 2000.
 
7
A. Klaser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In British Machine Vision Conference 2008.
 
8
L. Liu, Y. Rui, L.-F. Sun, B. Yang, J. Zhang, and S.-Q. Yang. Topic mining on web-shared videos. In ICASSP, pages 2145--2148, 2008.
 
9
L. Liu, L. Sun, Y. Rui, Y. Shi, and S. Yang. Web video topic discovery and tracking via bipartite graph reinforcement model. In WWW, pages 1009--1018, 2008.
 
10
D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Scholkopf. Learning with local and global consistency. In NIPS, pages 321--328, 2004.