| Collaborative learning for image and video annotation |
| Full text |
Pdf
(1.03 MB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval
table of contents
Vancouver, British Columbia, Canada
SESSION: Video retrieval and concept detection
table of contents
Pages 443-450
Year of Publication: 2008
ISBN:978-1-60558-312-9
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 16, Downloads (12 Months): 148, Citation Count: 0
|
|
|
ABSTRACT
Classical machine learning methods, such as Support Vector Machines, by taking each concept detection as an independent classification problem, can not achieve a sound performance for image and video annotation due to the overfitting problems. Thus, some prior knowledge is required to assist the learning of independent concept detectors, e.g. some concepts look much more alike to each other. In this paper, we assume that visually similar concepts should share resembled detectors. Based on the assumption, Collaborative Learning is proposed, to incorporate cross-concept collaborations into the joint learning of similar detectors over related concepts. Besides the collaborations, different concepts should also perform discriminations for classifying each other. To benefit from different trade-offs between collaboration and discrimination, we propose Multi-Granularity Boosting strategy, where each granularity corresponds to a specific balance between collaboration and discrimination for Collaborative Learning. The ultimate concept detector is an additive model that combines classifiers under different collaboration granularities together. Evaluations on both image and video annotation benchmark demonstrate that our method achieves a superior performance over independent annotation.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Trecvid. http://www-nlpir.nist.gov/projects/trecvid/.
|
| |
2
|
A. Amir and et al. Ibm research trecvid-2003 video retrieval system. In TRECVID Proceedings, 2003.
|
| |
3
|
N. Aronszajn. Theory of reproducing kernels. Trans. Am. Math. Soc., (68):337--404, 1950.
|
| |
4
|
|
| |
5
|
|
| |
6
|
S.-F. Chang and et al. Columbia university trecvid-2006 video search and high-level feature extraction. In TRECVID Proceedings, 2006.
|
 |
7
|
|
| |
8
|
J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: a statistical view of boosting. In Annals of statistics, 2000.
|
| |
9
|
A. Hauptmann, M.-Y. Chen, and M. Christel. Confounded expectations: Informedia at trecvid 2004. In TRECVID Proceedings, 2004.
|
 |
10
|
|
| |
11
|
P. Moreno, P. Ho, and N. Vasconcelos. A kullback-leibler divergence based kernel for svm classification in multimedia applications. In NIPS, 2003.
|
| |
12
|
M. R. Naphade. Statistical techniques in video data management. In IEEE Workshop on Multimedia Signal Processing, 2002.
|
| |
13
|
M. R. Naphade and et al. A light scale concept ontology for multimedia. Technical report, 2005.
|
| |
14
|
M. R. Naphade, I. Kozintsev, and T. S. Huang. Factor graph framework for semantic video indexing. IEEE Trans. on CSVT, 12(1), Jan. 2002.
|
 |
15
|
Guo-Jun Qi , Xian-Sheng Hua , Yong Rui , Jinhui Tang , Tao Mei , Hong-Jiang Zhang, Correlative multi-label video annotation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291245]
|
| |
16
|
|
| |
17
|
V. N. Vapnik. Statistical Learning Theory.Wiley-Interscience, 1998.
|
 |
18
|
Lei Wu , Xian-Sheng Hua , Nenghai Yu , Wei-Ying Ma , Shipeng Li, Flickr distance, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
[doi> 10.1145/1459359.1459364]
|
 |
19
|
|
| |
20
|
A. Yanagawa, S.-F. Chang, L. Kennedy, andW. Hsu. Columbia university's baseline detectors for 374 lscom semantic visual concepts. Columbia University ADVENT Technical Report, (222--2006--8), 2007.
|
 |
21
|
|
|