ACM Home Page
Please provide us with feedback. Feedback
Learning to tag
Full text PdfPdf (1.23 MB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Rich media/session: tagging and clustering table of contents
Pages 361-370  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Lei Wu  MOE-MS KeyLab of MCC, Dept. of EEIS, University of Science and Technology of China, Hefei, China
Linjun Yang  Microsoft Research Asia, Beijing, China
Nenghai Yu  MOE-MS KeyLab of MCC, Dept. of EEIS, University of Science and Technology of China, Hefei, China
Xian-Sheng Hua  Microsoft Research Asia, Beijing, China
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 68,   Downloads (12 Months): 295,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526758
What is a DOI?

ABSTRACT

Social tagging provides valuable and crucial information for large-scale web image retrieval. It is ontology-free and easy to obtain; however, irrelevant tags frequently appear, and users typically will not tag all semantic objects in the image, which is also called semantic loss. To avoid noises and compensate for the semantic loss, tag recommendation is proposed in literature. However, current recommendation simply ranks the related tags based on the single modality of tag co-occurrence on the whole dataset, which ignores other modalities, such as visual correlation. This paper proposes a multi-modality recommendation based on both tag and visual correlation, and formulates the tag recommendation as a learning problem. Each modality is used to generate a ranking feature, and Rankboost algorithm is applied to learn an optimal combination of these ranking features from different modalities. Experiments on Flickr data demonstrate the effectiveness of this learning-based multi-modality recommendation strategy.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
E. Akbas and F. Yarman Vural. Automatic image annotation by ensemble of visual descriptors. CVPR'07., June 2007.
2
 
3
4
5
6
 
7
8
9
10
11
 
12
J. Mairal, F. Bach, J. Ponce, G. Sapiro, and A. Zisserman. Supervised dictionary learning, 2008.
 
13
14
15
16
17
18
 
19
C. G. M. Snoek, B. Huurnink, L. Hollink, M. D. Rijke, G. Schreiber, and M. Worring. Adding semantics to detectors for video retrieval. IEEE Transactions on Multimedia, 9, 2007.
 
20
C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Content-based image annotation refinement. Proceedings of CVPR 07, 2007.
21
22
23
 
24
 
25
Y.-T. Zheng, S.-Y. Neo, T.-S. Chua, and Q. Tian. Visual synset: towards a higher-level visual representation. In Proceedings of CVPR'08, 2008.


Collaborative Colleagues:
Lei Wu: colleagues
Linjun Yang: colleagues
Nenghai Yu: colleagues
Xian-Sheng Hua: colleagues