Learning to tag
Source
International World Wide Web Conference
Proceedings of the 18th international conference on World wide web
Madrid, Spain
SESSION: Rich media/session: tagging and clustering
Pages 361-370
Year of Publication: 2009
ISBN: 978-1-60558-487-4
Authors
Lei Wu, MOE-MS KeyLab of MCC, Dept. of EEIS, University of Science and Technology of China, Hefei, China
Linjun Yang, Microsoft Research Asia, Beijing, China
Nenghai Yu, MOE-MS KeyLab of MCC, Dept. of EEIS, University of Science and Technology of China, Hefei, China
Xian-Sheng Hua, Microsoft Research Asia, Beijing, China
Bibliometrics
Downloads (6 Weeks): 68, Downloads (12 Months): 295, Citation Count: 1
ABSTRACT
Social tagging provides valuable information for large-scale web image retrieval. It is ontology-free and easy to obtain; however, irrelevant tags appear frequently, and users typically do not tag every semantic object in an image, a problem known as semantic loss. To reduce noise and compensate for semantic loss, tag recommendation has been proposed in the literature. However, current recommendation methods simply rank related tags using a single modality, tag co-occurrence over the whole dataset, and ignore other modalities such as visual correlation. This paper proposes a multi-modality recommendation method based on both tag and visual correlation, and formulates tag recommendation as a learning problem. Each modality is used to generate a ranking feature, and the RankBoost algorithm is applied to learn an optimal combination of these ranking features from the different modalities. Experiments on Flickr data demonstrate the effectiveness of this learning-based multi-modality recommendation strategy.
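The idea of combining per-modality ranking features with RankBoost can be illustrated with a minimal sketch. It is not the authors' implementation: the threshold weak rankers, the helper names (`weak`, `rankboost`, `score`), and the toy feature values are all assumptions made for illustration. Each candidate tag carries two hypothetical ranking features, a tag co-occurrence score and a visual correlation score, and the booster learns a weighted combination from pairwise preferences.

```python
import math

def weak(scores, k, theta):
    """Threshold weak ranker: 1.0 if ranking feature k exceeds theta, else 0.0."""
    return 1.0 if scores[k] > theta else 0.0

def rankboost(features, pairs, rounds=5):
    """Simplified RankBoost over threshold weak rankers (illustrative only).

    features: dict mapping a candidate tag to its list of ranking-feature scores.
    pairs: list of (lower, higher) tags, meaning `higher` should outrank `lower`.
    Returns a list of (alpha, feature_index, threshold) triples.
    """
    n_feat = len(next(iter(features.values())))
    # Candidate thresholds: every observed value of every ranking feature.
    candidates = [(k, theta)
                  for k in range(n_feat)
                  for theta in sorted({v[k] for v in features.values()})]
    weights = [1.0 / len(pairs)] * len(pairs)  # uniform distribution over pairs
    model = []
    for _ in range(rounds):
        best_r, best_k, best_theta = 0.0, None, None
        for k, theta in candidates:
            # r measures how well this weak ranker agrees with the weighted pairs.
            r = sum(w * (weak(features[hi], k, theta) - weak(features[lo], k, theta))
                    for w, (lo, hi) in zip(weights, pairs))
            if abs(r) > abs(best_r):
                best_r, best_k, best_theta = r, k, theta
        if best_k is None:  # no weak ranker carries any signal
            break
        r = max(min(best_r, 1 - 1e-6), -1 + 1e-6)  # keep the log finite
        alpha = 0.5 * math.log((1 + r) / (1 - r))
        model.append((alpha, best_k, best_theta))
        # Down-weight pairs the chosen ranker orders correctly, then renormalise.
        weights = [w * math.exp(alpha * (weak(features[lo], best_k, best_theta)
                                         - weak(features[hi], best_k, best_theta)))
                   for w, (lo, hi) in zip(weights, pairs)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model

def score(model, scores):
    """Final ranking score: weighted sum of the selected weak rankers."""
    return sum(alpha * weak(scores, k, theta) for alpha, k, theta in model)

# Toy data (made up): [tag co-occurrence score, visual correlation score].
features = {
    "beach":  [0.9, 0.2],
    "sea":    [0.3, 0.8],
    "sand":   [0.7, 0.7],
    "car":    [0.4, 0.3],
    "office": [0.2, 0.4],
}
relevant = ["beach", "sea", "sand"]
irrelevant = ["car", "office"]
pairs = [(lo, hi) for lo in irrelevant for hi in relevant]

model = rankboost(features, pairs)
ranking = sorted(features, key=lambda t: score(model, features[t]), reverse=True)
print(ranking)
```

In this toy setup neither feature alone separates the relevant tags from the irrelevant ones ("sea" scores low on co-occurrence, "beach" scores low on visual correlation), so the booster has to draw on both modalities, which is the motivation the abstract gives for multi-modality recommendation.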
CITED BY
Dong Liu, Xian-Sheng Hua, Linjun Yang, Meng Wang, Hong-Jiang Zhang, Tag ranking, Proceedings of the 18th international conference on World wide web, April 20-24, 2009, Madrid, Spain