| Annotating photo collections by label propagation according to multiple similarity cues |
| Full text |
Pdf
(1.67 MB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 16th ACM international conference on Multimedia
table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C3: image annotation and tagging
table of contents
Pages 121-130
Year of Publication: 2008
ISBN:978-1-60558-303-7
|
|
Authors
|
|
Liangliang Cao
|
University of Illinois at Urbana-Champaign, Urbana, IL, USA
|
|
Jiebo Luo
|
Eastman Kodak Company, Rochester, NY, USA
|
|
Thomas S. Huang
|
University of Illinois at Urbana-Champaign, Urbana, IL, USA
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 197, Citation Count: 0
|
|
|
ABSTRACT
This paper considers the emerging problem of annotating personal photo collections that are taken by digital cameras and may have been subsequently organized by customers. Unlike the images from the web searching engine or commercial image banks (e.g. the Corel database), the photos in the same personal collection are related to each other in time, location, and content. Advanced technologies can record the GPS coordinates for each photo, and thus provide a richer source of context to model and enforce the correlation between the photos in the same collection. Recognizing the well-known limitations ("semantic gap") of visual recognition algorithms, we exploit the correlation between the photos to enhance the annotation performance. In our approach, high-confidence annotation labels are first obtained for certain photos and then propagated to the remaining photos in the same collection, according to time, location, and visual proximity (or similarity). A novel generative probabilistic model is employed, which outperforms the pervious linear propagation scheme. Experimental results have shown the advantages of the proposed annotation scheme.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Flickr, http://www.flickr.com
|
| |
2
|
Picasa Web Album, http://picasaweb.google.com
|
| |
3
|
Corel stock photo database, http://www.corel.com
|
| |
4
|
|
| |
5
|
A. K. Jain and A. Vailaya, Image retrieval using color and shape, Pattern Recognition, 29 (8), 1233--1244, 1996.
|
| |
6
|
Y. Rui, T. S. Huang, S Mehrotra, Content-based image retrieval with relevance feedback in MARS, IEEE Proc. Image Processing, 1997.
|
| |
7
|
A. Blum, T. Mitchell, Combining labeled and unlabeled data with co-training, Annual Conference on Learning Theory, 1998.
|
 |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
X. Zhu, Z. Ghahramani, and J. Lafferty, Semi-supervised learning using Gaussian fields and harmonic functions, International Conference on Machine Learning, 2003.
|
| |
12
|
|
| |
13
|
D. Zhou, O. Bousquet, T. N. Lal, J. Weston and B. Scholkopf, Learning with Local and Global Consistency. Neural Information Processing Systems, 2004.
|
 |
14
|
Jingrui He , Mingjing Li , Hong-Jiang Zhang , Hanghang Tong , Changshui Zhang, Manifold-ranking based image retrieval, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027531]
|
 |
15
|
Jing Liu , Mingjing Li , Wei-Ying Ma , Qingshan Liu , Hanqing Lu, An adaptive graph model for automatic image annotation, Proceedings of the 8th ACM international workshop on Multimedia information retrieval, October 26-27, 2006, Santa Barbara, California, USA
[doi> 10.1145/1178677.1178689]
|
 |
16
|
Hanghang Tong , Jingrui He , Mingjing Li , Changshui Zhang , Wei-Ying Ma, Graph based multi-modality learning, Proceedings of the 13th annual ACM international conference on Multimedia, November 06-11, 2005, Hilton, Singapore
[doi> 10.1145/1101149.1101337]
|
 |
17
|
Jingjing Liu , Wei Lai , Xian-Sheng Hua , Yalou Huang , Shipeng Li, Video search re-ranking via multi-graph propagation, Proceedings of the 15th international conference on Multimedia, September 25-29, 2007, Augsburg, Germany
[doi> 10.1145/1291233.1291279]
|
 |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
M. Boutell, and J. Luo. Beyond pixels: Exploiting camera metadata for photo classification. Pattern Recognition 38(6): 935--946, 2005.
|
| |
22
|
Y. Aytar, O. B. Orhan, and M. Shah. Improving Semantic Concept Detection and Retrieval Using Contextual Estimates, IEEE International Conference on Multimedia & Expo, 2007.
|
| |
23
|
|
| |
24
|
|
| |
25
|
|
| |
26
|
P. Quelhas , F. Monay , J.-M. Odobez , D. Gatica-Perez , T. Tuytelaars , L. Van Gool, Modeling Scenes with Local Descriptors and Latent Aspects, Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, p.883-890, October 17-20, 2005
[doi> 10.1109/ICCV.2005.152]
|
| |
27
|
S. F. Chang, et. al., Columbia Univ. TRECVID-2005 Video Search and High-Level Feature Extraction, Proc. TREC Video Retrieval Evaluation, 2005.
|
| |
28
|
M. Campbell, et. al., IBM Research TRECVID-2006 Video Retrieval System, Proc. TREC Video Retrieval Evaluation, 2006.
|
| |
29
|
M. Boutell and J. Luo, Beyond Pixels: Exploiting camera metadata for photo classification, Pattern Recognition 38(6): 935--946, 2005.
|
|