ACM Home Page
Please provide us with feedback. Feedback
Annotating photo collections by label propagation according to multiple similarity cues
Full text PdfPdf (1.67 MB)
Source
International Multimedia Conference archive
Proceeding of the 16th ACM international conference on Multimedia table of contents
Vancouver, British Columbia, Canada
SESSION: Content track C3: image annotation and tagging table of contents
Pages 121-130  
Year of Publication: 2008
ISBN:978-1-60558-303-7
Authors
Liangliang Cao  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Jiebo Luo  Eastman Kodak Company, Rochester, NY, USA
Thomas S. Huang  University of Illinois at Urbana-Champaign, Urbana, IL, USA
Sponsors
ACM: Association for Computing Machinery
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 197,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1459359.1459376
What is a DOI?

ABSTRACT

This paper considers the emerging problem of annotating personal photo collections that are taken by digital cameras and may have been subsequently organized by customers. Unlike the images from the web searching engine or commercial image banks (e.g. the Corel database), the photos in the same personal collection are related to each other in time, location, and content. Advanced technologies can record the GPS coordinates for each photo, and thus provide a richer source of context to model and enforce the correlation between the photos in the same collection. Recognizing the well-known limitations ("semantic gap") of visual recognition algorithms, we exploit the correlation between the photos to enhance the annotation performance. In our approach, high-confidence annotation labels are first obtained for certain photos and then propagated to the remaining photos in the same collection, according to time, location, and visual proximity (or similarity). A novel generative probabilistic model is employed, which outperforms the pervious linear propagation scheme. Experimental results have shown the advantages of the proposed annotation scheme.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Flickr, http://www.flickr.com
 
2
Picasa Web Album, http://picasaweb.google.com
 
3
Corel stock photo database, http://www.corel.com
 
4
 
5
A. K. Jain and A. Vailaya, Image retrieval using color and shape, Pattern Recognition, 29 (8), 1233--1244, 1996.
 
6
Y. Rui, T. S. Huang, S Mehrotra, Content-based image retrieval with relevance feedback in MARS, IEEE Proc. Image Processing, 1997.
 
7
A. Blum, T. Mitchell, Combining labeled and unlabeled data with co-training, Annual Conference on Learning Theory, 1998.
8
 
9
 
10
 
11
X. Zhu, Z. Ghahramani, and J. Lafferty, Semi-supervised learning using Gaussian fields and harmonic functions, International Conference on Machine Learning, 2003.
 
12
 
13
D. Zhou, O. Bousquet, T. N. Lal, J. Weston and B. Scholkopf, Learning with Local and Global Consistency. Neural Information Processing Systems, 2004.
14
15
16
17
18
 
19
 
20
 
21
M. Boutell, and J. Luo. Beyond pixels: Exploiting camera metadata for photo classification. Pattern Recognition 38(6): 935--946, 2005.
 
22
Y. Aytar, O. B. Orhan, and M. Shah. Improving Semantic Concept Detection and Retrieval Using Contextual Estimates, IEEE International Conference on Multimedia & Expo, 2007.
 
23
 
24
 
25
 
26
 
27
S. F. Chang, et. al., Columbia Univ. TRECVID-2005 Video Search and High-Level Feature Extraction, Proc. TREC Video Retrieval Evaluation, 2005.
 
28
M. Campbell, et. al., IBM Research TRECVID-2006 Video Retrieval System, Proc. TREC Video Retrieval Evaluation, 2006.
 
29
M. Boutell and J. Luo, Beyond Pixels: Exploiting camera metadata for photo classification, Pattern Recognition 38(6): 935--946, 2005.

Collaborative Colleagues:
Liangliang Cao: colleagues
Jiebo Luo: colleagues
Thomas S. Huang: colleagues