ACM Home Page
Please provide us with feedback. Feedback
Leveraging probabilistic season and location context models for scene understanding
Full text PdfPdf (805 KB)
Source
Conference On Image And Video Retrieval archive
Proceedings of the 2008 international conference on Content-based image and video retrieval table of contents
Niagara Falls, Canada
POSTER SESSION: Poster/reception table of contents
Pages 169-178  
Year of Publication: 2008
ISBN:978-1-60558-070-8
Authors
Jie Yu  Kodak Research Labs, Rochester, NY, USA
Jiebo Luo  Kodak Research Labs, Rochester, NY, USA
Sponsors
SIGIR: ACM Special Interest Group on Information Retrieval
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 97,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1386352.1386379
What is a DOI?

ABSTRACT

Recent research has shown the power of context-aware scene understanding in bridging the semantic gap between high-level semantic concepts and low-level image features. In this paper, we present a new method to exploit nonvisual context information from the season and location proximity in which pictures were taken to facilitate region (object) annotation in consumer photos. Our method does not require precise time and location from the capture device or user input. Instead, it learns from rough location (e.g., states in the US) and time (e.g., seasons) information, which can be obtained through picture metadata automatically or through minimal user input (e.g., grouping). In addition, the visual context within the image is obtained by analyzing the spatial relationships between different regions (objects) in the scene. Both visual and nonvisual context information are fused using a probabilistic graphical model to improve the accuracy of object region recognition. Our method has been evaluated on a database that consists of over 10,000 regions in more than 1000 images collected from both the Web and consumers. Experimental results show that incorporating the season and location context significantly improves the performance of region recognition.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Berretti, S., Bimbo, A. D., and Vicario, E. 2002. Spatial arrangement of color in retrieval by visual similarity. Pattern Recog. 35 (2002) 1661--1674.
 
2
Boutell, M., Luo, J., and Brown, C. M. 2004. Learning spatial configuration models using modified Dirichlet priors. In Proceedings of the Workshop on Statistical Relational Learning (in Conjunction with ICML) (2004).
 
3
Boutell, M., Choudhury, A., Luo, J., and Brown, C. M. 2006. Using semantic features for scene classification: How good do they need to be? In Proceedings of the IEEE International Conference on Multimedia and Exposition. (Jul. 2006).
 
4
Boutell, M., Luo, J., and Brown, C. 2007. Scene parsing using region-based generative models. IEEE Trans. Multimedia.
 
5
 
6
 
7
8
 
9
Flickr, http://www.flickr.com
 
10
 
11
Kohavi, R. 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the International Joint Conference on Artificial Intelligence (1995).
 
12
Kschischang, F. R., Frey, B. H., and Loeliger, H.-A. 2001. Factor graphs and the sum-product algorithm. IEEE Trans. Info. Theory. 47(2) (Feb. 2001) 498--519.
 
13
 
14
 
15
Murphy, K., Torralba, A., and Freeman, W. T. 2003. Using the forest to see the trees: A graphical model relating features, objects, and scenes. In Proceedings of the Neural Information Processing Systems (2003).
 
16
Naphade, M. R. and Huang, T. S. 2001. A probabilistic framework for semantic video indexing, filtering, and retrieval. IEEE Trans. Multimedia 3(1) (Mar. 2001) 141--151.
 
17
Naphade, M. and Huang, T. S. 2002. A factor graph framework for semantic video indexing. IEEE Trans. Circ. Syst. Video Technol. 12(1) (Jan. 2002).
 
18
Picasa, http://picasa.google.com/
 
19
Rui, Y., Huang, T. S., and Chang, S. F. 1999. Image retrieval: Current techniques, promising directions and open issues. J. Visual Commun. Image Represent. (1999).
 
20
 
21
Singhal, A., Luo, J., and Zhu, W. Probabilistic spatial context models for scene content understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Madison, WI, Jun. 2003).
 
22
 
23
 
24
Vailaya, A. and Jain, A. 2000. "Detecting sky and vegetation in outdoor images," In Proceedings of the SPIE. 3972 (2000).
 
25
Wang, W., Song, Y., and Zhang, A. 2002. Semantics retrieval by content and context of image regions. In Proceedings of the 15th International Conference on Vision Interface. (2002).
26