|
ABSTRACT
We propose a new method for automated large scale gathering of Web images relevant to specified concepts. Our main goal is to build a knowledge base associated with as many concepts as possible for large scale object recognition studies. A second goal is supporting the building of more accurate text-based indexes for Web images. In our method, good quality candidate sets of images for each keyword are gathered as a function of analysis of the surrounding HTML text. The gathered images are then segmented into regions, and a model for the probability distribution of regions for the concept is computed using an iterative algorithm based on the previous work on statistical image annotation. The learned model is then applied to identify which images are visually relevant to the concept implied by the keyword. Implicitly, which regions or the images are relevant is also determined. Our experiments reveal that the new method performs much better than Google Image Search and a simple method based on more standard content based image retrieval methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
 |
2
|
|
| |
3
|
S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.
|
| |
4
|
|
| |
5
|
|
 |
6
|
|
| |
7
|
R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In Proc. of IEEE Computer Vision and Pattern Recognition, volume 2, pages 264--271, 2003.
|
| |
8
|
R. Fergus, P. Perona, and A. Zisserman. A visual category filter for google images. In Proc. of European Conference on Computer Vision, pages 242--255, 2004.
|
| |
9
|
|
| |
10
|
|
| |
11
|
M. R. Naphade, S. Basu, J. R. Smith, C. Y. Lin, and B. Tseng. Modeling semantic concepts to support query by keywords in video. In Proc. of IEEE Intl. Conference on Image Processing, pages I--145--148, 2002.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
K. Yanai. Image collector: An image-gathering system from the World-Wide Web employing keyword-based search engines. In Proc. of IEEE International Conference on Multimedia and Expo, pages 704--707, 2001.
|
 |
17
|
|
CITED BY 9
|
|
|
|
|
|
|
|
|
|
|
Tim Berners-Lee , Wendy Hall , James A. Hendler , Kieron O'Hara , Nigel Shadbolt , Daniel J. Weitzner, A framework for web science, Foundations and Trends in Web Science, v.1 n.1, p.1-130, January 2006
|
|
|
|
|
|
|
|
|
Yongqing Sun , Satoshi Shimada , Yukinobu Taniguchi , Akira Kojima, A novel region-based approach to visual concept modeling using web images, Proceeding of the 16th ACM international conference on Multimedia, October 26-31, 2008, Vancouver, British Columbia, Canada
|
|
|
Julien Ah-Pine , Marco Bressan , Stephane Clinchant , Gabriela Csurka , Yves Hoppenot , Jean-Michel Renders, Crossing textual and visual content in different application scenarios, Multimedia Tools and Applications, v.42 n.1, p.31-56, March 2009
|
|
|
|
|