ABSTRACT
We consider the problem of clustering Web image search results. The results returned by an image search engine for a query generally span multiple topics, so organizing them into semantic clusters makes browsing easier for users. In this paper, we propose a hierarchical clustering method that combines visual, textual, and link analysis. A vision-based page segmentation algorithm partitions each web page into blocks, so that the textual and link information of an image can be accurately extracted from the block containing it. Block-level link analysis techniques are then used to construct an image graph, and spectral techniques are applied to find a Euclidean embedding of the images that respects the graph structure. Each image thus has three kinds of representation: a visual-feature-based representation, a textual-feature-based representation, and a graph-based representation. Using spectral clustering techniques, we cluster the search results into different semantic clusters. An image search example illustrates the potential of these techniques.
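The graph embedding and spectral clustering steps described above can be illustrated with a small sketch. This is not the authors' implementation: the six-node graph and its edge weights are hypothetical, the function name `spectral_bipartition` is invented for illustration, only a two-way split is shown instead of the full hierarchical method, and a simple power iteration stands in for a proper eigensolver. It computes a one-dimensional embedding from the normalized graph Laplacian and splits the nodes by sign.

```python
import math


def spectral_bipartition(W, iters=300):
    """Split a weighted undirected graph into two clusters.

    W is a symmetric adjacency matrix (list of lists) in which every
    node has positive degree. Returns (embedding, labels): one spectral
    coordinate per node and a 0/1 cluster label per node.
    """
    n = len(W)
    deg = [sum(row) for row in W]
    inv_sqrt = [1.0 / math.sqrt(d) for d in deg]

    # Normalized affinity S = D^{-1/2} W D^{-1/2}; eigenvalues lie in [-1, 1].
    S = [[inv_sqrt[i] * W[i][j] * inv_sqrt[j] for j in range(n)]
         for i in range(n)]

    # Trivial top eigenvector of S (eigenvalue 1), proportional to sqrt(degree).
    total = math.sqrt(sum(deg))
    top = [math.sqrt(d) / total for d in deg]

    # Power iteration on (S + I) / 2 with the trivial eigenvector projected
    # out; this converges to the second eigenvector of S.
    v = [float(i + 1) for i in range(n)]  # fixed start, so runs are repeatable
    for _ in range(iters):
        dot = sum(a * b for a, b in zip(v, top))
        v = [a - dot * b for a, b in zip(v, top)]
        v = [0.5 * (sum(S[i][j] * v[j] for j in range(n)) + v[i])
             for i in range(n)]
        length = math.sqrt(sum(a * a for a in v))
        v = [a / length for a in v]

    # Rescale to the generalized eigenvector of L y = lambda D y,
    # which serves as a 1-D Euclidean embedding of the nodes.
    embedding = [inv_sqrt[i] * v[i] for i in range(n)]
    labels = [1 if c > 0 else 0 for c in embedding]
    return embedding, labels


# Hypothetical image graph: two tightly linked triples of images
# joined by one weak link between image 2 and image 3.
W = [
    [0.0, 1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.0, 1.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 0.0],
]
embedding, labels = spectral_bipartition(W)
print(labels)
```

On this toy graph the sign of the embedding separates images 0 to 2 from images 3 to 5; which group receives label 1 depends on the starting vector. In a setting like the one the abstract describes, the graph coordinates would presumably be used alongside visual and textual features, and more than one eigenvector would be kept when more than two clusters are wanted.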