ACM Home Page
Please provide us with feedback. Feedback
Hierarchical clustering of WWW image search results using visual, textual and link information
Full text PdfPdf (1.15 MB)
Source International Multimedia Conference archive
Proceedings of the 12th annual ACM international conference on Multimedia table of contents
New York, NY, USA
SESSION: Technical session 15: WWW image retrieval table of contents
Pages: 952 - 959  
Year of Publication: 2004
ISBN:1-58113-893-8
Authors
Deng Cai  UIUC, IL and Microsoft Research Asia, Beijing, China
Xiaofei He  University of Chicago, Chicago, IL
Zhiwei Li  Microsoft Research Asia, Beijing, China
Wei-Ying Ma  Microsoft Research Asia, Beijing, China
Ji-Rong Wen  Microsoft Research Asia, Beijing, China
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 35,   Downloads (12 Months): 278,   Citation Count: 34
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027527.1027747
What is a DOI?

ABSTRACT

We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the results into different semantic clusters facilitates users' browsing. In this paper, we propose a hierarchical clustering method using visual, textual and link analysis. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By using block-level link analysis techniques, an image graph can be constructed. We then apply spectral techniques to find a Euclidean embedding of the images which respects the graph structure. Thus for each image, we have three kinds of representations, i.e. visual feature based representation, textual feature based representation and graph based representation. Using spectral clustering techniques, we can cluster the search results into different semantic clusters. An image search example illustrates the potential of these techniques.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
AltaVista image search, http://www.altavista.com/image/
 
2
 
3
M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering", Advances in Neural Information Processing Systems 14, Canada, 2001.
 
4
 
5
D. Cai, X. He, W.-Y. Ma, J.-R. Wen and H.-J. Zhang. "Organizing WWW Images Based on The Analysis of Page Layout and Web Link Structure", in The 2004 IEEE International Conference on Multimedia and EXPO, 2004.
6
 
7
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma, "VIPS: a vision-based page segmentation algorithm", Microsoft Technical Report, MSR-TR-2003-79, 2003.
8
9
10
 
11
 
12
Google image search engine, http://images.google.com/
 
13
Google Zeitgeist - Search patterns, trends, and surprises according to Google, (2004) http://www.google.com/press/zeitgeist.html
 
14
 
15
X. He, D. Cai, J.-R. Wen, W.-Y. Ma and H.-J. Zhang, "ImageSeer: Clustering and Searching WWW Images Using Link and Page Layout Analysis", Microsoft Technical Report, MSR-TR-2004-38, 2004.
 
16
X. He, W.-Y. Ma, and H. J. Zhang, "ImageRank: spectral techniques for structural analysis of image database", IEEE International Conference on Multimedia and Expo, 2003.
 
17
18
 
19
A. V. Leouski and B. Croft, An Evaluation of Techniques for Clustering Search Results. Technical Report IR-76, Computer Science Dept., University of Massachusetts, 1996.
 
20
 
21
A. Y. Ng, M. Jordan, and Y. Weiss, "On spectral clustering: Analysis and an algorithm", Advances in Neural Information Processing Systems 14, Vancouver, Canada, 2001.
22
 
23
 
24
J. Smith and S.-F. Chang, "WebSEEK, a content-based image and video search and catalog tool for the web", IEEE Multimedia, 1997.
 
25
M. Stricker and M. Orengo, "Similarity of color images", Proc. Storage and Retrieval for Image and Video Databases,SPIE 2420, pp. 381--392, 1995.
26
 
27
H. Yu, M. Li, H.-J. Zhang, and J. Feng. Color texture moments for content-based image retrieval. In International Conference on Image Processing, pages 24--28. 2002.
 
28
Vivisimo clustering engine, (2004) http://vivisimo.com.
 
29

CITED BY  34

Collaborative Colleagues:
Deng Cai: colleagues
Xiaofei He: colleagues
Zhiwei Li: colleagues
Wei-Ying Ma: colleagues
Ji-Rong Wen: colleagues