|
ABSTRACT
Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention has been given to address issues in mining this contextual information. In this paper, we propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages. We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results compared to an existing segmentation algorithm.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Cai, D. et al., Hierarchical clustering of WWW image search results using visual, textual and link information, ACM MM 2004.
|
| |
2
|
Cai, D., S. Yu, J.-R. Wen, and W.-Y. Ma, VIPS: a Vision-based Page Segmentation Algorithm. Microsoft Technical Report, 2003.
|
| |
3
|
Chakrabarti, D., R. Kumar, and K. Punera, A graph-theoretic approach to webpage segmentation, WWW 2008.
|
| |
4
|
Feng, H., R. Shi, and T.-S. Chua, A bootstrapping framework for annotating and retrieving WWW images, ACM MM 2004.
|
| |
5
|
Frankel, C., M.J. Swain, and V. Athitsos, WebSeer: An Image Search Engine for the World Wide Web. 1996.
|
| |
6
|
Gao, B. et al., Web image clustering by consistent utilization of visual features and surrounding texts, ACM MM 2005.
|
| |
7
|
Gong, Z., L.H. U., and C.W. Cheang, Web image indexing by using associated texts. Knowl. Inf. Syst., 2006.
|
| |
8
|
He, X. et al., Clustering and searching WWW images using link and page layout analysis. ACM Trans. Multimedia Comput. Commun. Appl., 2007. 3(2): p. 10.
|
| |
9
|
Hua, Z., X.--J. Wang, Q. Liu, and H. Lu, Semantic knowledge extraction and annotation for web images, ACM MM 2005.
|
| |
10
|
Kao, H.-Y., WISDOM: Web Intrapage Informative Structure Mining Based on Document Object Model. TKDE 2005.
|
| |
11
|
Kohlschutter, C. and W. Nejdl, A densitometric approach to web page segmentation, CIKM 2008.
|
| |
12
|
Li, J., T. Liu, W. Wang, and W. Gao, A Broadcast Model for Web Image Annotation, PCM 2006.
|
| |
13
|
Liu, J., M. Li, Q. Liu, H. Lu, and S. Ma, Image annotation via graph learning. Pattern Recogn., 2009.
|
| |
14
|
Sclaroff, S. et al., Unifying textual and visual cues for content--based image retrieval on the WWW. Comp.Vis.Img.Underst., 1999.
|
| |
15
|
Shen, H.T., B.C. Ooi, and K.-L. Tan, Giving meanings to WWW images, ACM MM 2000.
|
|