|
ABSTRACT
Thumbnail images provide users of image retrieval and browsing systems with a method for quickly scanning large numbers of images. Recognizing the objects in an image is important in many retrieval tasks, but thumbnails generated by shrinking the original image often render objects illegible. We study the ability of computer vision systems to detect key components of images so that automated cropping, prior to shrinking, can render objects more recognizable. We evaluate automatic cropping techniques 1) based on a general method that detects salient portions of images, and 2) based on automatic face detection. Our user study shows that these methods result in small thumbnails that are substantially more recognizable and easier to find in the context of visual search.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
1. ACDSee, ACD Systems, http://www.adsystems.com
|
| |
2
|
2. Adobe Photoshop Album, Adobe Systems Inc., http://www.adobe.com/products/photoshopalbum/
|
 |
3
|
|
| |
4
|
|
| |
5
|
5. Chen, L., Xie, X., Fan, X., Ma, W., Zhang, H., and Zhou, H. A Visual attention model for adapting images on small displays, ACM Multimedia Systems Journal, To appear in Fall 2003.
|
| |
6
|
|
| |
7
|
7. Corbis, http://www.corbis.com
|
 |
8
|
|
| |
9
|
9. Face Detection Demonstration. Robotics Institute, Carnegie Mellon University http://www.vasc.ri.cmu.edu/cgi-bin/demos/findface.cgi
|
| |
10
|
Myron Flickner , Harpreet Sawhney , Wayne Niblack , Jonathan Ashley , Qian Huang , Byron Dom , Monika Gorkani , Jim Hafner , Denis Lee , Dragutin Petkovic , David Steele , Peter Yanker, Query by Image and Video Content: The QBIC System, Computer, v.28 n.9, p.23-32, September 1995
[doi> 10.1109/2.410146]
|
| |
11
|
11. Folk, C.L., Remington, R.W., and Johnston, J.C. Involuntary covert orienting is contingent on attentional control settings. Journal of Experimental Psychology: HP&P, 18:1030-44, 1992.
|
| |
12
|
12. Itti, L., and Koch, C. A Comparison of Feature Combination Strategies for Saliency-Based Visual Attention Systems, SPIE human vision and electronic imaging IV(HVEI'99), San Jose, CA, pp. 473-482, 1999.
|
| |
13
|
|
| |
14
|
14. Kang, H., and Shneiderman, B. Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder, In Proc. of IEEE International Conference on Multimedia and Expo (ICME2000) New York: IEEE, pp. 1539-1542, 2000.
|
| |
15
|
Stan Z. Li , Long Zhu , ZhenQiu Zhang , Andrew Blake , HongJiang Zhang , Harry Shum, Statistical Learning of Multi-view Face Detection, Proceedings of the 7th European Conference on Computer Vision-Part IV, p.67-81, May 28-31, 2002
|
| |
16
|
16. Milanese, R., Wechsler H., Gil S., Bost J., and Pun T. Integration of Bottom-Up and Top-Down Cues for Visual Attention Using Non-Linear Relaxation, In proc. of Computer Vision and Pattern Recognition, IEEE, pp. 781-785, 1994.
|
| |
17
|
17. Milanese, R. Detecting Salient Regions in an Image: from Biological Evidence to Computer Implementation, Ph.D. thesis, Univ. of Geneva, 1993.
|
| |
18
|
18. Palmer, J. Ames, C.T., Lindsey, D.T. Measuring the Effect of Attention on Simple Visual Search. Journal of Experimental Psychology: Human Perception & Performance, 19, pp. 108-130, 1993.
|
| |
19
|
19. Schneiderman, H., and Kanade, T. A Statistical Model for 3D Object Detection Applied to Faces and Cars. In Proc. of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, June, 2000.
|
| |
20
|
20. Vimas Technologies. http://www.vimas.com
|
| |
21
|
21. Wolfe, J.M. Guided Search 2.0: A Revised Model of Visual Search, Psychonomic Buttletin and Review, Vol. 1, No. 2, pp. 202-238, 1994.
|
| |
22
|
|
CITED BY 29
|
|
|
|
|
|
|
|
|
|
|
David F. Huynh , Steven M. Drucker , Patrick Baudisch , Curtis Wong, Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections, CHI '05 extended abstracts on Human factors in computing systems, April 02-07, 2005, Portland, OR, USA
|
|
|
|
|
|
Xing Xie , Hao Liu , Simon Goumaz , Wei-Ying Ma, Learning user interest for image browsing on small-form-factor devices, Proceedings of the SIGCHI conference on Human factors in computing systems, April 02-07, 2005, Portland, Oregon, USA
|
|
|
|
|
|
|
|
|
Anthony Santella , Maneesh Agrawala , Doug DeCarlo , David Salesin , Michael Cohen, Gaze-based interaction for semi-automatic photo cropping, Proceedings of the SIGCHI conference on Human Factors in computing systems, April 22-27, 2006, Montréal, Québec, Canada
|
|
|
Hao Liu , Xing Xie , Xiaoou Tang , Zhi-Wei Li , Wei-Ying Ma, Effective browsing of web image search results, Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, October 15-16, 2004, New York, NY, USA
|
|
|
|
|
|
|
|
|
Vidya Setlur , Saeko Takagi , Ramesh Raskar , Michael Gleicher , Bruce Gooch, Automatic image retargeting, Proceedings of the 4th international conference on Mobile and ubiquitous multimedia, December 08-10, 2005, Christchurch, New Zealand
|
|
|
|
|
|
Vidya Setlur , Saeko Takagi , Ramesh Raskar , Michael Gleicher , Bruce Gooch, Automatic image retargeting, ACM SIGGRAPH 2004 Sketches, August 08-12, 2004, Los Angeles, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|