ACM Home Page
Please provide us with feedback. Feedback
Web image mining towards universal age estimator
Full text PdfPdf (942 KB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Content track C2: content analysis applications table of contents
Pages 85-94  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Bingbing Ni  National University of Singapore, Singapore
Zheng Song  National University of Singapore, Singapore
Shuicheng Yan  National University of Singapore, Singapore
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 33,   Downloads (12 Months): 33,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631287
What is a DOI?

ABSTRACT

In this paper, we present an automatic web image mining system towards building a universal human age estimator based on facial information, which is applicable to all ethnic groups and various image qualities. First, a large (<391k) yet noisy human aging image dataset is crawled from the photo sharing website Flickr and Google image search engine based on a set of human age related text queries. Then, within each image, several human face detectors of different implementations are used for robust face detection, and all the detected faces with multiple responses are considered as the multiple instances of a bag (image). An outlier removal step with Principal Component Analysis further refines the image set to about 220k faces, and then a robust multi-instance regressor learning algorithm is proposed to learn the kernel-regression based human age estimator under the scenarios with possibly noisy bags. The proposed system has the following characteristics: 1) no manual human age labeling process is required, and the age information is automatically obtained from the age related queries, 2) the derived human age estimator is universal owing to the diversity and richness of Internet images and thus has good generalization capability, and 3) the age estimator learning process is robust to the noises existing in both Internet images and corresponding age labels. This automatically derived human age estimator is extensively evaluated on three popular benchmark human aging databases, and without taking any images from these benchmark databases as training samples, comparable age estimation accuracies with the state-of-the-art results are achieved.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
The fg-net aging database: http://sting.cycollege.ac.cy/ alanitis/fgnetaging.html.
 
2
S. Andrews, I. Tsochantaridis, and T. Hofmann. Support vector machines for multiple-instance learning. In Neural Information Processing Systems, 2002.
 
3
Y. Chen, J. Bi, and J. Wang. Multiple-instance learning via embedded instance selection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(12):1931'lC1947, 2006.
 
4
T. Cootes, G. Edwards, and C. Taylor. Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(6):681--685, 2001.
 
5
A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, 39(1):1--38, 1977.
 
6
Y. Fu and T. Huang. Human age estimation with regression on discriminative aging manifold. IEEE Transactions on Multimedia, 10(4):578--584, 2008.
 
7
X. Geng, Z. Zhou, and K. Smith-Miles. Automatic age estimation based on facial aging patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(12):2234--2240, 2007.
 
8
G. Guo, Y. Fu, C. Dyer, and T. Huang. Image-based human age estimation by manifold learning and locally adjusted robust regression. IEEE Transactions on Image Processing, 17(7):1178--1188, 2008.
 
9
J. Hayashi, M. Yasumoto, H. Ito, and H. Koshimizu. A method for estimating and modeling age and gender using facial image processing. In International Conference on Virtual Systems and Multimedia, pages 439--448, 2001.
 
10
I. Joliffe. Principal component analysis. Springer-Verlag, New York, 1986.
 
11
J. Keeler, D. Rumelhart, and W. Leow. Integrated segmentation and recognition of hand-printed numerals. In Neural Information Processing Systems, pages 557--563, 1990.
 
12
Y. Kwon and N. Lobo. Age classification from facial images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 74(1):1--21, 1999.
 
13
A. Lanitis, C. Draganova, and C. Christodoulou. Comparing different classifiers for automatic age estimation. IEEE Transactions on Systems, Man and Cybernetics, Part B, 34(1):621--628, 2004.
 
14
O. Maron and T. Lozano-Pierez. A framework for multiple-instance learning. In Neural Information Processing Systems, pages 570--576, 1998.
 
15
S. Ray and D. Page. Multiple instance regression. In International Conference on Machine Learning, pages 425--432, 2001.
 
16
K. Ricanek and T. Tesafaye. Morph: A longitudinal image database of normal adult age-progression. In IEEE International Conference on Automatic Face and Gesture Recognition, pages 341--345, March 2006.
 
17
W. Rudin. Principles of Mathematical Analysis, 3nd Edition. McGray-Hill, 1978.
 
18
P. Viola and M. Jones. Robust real-time face detection. In International Conference on Computer Vision, 2001.
 
19
P. Viola, J. Platt, and C. Zhang. Multiple instance boosting for object detection. In Neural Information Processing Systems, 2005.
 
20
J. Wang and J. Zucker. Solving the multiple-instance problem: a lazy learning approach. In International Conference on Machine Learning, pages 1119--1125, 2000.
 
21
S. Yan, H. Wang, X. Tang, J. Liu, and T. Huang. Regression from uncertain labels and its applications to soft-biometrics. IEEE Transactions on Information Forensics and Security, 3(4):698--708.
 
22
S. Yan, X. Zhou, M. Liu, M. Hasegawa-Johnson, and T. S. Huang. Regression from patch-kernel. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1--8, 2008.
 
23
K. Yanai and K. Barnard. Finding visual concept by web image mining. In International World Wide Web Conference, 2006.
 
24
Q. Zhang and S. Goldman. Em-dd: An improved multiple-instance learning technique. In Neural Information Processing Systems, 2001.
 
25
Z. Zhou and M. Zhang. Multi-instance multi-label learning with application to scene classification. In Neural Information Processing Systems, 2007.