ACM Home Page
Please provide us with feedback. Feedback
Representing shape with a spatial pyramid kernel
Full text PdfPdf (3.15 MB)
Source Conference On Image And Video Retrieval archive
Proceedings of the 6th ACM international conference on Image and video retrieval table of contents
Amsterdam, The Netherlands
Pages: 401 - 408  
Year of Publication: 2007
ISBN:978-1-59593-733-9
Authors
Anna Bosch  University of Girona, Girona, Spain
Andrew Zisserman  University of Oxford, Oxford, UK
Xavier Munoz  University of Girona, Girona, Spain
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 34,   Downloads (12 Months): 233,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1282280.1282340
What is a DOI?

ABSTRACT

The objective of this paper is classifying images by the object categories they contain, for example motorbikes or dolphins. There are three areas of novelty. First, we introduce a descriptor that represents local image shape and its spatial layout, together with a spatial pyramid kernel. These are designed so that the shape correspondence between two images can be measured by the distance between their descriptors using the kernel. Second, we generalize the spatial pyramid kernel, and learn its level weighting parameters (on a validation set). This significantly improves classification performance. Third, we show that shape and appearance kernels may be combined (again by learning parameters on a validation set).

Results are reported for classification on Caltech-101 and retrieval on the TRECVID 2006 data sets. For Caltech-101 it is shown that the class specific optimization that we introduce exceeds the state of the art performance by more than 10%.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
A Bissacco, M. H. Yang, and S. Soatto. Detecting humans via their pose. In NIPS, 2006.
 
3
 
4
A. Bosch, A. Zisserman, and X. Muoz. Scene classification via plsa. In Proc. ECCV, 2006.
 
5
C. Chang and C. Lin. LIBSVM: a library for support vector machines, 2001. Software available at urlhttp://www.csie.ntu.edu.tw/cjlin/libsvm.
 
6
G. Csurka, C. Bray, C. Dance, and L. Fan. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, 2004.
 
7
 
8
 
9
P. F. Felzenszwalb. Learning models for object recognition. In Proc. CVPR, 2001.
 
10
D. Gavrila and V. Philomin. Real-time object detection for "smart" vehicles. In Proc. ICCV, 1999.
 
11
 
12
D. Haussler. Convolution kernels on discrete structures. Technical Report UCS-CRL-99-10, 1999.
 
13
 
14
15
 
16
 
17
 
18
 
19
 
20
 
21
 
22
 
23
 
24
J. Thureson and S. Carlsson. Appearance based qualitative image description for object class recognition. In Proc. ECCV, 2004.
 
25
 
26
 
27

CITED BY  7

Collaborative Colleagues:
Anna Bosch: colleagues
Andrew Zisserman: colleagues
Xavier Munoz: colleagues