| Representing shape with a spatial pyramid kernel |
| Full text |
Pdf
(3.15 MB)
|
| Source
|
Conference On Image And Video Retrieval
archive
Proceedings of the 6th ACM international conference on Image and video retrieval
table of contents
Amsterdam, The Netherlands
Pages: 401 - 408
Year of Publication: 2007
ISBN:978-1-59593-733-9
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 34, Downloads (12 Months): 233, Citation Count: 6
|
|
|
ABSTRACT
The objective of this paper is classifying images by the object categories they contain, for example motorbikes or dolphins. There are three areas of novelty. First, we introduce a descriptor that represents local image shape and its spatial layout, together with a spatial pyramid kernel. These are designed so that the shape correspondence between two images can be measured by the distance between their descriptors using the kernel. Second, we generalize the spatial pyramid kernel, and learn its level weighting parameters (on a validation set). This significantly improves classification performance. Third, we show that shape and appearance kernels may be combined (again by learning parameters on a validation set). Results are reported for classification on Caltech-101 and retrieval on the TRECVID 2006 data sets. For Caltech-101 it is shown that the class specific optimization that we introduce exceeds the state of the art performance by more than 10%.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Francis R. Bach , Gert R. G. Lanckriet , Michael I. Jordan, Multiple kernel learning, conic duality, and the SMO algorithm, Proceedings of the twenty-first international conference on Machine learning, p.6, July 04-08, 2004, Banff, Alberta, Canada
[doi> 10.1145/1015330.1015424]
|
| |
2
|
A Bissacco, M. H. Yang, and S. Soatto. Detecting humans via their pose. In NIPS, 2006.
|
| |
3
|
|
| |
4
|
A. Bosch, A. Zisserman, and X. Muoz. Scene classification via plsa. In Proc. ECCV, 2006.
|
| |
5
|
C. Chang and C. Lin. LIBSVM: a library for support vector machines, 2001. Software available at urlhttp://www.csie.ntu.edu.tw/cjlin/libsvm.
|
| |
6
|
G. Csurka, C. Bray, C. Dance, and L. Fan. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, 2004.
|
| |
7
|
|
| |
8
|
|
| |
9
|
P. F. Felzenszwalb. Learning models for object recognition. In Proc. CVPR, 2001.
|
| |
10
|
D. Gavrila and V. Philomin. Real-time object detection for "smart" vehicles. In Proc. ICCV, 1999.
|
| |
11
|
|
| |
12
|
D. Haussler. Convolution kernels on discrete structures. Technical Report UCS-CRL-99-10, 1999.
|
| |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
|
| |
22
|
|
| |
23
|
|
| |
24
|
J. Thureson and S. Carlsson. Appearance based qualitative image description for object class recognition. In Proc. ECCV, 2004.
|
| |
25
|
|
| |
26
|
|
| |
27
|
|
CITED BY 7
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Nils Plath , Marc Toussaint , Shinichi Nakajima, Multi-class image segmentation using conditional random fields and global classification, Proceedings of the 26th Annual International Conference on Machine Learning, p.817-824, June 14-18, 2009, Montreal, Quebec, Canada
|
|
|
Justin Talbot , Bongshin Lee , Ashish Kapoor , Desney S. Tan, EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers, Proceedings of the 27th international conference on Human factors in computing systems, April 04-09, 2009, Boston, MA, USA
|
|