ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Real-time bag of words, approximately
Full text PdfPdf (428 KB)
Source Conference On Image And Video Retrieval archive
Proceeding of the ACM International Conference on Image and Video Retrieval table of contents
Santorini, Fira, Greece
SESSION: Oral session: best paper candidates table of contents
Article No.: 6  
Year of Publication: 2009
ISBN:978-1-60558-480-5
Authors
J. R. R. Uijlings  University of Amsterdam, Amsterdam, The Netherlands
A. W. M. Smeulders  University of Amsterdam, Amsterdam, The Netherlands
R. J. H. Scha  University of Amsterdam, Amsterdam, The Netherlands
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 43,   Downloads (12 Months): 83,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1646396.1646405
What is a DOI?

ABSTRACT

We start from the state-of-the-art Bag of Words pipeline that in the 2008 benchmarks of TRECvid and PASCAL yielded the best performance scores. We have contributed to that pipeline, which now forms the basis to compare various fast alternatives for all of its components: (i) For descriptor extraction we propose a fast algorithm to densely sample SIFT and SURF, and we compare several variants of these descriptors. (ii) For descriptor projection we compare a k-means visual vocabulary with a Random Forest. As a preprojection step we experiment with PCA on the descriptors to decrease projection time. (iii) For classification we use Support Vector Machines and compare the x2 kernel with the RBF kernel. Our results lead to a 10-fold speed increase without any loss of accuracy and to a 30-fold speed increase with 17% loss of accuracy, where the latter system does real-time classification at 26 images per second.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
M. Everingham and J. Winn. The Pascal VOC challenge 2007 development kit. Technical report, University of Leeds, 2007.
 
3
 
4
J. M. Geusebroek, A. W. M. Smeulders, and J. van de Weijer. Fast anisotropic gauss filtering. IEEE Transactions on Image Processing, 12: 938--943, 2003.
 
5
M. Grabner, H. Grabner, and H. Bischof. Fast approximated SIFT. In ACCV, 2006.
6
 
7
 
8
 
9
 
10
M. Marszałek, C. Schmid, H. Harzallah, and J. van de Weijer. Learning representations for visual object class recognition. Pascal VOC 2007 challenge workshop. ICCV, 2007.
 
11
 
12
K. Mikolajczyk and J. Matas. Improving descriptors for fast tree matching by optimal linear projection. In ICCV, 2007.
 
13
 
14
 
15
F. Moosmann, B. Triggs, and F. Jurie. Fast discriminative visual codebooks using randomized clustering forests. In NIPS, pages 985--992, 2006.
 
16
 
17
J. Shotton, M. Johnson, and R. Cipolla. Semantic texton forests for image categorization and segmentation. In CVPR, 2008.
18
 
19
M. A. Tahir, K. van de Sande, J. Uijlings, F. Yan, X. Li, K. Mikolajczyk, J. Kittler, T. Gevers, and A. Smeulders. Uva and surrey @ pascal voc 2008. Pascal VOC 2008 challenge workshop. ECCV, 2008.
20
 
21

Collaborative Colleagues:
J. R. R. Uijlings: colleagues
A. W. M. Smeulders: colleagues
R. J. H. Scha: colleagues