| Learning non-redundant codebooks for classifying complex objects |
| Full text |
Pdf
(1.34 MB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 382
archive
Proceedings of the 26th Annual International Conference on Machine Learning
table of contents
Montreal, Quebec, Canada
Pages 1241-1248
Year of Publication: 2009
ISBN:978-1-60558-516-1
|
|
Authors
|
|
Wei Zhang
|
Hewlett-Packard Laboratories, Palo Alto, California, United States
|
|
Akshat Surve
|
Oregon State University, Corvallis, Oregon, United States
|
|
Xiaoli Fern
|
Oregon State University, Corvallis, Oregon, United States
|
|
Thomas Dietterich
|
Oregon State University, Corvallis, Oregon, United States
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 63, Citation Count: 0
|
|
|
ABSTRACT
Codebook-based representations are widely employed in the classification of complex objects such as images and documents. Most previous codebook-based methods construct a single codebook via clustering that maps a bag of low-level features into a fixed-length histogram that describes the distribution of these features. This paper describes a simple yet effective framework for learning multiple non-redundant codebooks that produces surprisingly good results. In this framework, each codebook is learned in sequence to extract discriminative information that was not captured by preceding codebooks and their corresponding classifiers. We apply this framework to two application domains: visual object categorization and document classification. Experiments on large classification tasks show substantial improvements in performance compared to a single codebook or codebooks learned in a bagging style.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
|
| |
3
|
|
| |
4
|
Chechik, G. & Tishby, N. (2002). Extracting relevant structures with side information. Proc. Advances in Neural Information Processing Systems, pp 857--864.
|
| |
5
|
Csurka, G., Dance, C. R., Fan L., Willamowski, J., & Bray, C. (2004). Visual categorization with bags of keypoints. Euro. Conf. Comput. Vision Workshop, pp 59--74.
|
| |
6
|
|
| |
7
|
Deng, H., Zhang W., Mortensen, E., Dietterich, T. & Shapiro, L. (2007). Principal curvature-based region detector for object recognition. Proc. IEEE Conf. Comput. Vision Pattern Recognition, pp 1--8.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
Dorko, G. & Schmid, C. (2005). Object class recognition using discriminative local features. Technical Report RR-5497, INRIA-Rhone-Alpes.
|
| |
12
|
Freund, Y. & Schapire, R. (1996). Experiments with a new boosting algorithm. Proc. Int'l Conf. Machine Learning, pp 148--156.
|
| |
13
|
|
| |
14
|
|
| |
15
|
Kadir, T., Zisserman A., & Brady, M. (2004). An affine invariant salient region detector. Proc. Euro. Conf. Comput. Vision, pp 228--241.
|
| |
16
|
Kalal, Z., Matas, J., & Mikolajczyk K. (2008). Weighted sampling for large-scale boosting. Proc. Brit. Machine Vision Conf.
|
| |
17
|
Natalia Larios , Hongli Deng , Wei Zhang , Matt Sarpola , Jenny Yuen , Robert Paasch , Andrew Moldenke , David A. Lytle , Salvador Ruiz Correa , Eric N. Mortensen , Linda G. Shapiro , Thomas G. Dietterich, Automated insect identification through concatenated histograms of local appearance features: feature vector generation and region detection for deformable objects, Machine Vision and Applications, v.19 n.2, p.105-123, January 2008
[doi> 10.1007/s00138-007-0086-y]
|
| |
18
|
|
| |
19
|
McCallum, A. K. (2002). MALLET: A machine learning for language toolkit. http://mallet.cs.umass.edu.
|
| |
20
|
|
| |
21
|
K. Mikolajczyk , T. Tuytelaars , C. Schmid , A. Zisserman , J. Matas , F. Schaffalitzky , T. Kadir , L. Van Gool, A Comparison of Affine Region Detectors, International Journal of Computer Vision, v.65 n.1-2, p.43-72, November 2005
[doi> 10.1007/s11263-005-3848-x]
|
| |
22
|
Moosmann, F., Triggs, B. & Jurie, F. (2007). Fast discriminative visual codebooks using randomized clustering forests. Proc. Advances in Neural Information Processing Systems, pp 985--992.
|
| |
23
|
|
| |
24
|
Perronnin, F., Dance, C., Csurka, G. & Bressan, M. (2006). Adapted vocabularies for generic visual categorization. Proc. Euro. Conf. Comput. Vision, pp 464--475.
|
| |
25
|
|
| |
26
|
|
| |
27
|
Slonim, N. & Tishby, N. (2001). The power of word clusters for text classification. In Proc. Euro. Colloq. Information Retrieval Research.
|
 |
28
|
|
| |
29
|
Viola, P. & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. Proc. IEEE Conf. Comput. Vision Pattern Recognition, pp 511--518.
|
| |
30
|
|
| |
31
|
Yang, L., Jin, R., Sukthankar R., & Jurie, F. (2008). Unifying discriminative visual codebook generation with classifier training for object category recognition. Proc. IEEE Conf. Comput. Vision Pattern Recognition, pp 1--8.
|
|