| Unsupervised modeling and recognition of object categories with combination of visual contents and geometric similarity links |
| Full text |
Pdf
(1.08 MB)
|
Source
|
International Multimedia Conference
archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval
table of contents
Vancouver, British Columbia, Canada
SESSION: 3D Object retrieval
table of contents
Pages 419-426
Year of Publication: 2008
ISBN:978-1-60558-312-9
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 23, Downloads (12 Months): 123, Citation Count: 0
|
|
|
ABSTRACT
This paper proposes a probabilistic approach for unsupervised modeling and recognition of object categories which combines two types of complementary visual evidence, visual contents and inter-connected links between the images. By doing so, our approach not only increases modeling and recognition performance but also provides possible solutions to several problems including modeling of geometric information, computational complexity, and the inherent ambiguity of visual words. Our approach can be incorporated in any generative models, but here we consider two popular models, pLSA and LDA. Experimental results show that the topic models updated by adding link analysis terms significantly improve the standard pLSA and LDA models. Furthermore, we presented competitive performances on unsupervised modeling, ranking of training images, classification of unseen images, and localization tasks with MSRC and PASCAL2005 datasets.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
| |
3
|
L. Cao and L. Fei-Fei. Spatial coherent latent topic model for concurrent object segmentation and classification. In ICCV, 2007.
|
| |
4
|
D. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In NIPS, 2001.
|
| |
5
|
E. Erosheva, S. Fienberg, and J. Lafferty. Mixed-membership models of scientific publications. PNAS, 101(1):220--5227, 2004.
|
| |
6
|
L. Fei-Fei. Bag of words models: Recognizing and learning object categories. In CVPR Short Courses, 2007.
|
| |
7
|
L. Fei-Fei and P. Perona. A bayesian hierarchical model for learning natural scene categories. In CVPR, 2005.
|
| |
8
|
A. Frome, Y. Singer, F. Sha, and J. Malik. Learning globally-consistent local distance functions for shape-based image retrieval and classification. In ICCV, 2007.
|
| |
9
|
|
| |
10
|
|
| |
11
|
T. Hofmann. Probabilistic latent semantic analysis. In NIPS, 1999.
|
| |
12
|
|
| |
13
|
G. Kim, C. Faloutsos, and M. Hebert. Unsupervised modeling of object categories using link analysis techniques. In CVPR, 2008.
|
| |
14
|
|
| |
15
|
|
 |
16
|
|
| |
17
|
D. Liu and T. Chen. Unsupervised image categorization and object localization using topic models and correspondences between images. In ICCV, 2007.
|
 |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
R. Nallapati and W. Cohen. Link-plsa-lda: A new unsupervised model for topics and influence in blogs. In ICWSM, 2008.
|
| |
22
|
J. C. Niebles and L. Fei-Fei. A hierarchical model of shape and appearance for human action classification. In CVPR, 2007.
|
| |
23
|
M. Richardson and P. Domingos. The intelligent surfer: Probabilistic combination of link and content information in pagerank. In NIPS, 2002.
|
| |
24
|
|
| |
25
|
|
| |
26
|
Erik B. Sudderth , Antonio Torralba , William T. Freeman , Alan S. Willsky, Learning Hierarchical Models of Scenes, Objects, and Parts, Proceedings of the Tenth IEEE International Conference on Computer Vision, p.1331-1338, October 17-20, 2005
[doi> 10.1109/ICCV.2005.137]
|
| |
27
|
|
| |
28
|
X. Wang and E. Grimson. Spatial latent dirichlet allocation. In NIPS, 2007.
|
|