ACM Home Page
Please provide us with feedback. Feedback
Modeling annotated data
Full text PdfPdf (1.51 MB)
Source Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval table of contents
Toronto, Canada
SESSION: Multimedia information retrieval table of contents
Pages: 127 - 134  
Year of Publication: 2003
ISBN:1-58113-646-3
Authors
David M. Blei  University of California, Berkeley, Berkeley, CA
Michael I. Jordan  University of California, Berkeley, Berkeley, CA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 196,   Citation Count: 64
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/860435.860460
What is a DOI?

ABSTRACT

We consider the problem of modeling annotated data---data with multiple types where the instance of one type (such as a caption) serves as a description of the other type (such as an image). We describe three hierarchical probabilistic mixture models which aim to describe such data, culminating in correspondence latent Dirichlet allocation, a latent variable model that is effective at modeling the joint distribution of both types and the conditional distribution of the annotation given the primary type. We conduct experiments on the Corel database of images and captions, assessing performance in terms of held-out likelihood, automatic annotation, and text-based image retrieval.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
H. Attias. A variational Bayesian framework for graphical models. In Advances in Neural Information Processing Systems 12, 2000.
 
2
 
3
 
4
D. Cohn and T. Hofmann. The missing link-A probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems 13, 2001.
 
5
A. Goodrum. Image information retrieval: An overview of current research. Informing Science, 3(2):63--67, 2000.
6
 
7
8
 
9
C. Morris. Parametric empirical Bayes inference: Theory and applications. Journal of the American Statistical Association, 78(381):47--65, 1983.
 
10
M. Naphade and T. Huang. A probabilistic framework for semantic video indexing, filtering and retrieval. IEEE Transactions on Multimedia, 3(1):141--151, March 2001.
11
 
12
 
13
B. Taskar, E. Segal, and D. Koller. Probabilistic clustering in relational data. In IJCAI-01, pages 870--876, 2001.

CITED BY  64

Collaborative Colleagues:
David M. Blei: colleagues
Michael I. Jordan: colleagues