| EXTENT: fusing context, content, and semantic ontology for photo annotation |
| Full text |
Pdf
(257 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 160
archive
Proceedings of the 2nd international workshop on Computer vision meets databases
table of contents
Baltimore, MD
SESSION: Multimedia modeling and querying
table of contents
Pages: 5 - 11
Year of Publication: 2005
ISBN:1-59593-151-1
|
|
Author
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 5, Downloads (12 Months): 26, Citation Count: 1
|
|
|
ABSTRACT
This architecture paper presents EXTENT, a probabilistic framework that uses influence diagrams to fuse metadata of multiple modalities for photo annotation. EXTENT fuses contextual information (location, time, and camera parameters), photo content (perceptual features), and semantic ontology in a synergistic way. It uses causal strengths to encode causalities between variables, and between variables and semantic labels. Through a landmark-recognition case study, we show that EXTENT can provide high-quality annotation, substantially better than any traditional unimodal methods.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
K. Barnard and D. Forsyth. Learning the semantics of words and pictures. In International Conference on Computer Vision, volume 2, pages 408--415, 2000.
|
| |
2
|
E. Y. Chang. Extent: Combining context, content, and semantic ontology for photo annotation. US Provisional Patent, 2005.
|
| |
3
|
E. Y. Chang, K. Goh, G. Sychay, and G. Wu. Content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Trans. on Circuits and Systems for Video Technology Special Issue on Conceptual and Dynamical Aspects of Multimedia Content Description, 13(1):26--38, 2003.
|
 |
4
|
Marc Davis , Simon King , Nathan Good , Risto Sarvas, From context to content: leveraging context to infer media metadata, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027572]
|
| |
5
|
A. Deshpande, C. Guestrin, S. Madden, and W. Hong. Beyong pixels: Exploiting camera metadata for photo classification. IEEE CVPR, 2004.
|
| |
6
|
|
| |
7
|
|
| |
8
|
N. Friedman and D. Koller. Learning bayesian networks from data (tutorial). NIPS, 2000.
|
| |
9
|
|
 |
10
|
|
| |
11
|
D. Heckerman. A bayesian approach to learning causal networks. Conference on Uncertainty in Artificial Intelligence, pages 107--118, 1995.
|
| |
12
|
D. Heckerman and R. Shachter. Decision-theoretic foundations for causal reasoning. MSR-TR-94-11, 1994.
|
| |
13
|
|
| |
14
|
|
 |
15
|
Mor Naaman , Susumu Harada , QianYing Wang , Hector Garcia-Molina , Andreas Paepcke, Context data in geo-referenced digital photo collections, Proceedings of the 12th annual ACM international conference on Multimedia, October 10-16, 2004, New York, NY, USA
[doi> 10.1145/1027527.1027573]
|
| |
16
|
M. Naaman, A. Paepcke, and H. Garcia-Molina. From where to what: Metadata sharing for digital photographs with geographic coordinates. International Conference on Cooperative Information Systems (CoopIS), 2003.
|
| |
17
|
L. R. Novick and P. W. Cheng. Assessing interactive causal influence. Psychological Review, 111(2):455--485, 2004.
|
| |
18
|
J. B. Tenenbaum and T. L. Griffiths. Generalization, similarity, and bayesian inference. Behavioral and Brain Sciences, 24:629--641, 2001.
|
 |
19
|
|
| |
20
|
J. Williamson. Causality, in Dov Gabbay & F. Guenthner (eds.): Handbook of Philosophical Logic. Kluwer (to appear), 2005.
|
|