ACM Home Page
Please provide us with feedback. Feedback
Digital Library logoTake a look at the new version of this page: [ beta version ]. Tell us what you think.
Multimedia ontology learning for automatic annotation and video browsing
Full text PdfPdf (976 KB)
Source
International Multimedia Conference archive
Proceeding of the 1st ACM international conference on Multimedia information retrieval table of contents
Vancouver, British Columbia, Canada
SESSION: Multimedia browsing and summarization table of contents
Pages: 387-394  
Year of Publication: 2008
ISBN:978-1-60558-312-9
Authors
Anupama Mallik  Indian Institute of Technology, Delhi, New Delhi, India
Poornachander Pasumarthi  Indian Institute of Technology, Delhi, New Delhi, India
Santanu Chaudhury  Indian Institute of Technology, Delhi, New Delhi, India
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 167,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1460096.1460159
What is a DOI?

ABSTRACT

In this work, we offer an approach to combine standard multimedia analysis techniques with knowledge drawn from conceptual metadata provided by domain experts of a specialized scholarly domain, to learn a domain-specific multimedia ontology from a set of annotated examples. A standard Bayesian network learning algorithm that learns structure and parameters of a Bayesian network is extended to include media observables in the learning. An expert group provides domain knowledge to construct a basic ontology of the domain as well as to annotate a set of training videos. These annotations help derive the associations between high-level semantic concepts of the domain and low-level MPEG-7 based features representing audio-visual content of the videos. We construct a more robust and refined version of this ontology by learning from this set of conceptually annotated videos. To encode this knowledge, we use MOWL, a multimedia extension of Web Ontology Language (OWL) which is capable of describing domain concepts in terms of their media properties and of capturing the inherent uncertainties involved. We use the ontology specified knowledge for recognizing concepts relevant to a video to annotate fresh addition to the video database with relevant concepts in the ontology. These conceptual annotations are used to create hyperlinks in the video collection, to provide an effective video browsing interface to the user.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Invis multimedia webpage. http://www.invisplaza.com/search/index.php?mode=browse&format_=2.
 
2
 
3
N. Friedman and M. Goldszmidt. Learning bayesian networks with local structure. In Proceedings of the Twelth Conference on Uncertainty in Artificial Intelligence pages 252--262, August 1996.
 
4
N. Friedman and Yakhin. On the sample complexity of learning bayesian networks. In Proceedings of the Twelth Conference on Uncertainty in Artificial Intelligence pages 274--282, August 1996.
 
5
H. Ghosh, S. Chaudhury, K. Kashyap, and B. Maiti. Ontology Specification and Integration for Multimedia Applications chapter 9. Springer, 2006.
 
6
7
 
8
T. Karthik, S. Chaudhury, and H. Ghosh. Specifying spatio-temporal relations in multimedia ontologies. In Proceedings of International Conference of Pattern Recognition and Machine Intelligence (PReMI'05) 2005.
 
9
 
10
M. Naphade, T. Kristjansson, B. Frey, and T. Huang. Probabilistic multimedia objects (multijects):a novel approach to video indexing and retrieval in multimedia systems. In International Conference on Image Analysis and Processing pages 536--540, October 1998.
 
11
 
12
P. Quelhas and J.-M. Odobez. Natural scene image modeling using color and texture visterms. In Proceedings International Conference on Image and Video Retrieval (CIVR)July 2006.
13
 
14
 
15
 
16


Collaborative Colleagues:
Anupama Mallik: colleagues
Poornachander Pasumarthi: colleagues
Santanu Chaudhury: colleagues