ACM Home Page
Please provide us with feedback. Feedback
Semantic video search using natural language queries
Full text PdfPdf (478 KB)
Source
International Multimedia Conference archive
Proceedings of the seventeen ACM international conference on Multimedia table of contents
Beijing, China
SESSION: Short papers session 1: content analysis table of contents
Pages 605-608  
Year of Publication: 2009
ISBN:978-1-60558-608-3
Authors
Asaad Hakeem  ObjectVideo, Reston, VA, USA
Mun Wai Lee  ObjectVideo, Reston, VA, USA
Omar Javed  ObjectVideo, Reston, VA, USA
Niels Haering  ObjectVideo, Reston, VA, USA
Sponsor
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 27,   Downloads (12 Months): 27,   Citation Count: 0
Additional Information:

abstract   references   index terms  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1631272.1631367
What is a DOI?

ABSTRACT

Recent advances in computer vision and artificial intelligence algorithms have allowed automatic extraction of metadata from video. This metadata can be represented by using the RDF/OWL ontology which can encode scene objects and their relationships in an unambiguous and well-formed manner. The encoded data can be queried using SPARQL. However, SPARQL has a steep learning curve and cannot be directly utilized by a general user for video content search. In this paper, we propose a method to bridge this gap by automatically translating user provided natural language query into an ontology-based SPARQL query for semantic video search. The proposed method consists of three major steps. First, semantically labeled training corpus of natural language query sentences is used for learning the Semantic Stochastic Context Free Grammar (SSCFG). Second, given a user provided natural language query sentence, we use the Earley-Stolcke parsing algorithm to determine the maximum likelihood semantic parsing of the query sentence. This parsing infers the semantic meaning for each word in the query sentence from which the SPARQL query is constructed. Third, the SPARQL query is executed to retrieve relevant video segments from the RDF-OWL video content database. The method is evaluated by running natural language queries on surveillance videos from maritime and land-based domains, though the framework itself is general and extensible to search videos from other domains.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
NIST, TRECVID, http://www-nlpir.nist.gov/projects/trecvid/
 
2
R.X. Gao, T.F Wu, N. Sang, and S.C. Zhu "Bayesian Inference for Layer Representation with Mixed Markov Random Field" EMMCVPR, Springer LNCS 4679, Ezhou, China, Aug 2007.
 
3
M. Lee, A. Hakeem, N. Haering., S.C. Zhu. 'SAVE: A Framework for Semantic Annotation of Visual Events', CVPR Workshop, 2008.
 
4
Klein, D, Manning, C.D, "Accurate Unlexicalized Parsing". In ACL 423--430, 2003.
 
5
Fellbaum C., Wordnet: An Electronic Lexical Database, MIT Press.
 
6
D. Moore and I. Essa, "Recognizing Multitasked Activities using Stochastic Context-Free Grammar" CVPR 2001.
 
7
J.C. Earley, An Efficient Context-Free Parsing Algorithm. PhD thesis, Carnegie-Mellon University, 1968.
 
8
 
9
R. Nevatia, J. Hobbs, B. Bolles, "An Ontology for Video Event Representation", WS on Event Detection & Recognition, 2004.
 
10
Androutsopoulos, I., Ritchie, G., Thanisch, P., "Natural Language Interfaces to Databases -- An Introduction". Natural Language Engineering. vol.1(1), pp.29--81, 1995.
 
11
Androutsopoulos, I., Ritchie, G., Thanisch, P., "An Efficient and Portable Natural Language Query Interface for Relational Databases". In: IEAAI&ES, pp.327--330, 1993.
 
12
Popescu, A.M., Etzioni, O., Kautz, H.A., "Towards a Theory of Natural Language Interfaces to Databases". In: IUI, 2003.
 
13
C. Wang, M. Xiong, Q. Zhou, Y. Yu., "PANTO: A Portable Natural Language Interface to Ontologies". In ECSW, 2007.
 
14
Kaufmann, E., Bernstein, A., Zumstein, R., "Querix: A Natural Language Interface to Query Ontologies Based on Clarification Dialogs". In: Intl. Semantic Web Conference, pp.980--981, 2006.
 
15
H Wang, K Zhang, Q Liu, D Tran, Y Yu., "Q2Semantic: A Lightweight Keyword Interface to Semantic Search". ICSW, 2008.
 
16
S.C. Zhu, D.B. Mumford, "Quest for a stochastic grammar of images", In FTCGV, 2006.