ACM Home Page
Please provide us with feedback. Feedback
Multimodal question answering for mobile devices
Full text PdfPdf (575 KB)
Source
International Conference on Intelligent User Interfaces archive
Proceedings of the 13th international conference on Intelligent user interfaces table of contents
Gran Canaria, Spain
SESSION: Short papers table of contents
Pages 405-408  
Year of Publication: 2008
ISBN:978-1-59593-987-6
Authors
Tom Yeh  Massachusetts Institute of Technology, Cambridge, M.A.
Trevor Darrell  Massachusetts Institute of Technology, Cambridge, M.A.
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
AAAI : Association for the Advancement of Artifical Intelligence
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 58,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1378773.1378841
What is a DOI?

ABSTRACT

This paper introduces multimodal question answering, a new interface for community-based question answering services. By offering users an extra modality---photos---in addition to the text modality to formulate queries, multimodal question answering overcomes the limitations of text-only input methods when the users ask questions regarding visually distinctive objects. Such interface is especially useful when users become curious about an interesting object in the environment and want to know about it---simply by taking a photo and asking a question in a situated (from a mobile device) and intuitive (without describing the object in words) manner. We propose a system architecture for multimodal question answering, describe an algorithm for searching the database, and report on the findings of two prototype studies.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
H. Sonobe, S. Takagi, and F. Yoshimoto. Mobile computing system for fish image retrieval. In IWAIT 04, 2004.
 
4
T. Yeh, J. Lee, and T. Darrell. Adaptive vocabulary forests for dynamic indexing and category learning. In ICCV 07, 2007, to appear.
 
5
T. Yeh, K. Tollmar, and T. Darrell. Searching the web with mobile images for location recognition. CVPR 04, 02:76--81, 2004.