ACM Home Page
Please provide us with feedback. Feedback
The WAMI toolkit for developing, deploying, and evaluating web-accessible multimodal interfaces
Full text PdfPdf (2.24 MB)
Source
International Conference on Multimodal Interfaces archive
Proceedings of the 10th international conference on Multimodal interfaces table of contents
Chania, Crete, Greece
SESSION: Multimodal system design and tools (oral session) table of contents
Pages 141-148  
Year of Publication: 2008
ISBN:978-1-60558-198-9
Authors
Alexander Gruenstein  MIT, Cambridge, MA, USA
Ian McGraw  MIT, Cambridge, MA, USA
Ibrahim Badr  MIT, Cambridge, MA, USA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 86,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1452392.1452420
What is a DOI?

ABSTRACT

Many compelling multimodal prototypes have been developed which pair spoken input and output with a graphical user interface, yet it has often proved difficult to make them available to a large audience. This unfortunate reality limits the degree to which authentic user interactions with such systems can be collected and subsequently analyzed. We present the WAMI toolkit, which alleviates this difficulty by providing a framework for developing, deploying, and evaluating Web-Accessible Multimodal Interfaces in which users interact using speech, mouse, pen, and/or touch. The toolkit makes use of modern web-programming techniques, enabling the development of browser-based applications which rival the quality of traditional native interfaces, yet are available on a wide array of Internet-connected devices. We will showcase several sophisticated multimodal applications developed and deployed using the toolkit, which are available via desktop, laptop, and tablet PCs, as well as via several mobile devices. In addition, we will discuss resources provided by the toolkit for collecting, transcribing, and annotating usage data from multimodal user interactions.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Acero, N. Bernstein, R. Chambers, Y. C. Jui, X. Li, J. Odell, P. Nguyen, O. Scholz, and G. Zweig. Live search for mobile: Web services by voice on the cellphone. In Proc. of ICASSP, 2008.
 
2
G. Aist, J. Allen, E. Campana, C. G. Gallo, S. Stoness, M. Swift, and M. K. Tanenhaus. Incremental understanding in human-computer dialogue and experimental evidence for advantages over nonincremental methods. In R. Artstein and L. Vieu, editors, Proc. of the 11th Workshop on the Semantics and Pragmatics of Dialogue, pages 149--154, 2007.
 
3
J. Axelsson, C. Cross, J. Ferrans, G. McCobb, T. V. Raman, and L. Wilson. Mobile X V 1.2. Technical report, 2005. http://www.voicexml.org/specs/multimodal/xv/mobile/12/.
 
4
A. Gruenstein, B.-J. P. Hsu, J. Glass, S. Seneff, L. Hetherington, S. Cyphers, I. Badr, C. Wang, and S. Liu. A multimodal home entertainment interface via a mobile device. In Proc. of the ACL Workshop on Mobile Language Processing, 2008.
 
5
A. Gruenstein, S. Seneff, and C. Wang. Scalable and portable web-based multimodal dialogue interaction with geographical databases. In Proc. of INTERSPEECH, 2006.
 
6
A. Hjalmarsson. Evaluating AdApt, a multi-modal conversational dialogue system using PARADISE. Master's thesis, KTH, Stockhom, Sweden, 2002.
7
 
8
Java speech grammar format. http://java.sun.com/products/java-media/speech/forDevelopers/JSGF/.
 
9
10
 
11
12
 
13
I. McGraw and S. Seneff. Speech-enabled card games for language learners. In Proc. of the 23rd AAAI Conference on Artificial Intelligence, 2008.
 
14
A. Raux, D. Bohus, B. Langner, A. Black, and M. Eskenazi. Doing research on a deployed spoken dialogue system: One year of Let's Go! experience. In Proc. of INTERSPEECH-ICSLP, 2006.
 
15
S. Seneff, E. Hurley, R. Lau, C. Pao, P. Schmid, and V. Zue. Galaxy-II: A reference architecture for conversational system development. In Proc. ICSLP, 1998.
 
16
 
17
Tellme mobile. http://m.tellme.com.
 
18
K. Wang. SALT: A spoken language interface for web-based multimodal dialog systems. In Proc. of the 7th International Conference on Spoken Language Processing, 2002.
 
19
Yahoo onesearch. http://mobile.yahoo.com.
 
20
V. Zue, S. Seneff, J. Glass, J. Polifroni, C. Pao, T. J. Hazen, and L. Hetherington. JUPITER: A telephone-based conversational interface for weather information. IEEE Transactions on Speech and Audio Processing, 8(1), January 2000.


Collaborative Colleagues:
Alexander Gruenstein: colleagues
Ian McGraw: colleagues
Ibrahim Badr: colleagues