ACM Home Page
Please provide us with feedback. Feedback
Positive effects of redundant descriptions in an interactive semantic speech interface
Full text PdfPdf (476 KB)
Source
International Conference on Intelligent User Interfaces archive
Proceedings of the 13th international conference on Intelligent user interfaces table of contents
Sanibel Island, Florida, USA
SESSION: Novel input & output table of contents
Pages 217-226  
Year of Publication: 2009
ISBN:978-1-60558-168-2
Authors
Lane Schwartz  University of Minnesota, Minneapolis, MN, USA
Luan Nguyen  University of Minnesota, Minneapolis, MN, USA
Andrew Exley  University of Minnesota, Minneapolis, MN, USA
William Schuler  University of Minnesota, Minneapolis, MN, USA
Sponsors
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
SIGART: ACM Special Interest Group on Artificial Intelligence
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 10,   Downloads (12 Months): 72,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1502650.1502682
What is a DOI?

ABSTRACT

Spoken language interfaces based on interactive semantic language models allow probabilities for hypothesized words to be conditioned on the semantic interpretation of these words in the context of some interfaced application environment. This conditioning may allow users to avoid recognition errors in an intuitive way, by adding extra, possibly redundant description. This paper evaluates the effect on error reduction of redundant descriptions in an interactive semantic language model. In order to evaluate the effect in natural use, the model is run on rich domains, supporting references to sets of individuals (instead of just individuals themselves) arranged in multiple continuous dimensions (a 2-D floorplan scene). Results of these experiments suggest that an interactive semantic language model allows users to achieve significantly higher recognition accuracy by providing additional redundant spoken description.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
G. Aist, J. Allen, E. Campana, C. Gallo, S. Stoness, M. Swift, and M. Tanenhaus. Incremental understanding in human-computer dialogue and experimental evidence for advantages over nonincremental methods. In Proc. DECALOG, pages 149--154, 2007.
 
2
G. Chung, S. Seneff, C. Wang, and I. Hetherington. A dynamic vocabulary spoken dialogue interface. In Proc. ICSLP, pages 1457--1460, 2004.
 
3
D. DeVault and M. Stone. Domain inference in incremental interpretation. In Proc. ICoS, pages 73--87, 2003.
 
4
N. Haddock. Computational models of incremental semantic interpretation. Language and Cognitive Processes, 4:337--368, 1989.
 
5
6
 
7
J. L. McClelland and D. E. Rumelhart. An interactive activation model of context effects in letter perception: Part 1. an account of basic findings. Psychological Review, 88:375--407, 1981.
 
8
 
9
K. P. Murphy and M. A. Paskin. Linear time inference in hierarchical HMMs. In Proc. NIPS, pages 833--840, 2001.
 
10
T. Regier and L. Carlson. Grounding spatial language in perception: An empirical and computational investigation. Journal of Experimental Psychology: General, 130:273--298, 2001.
 
11
T. Robinson. An application of recurrent nets to phone probability estimation. In IEEE Transactions on Neural Networks, volume 5, pages 298--305, 1994.
 
12
D. Roy and N. Mukherjee. Towards situated speech understanding: visual context priming of language models. Computer Speech & Language, 19(2):227--248, 2005.
 
13
W. Schuler, S. AbdelRahman, T. Miller, and L. Schwartz. Toward a psycholinguistically-motivated model of language. In Proceedings of COLING, Manchester, UK, August 2008.
 
14
W. Schuler, S. Wu, and L. Schwartz. A framework for fast incremental interpretation during speech decoding. Computational Linguistics, in press.
 
15
M. K. Tanenhaus, M. J. Spivey-Knowlton, K. M. Eberhard, and J. E. Sedivy. Integration of visual and linguistic information in spoken language comprehension. Science, 268:1632--1634, 1995.
16

Collaborative Colleagues:
Lane Schwartz: colleagues
Luan Nguyen: colleagues
Andrew Exley: colleagues
William Schuler: colleagues