ACM Home Page
Please provide us with feedback. Feedback
Exploiting referential context in spoken language interfaces for data-poor domains
Full text PdfPdf (412 KB)
Source
International Conference on Intelligent User Interfaces archive
Proceedings of the 13th international conference on Intelligent user interfaces table of contents
Gran Canaria, Spain
SESSION: Speech table of contents
Pages 285-292  
Year of Publication: 2008
ISBN:978-1-59593-987-6
Authors
Stephen Wu  University of Minnesota, Minneapolis, MN
Lane Schwartz  University of Minnesota, Minneapolis, MN
William Schuler  University of Minnesota, Minneapolis, MN
Sponsors
SIGART: ACM Special Interest Group on Artificial Intelligence
AAAI : Association for the Advancement of Artifical Intelligence
SIGCHI: ACM Special Interest Group on Computer-Human Interaction
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 45,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1378773.1378811
What is a DOI?

ABSTRACT

This paper describes an implementation of a shell-like programming interface that utilizes referential context (that is, information about the current state of an interfaced application) in order to achieve accurate recognition -- even in user-defined domains with no available domain-specific training corpora. The interface incorporates a knowledge of context into its model of syntax, yielding a referential semantic language model. Interestingly, the referential semantic language model exploits context dynamically, unlike other recent systems, by using incremental processing and the limited stack memory of an HMM-like time series model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
G. Aist, J. Allen, E. Campana, C. Gallo, S. Stoness, M. Swift, and M. Tanenhaus. Incremental understanding in human-computer dialogue and experimental evidence for advantages over nonincremental methods. In Proc. DECALOG, pages 149--154, 2007.
 
2
J. Baker. The dragon system: an overivew. IEEE Transactions on Acoustics, Speech and Signal Processing, 23(1):24--29, 1975.
 
3
R. J. Brachman and J. G. Schmolze. An overview of the kl-one knolewdge representation system. Cognitive Science, 9(2):171--216, Apr. 1985.
 
4
G. Bugmann, E. Klein, S. Lauria, and T. Kyriacou. Corpus-based robotics: A route instruction example. In Proceedings of Intelligent Autonomous Systems, pages 96--103, 2004.
 
5
 
6
G. Chung, S. Seneff, C. Wang, and I. Hetherington. A dynamic vocabulary spoken dialogue interface. In Proc. ICSLP, 2004.
 
7
A. Church. A formulation of the simple theory of types. Journal of Symbolic Logic, 5(2):56--68, 1940.
 
8
D. DeVault and M. Stone. Domain inference in incremental interpretation. In Proc. ICoS, 2003.
 
9
W. M. Fisher, G. R. Doddington, and K. M. Goudie-Marshall. The darpa speech recognition research database: Specifications and status. In Proceedings of DARPA Workshop on Speech Recognition, pages 93--99, Feb. 1986.
 
10
W. M. Fisher, V. Zue, J. Bernstein, and D. S. Pallet. An acoustic-phonetic data base. Journal of the Acoustical Society of America, 81:S92--S93, 1987.
 
11
P. Gorniak and D. Roy. Grounded semantic composition for visual scenes. Journal of Artificial Intelligence Research, 21:429--470, 2004.
 
12
 
13
F. Jelinek, L. R. Bahl, and R. L. Mercer. Design of a linguistic statistical decoder for the recognition of continuous speech. IEEE Transactions on Information Theory, 21:250--256, 1975.
14
 
15
K. P. Murphy and M. A. Paskin. Linear time inference in hierarchical HMMs. In Proc. NIPS, pages 833--840, 2001.
 
16
T. Robinson. An application of recurrent nets to phone probability estimation. In IEEE Transactions on Neural Networks, 1994.
 
17
D. Roy and N. Mukherjee. Towards situated speech understanding: visual context priming of language models. Computer Speech & Language, 19(2):227--248, 2005.
18
 
19
 
20
A. Tarski. The concept of truth in the languages of the deductive sciences (polish). Prace Towarzystwa Naukowego Warszawskiego, Wydzial III Nauk Matematyczno-Fizycznych, 34, 1933. translated as 'The concept of truth in formalized languages', in: J. Corcoran (Ed.), Logic, Semantics, Metamathematics: papers from 1923 to 1938, Hackett Publishing Company, Indianapolis, IN, 1983, pp. 152--278.
 
21


Collaborative Colleagues:
Stephen Wu: colleagues
Lane Schwartz: colleagues
William Schuler: colleagues