|
ABSTRACT
Spoken dialogue managers have benefited from using stochastic planners such as Markov Decision Processes (MDPs). However, so far, MDPs do not handle well noisy and ambiguous speech utterances. We use a Partially Observable Markov Decision Process (POMDP)-style approach to generate dialogue strategies by inverting the notion of dialogue state; the state represents the user's intentions, rather than the system state. We demonstrate that under the same noisy conditions, a POMDP dialogue manager makes fewer mistakes than an MDP dialogue manager. Furthermore, as the quality of speech recognition degrades, the POMDP dialogue manager automatically adjusts the policy.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Harald Aust and Hermann Ney. 1998. Evaluating dialog systems used in the real world. In Proc. IEEE ICASSP, volume 2, pages 1053--1056.
|
| |
2
|
A. Black, P. Taylor, and R. Caley, 1999. The Festival Speech Synthesis System, 1.4 edition.
|
| |
3
|
Anthony Cassandra, Michael L. Littman, and Nevin L. Zhang. 1997. Incremental pruning: A simple, fast, exact algorithm for partially observable Markov decision processes. In Proc. 13th Ann. Conf. on Uncertainty in Artificial Intelligence (UAI-97), pages 54--61, San Francisco, CA.
|
| |
4
|
|
| |
5
|
Esther Levin, Roberto Pieraccini, and Wieland Eckert. 1998. Using Markov decision process for learning dialogue strategies. In Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP).
|
| |
6
|
George E. Monahan. 1982. A survey of partially observable Markov decision processes. Management Science, 28(1):1--16.
|
| |
7
|
Yasuhisa Niimi and Yutaka Kobayashi. 1996. Dialog control strategy based on the reliability of speech recognition. In Proc. International Conference on Spoken Language Processing (ICSLP).
|
| |
8
|
Ronald Parr and Stuart Russell. 1995. Approximating optimal policies for partially observable stochastic domains. In Proceedings of the 14th International Joint Conferences on Artificial Intelligence.
|
| |
9
|
M. Ravishankar. 1996. Efficient Algorithms for Speech Recognition. Ph.D. thesis, Carnegie Mellon.
|
| |
10
|
Nicholas Roy and Sebastian Thrun. 1999. Coastal navigation with mobile robots. In Advances in Neural Processing Systems, volume 12.
|
| |
11
|
Satinder Singh, Michael Kearns, Diane Litman, and Marilyn Walker. 1999. Reinforcement learning for spoken dialog systems. In Advances in Neural Processing Systems, volume 12.
|
| |
12
|
E. Sondik. 1971. The Optimal Control of Partially Observable Markov Decision Processes. Ph.D. thesis, Stanford University, Stanford, California.
|
| |
13
|
Sebastian Thrun. 1999. Monte carlo pomdps. In S. A. Solla, T. K. Leen, and K. R. Müller, editors, Advances in Neural Processing Systems, volume 12.
|
| |
14
|
Marilyn A. Walker , Jeanne C. Fromer , Shrikanth Narayanan, Learning optimal dialogue strategies: a case study of a spoken dialogue agent for email, Proceedings of the 17th international conference on Computational linguistics, p.1345-1351, August 10-14, 1998, Montreal, Quebec, Canada
|
| |
15
|
|
CITED BY 28
|
|
|
|
|
Matthew Rudary , Satinder Singh , Martha E. Pollack, Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning, Proceedings of the twenty-first international conference on Machine learning, p.91, July 04-08, 2004, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Maria Wolters , Kallirroi Georgila , Johanna D. Moore , Robert H. Logie , Sarah E. MacPherson , Matthew Watson, Reducing working memory load in spoken dialogue systems, Interacting with Computers, v.21 n.4, p.276-287, August, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Steve Young , Milica Gašić , Simon Keizer , François Mairesse , Jost Schatzmann , Blaise Thomson , Kai Yu, The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management, Computer Speech and Language, v.24 n.2, p.150-174, April, 2010
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Beth Ann Hockey , Manny Rayner, Using paraphrases of deep semantic representions to support regression testing in spoken dialogue systems, Proceedings of the Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing, p.14-21, June 05-05, 2009, Boulder, Colorado
|
|
|
M. Gašić , S. Keizer , F. Mairesse , J. Schatzmann , B. Thomson , K. Yu , S. Young, Training and evaluation of the HIS POMDP dialogue system in noise, Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue, June 19-20, 2008, Columbus, Ohio
|
|