| A bayesian reinforcement learning approach for customizing human-robot interfaces |
| Full text |
Pdf
(317 KB)
|
Source
|
International Conference on Intelligent User Interfaces
archive
Proceedings of the 13th international conference on Intelligent user interfaces
table of contents
Sanibel Island, Florida, USA
SESSION: Short papers
table of contents
Pages 355-360
Year of Publication: 2009
ISBN:978-1-60558-168-2
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 17, Downloads (12 Months): 135, Citation Count: 0
|
|
|
ABSTRACT
Personal robots are becoming increasingly prevalent, which raises a number of interesting issues regarding the design and customization of interfaces to such platforms. The particular problem addressed by this paper is the use of learning methods to improve the quality and effectiveness of human-machine interaction onboard a robotic wheelchair. In support of this, we present a method for learning and adapting probabilistic models with the aid of a human operator. We use a Bayesian reinforcement learning framework, that allows us to mix learning and execution, as well as take advantage of prior information about the world. We address the problems of learning, handling a partially observable environment, and limiting the number of action requests. We demonstrate empirical feasibility of our approach on an interface for an autonomous wheelchair.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
A. Atrash, R. Kaplow, J. Villemure, R. West, H. Yamani, and J. Pineau. Towards the deployment of an intelligent wheelchair in a standardized rehabilitation environment. Interaction Studies, In Submission.
|
| |
3
|
|
| |
4
|
R. Dearden, N. Friedman, and D. Andre. Model based Bayesian exploration. In Uncertainty in Artifical Intelligence, pages 150--159, 1999.
|
 |
5
|
|
| |
6
|
R. Jaulmes, J. Pineau, and D. Precup. Active learning in partially observable Markov decision processes. In European Conference on Machine Learning, 2005.
|
| |
7
|
|
| |
8
|
R. L. Kirby, D. J. Dupuis, A. H. MacPhee, A. L. Coolen, C. Smith, K. L. Best, A. M. Newtown, A. D. Mountain, D. A. MacLeod, and J. P. Bonaparte. The wheelchair skills test (version 2.4). In Arch. Phys. Med. Rehabil., volume 85, pages 795--804, 2004.
|
| |
9
|
S. Koenig and R. Simmons. Unsupervised learning of probabilistic models for robot navigation. In International Conference on Robotics and Automation, 1996.
|
| |
10
|
M. Montemerlo, N. Roy, and S. Thrun. Perspectives on standarization in mobile robot programming: The Carnegie Mellon Navigation (CARMEN) Toolkit. In International Conference on Robotics and Systems, pages 2436--2441, 2003.
|
| |
11
|
|
| |
12
|
J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In International Joint Conference on Artificial Intelligence, pages 1025--1032, 2003.
|
| |
13
|
P. Poupart, N. Vlassis, J. Hoey, and K. Regan. An analytic solution to discrete Bayesian reinforcement learning. National Conference on Artificial Intelligence, 1, 2006.
|
| |
14
|
|
 |
15
|
|
| |
16
|
S. Ross, B. Chaib-draa, and J. Pineau. Bayes-adaptive POMDPs. In Neural Information Processing Systems, 2007.
|
| |
17
|
|
| |
18
|
M. Spaan and N. Vlassis. Perseus: Randomized point-based value iteration for POMDPs. In Journal of Artificial Intelligence Research, pages 195--220, 2005.
|
|