|
ABSTRACT
Effective learning from demonstration techniques enable complex robot behaviors to be taught from a small number of demonstrations. A number of recent works have explored interactive approaches to demonstration, in which both the robot and the teacher are able to select training examples. In this paper, we focus on a demonstration selection algorithm used by the robot to identify informative states for demonstration. Existing automated approaches for demonstration selection typically rely on a single threshold value, which is applied to a measure of action confidence. We highlight the limitations of using a single fixed threshold for a specific subset of algorithms, and contribute a method for automatically setting multiple confidence thresholds designed to target domain states with the greatest uncertainty. We present a comparison of our multi-threshold selection method to confidence-based selection using a single fixed threshold, and to manual data selection by a human teacher. Our results indicate that the automated multi-threshold approach significantly reduces the number of demonstrations required to learn the task.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
K. S. Barber, A. Goel, and C. Martin. Dynamic adaptive autonomy in multi-agent systems. Journal of Experimental & Theoretical Artificial Intelligence, 12:129--147, 2000.
|
| |
4
|
D. C. Bentivegna, A. Ude, C. G. Atkeson, and G. Cheng. Learning to act from observation and practice. International Journal of Humanoid Robotics, 1(4), 2004.
|
 |
5
|
|
| |
6
|
T. W. Fong, C. Thorpe, and C. Baur. Collaboration, dialogue, and human-robot interaction. In Proceedings of the 10th International Symposium of Robotics Research, Lorne, Victoria, Australia, London, November 2001. Springer-Verlag.
|
 |
7
|
Michael A. Goodrich , Timothy W. McLain , Jeffrey D. Anderson , Jisang Sun , Jacob W. Crandall, Managing autonomy in robot teams: observations from four experiments, Proceedings of the ACM/IEEE international conference on Human-robot interaction, March 10-12, 2007, Arlington, Virginia, USA
[doi> 10.1145/1228716.1228721]
|
| |
8
|
D. Grollman and O. Jenkins. Dogged learning for robots. In IEEE International Conference on Robotics and Automation, pages 2483--2488, 2007.
|
| |
9
|
D. H. Grollman and O. C. Jenkins. Learning robot soccer from demonstration: Ball grasping. In Robotics: Science and Systems - Robot Manipulation: Sensing and Adapting to the Real World, 2007.
|
| |
10
|
A. Lockerd and C. Breazeal. Tutelage and socially guided robot learning. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004.
|
| |
11
|
M. N. Nicolescu and M. J. Mataric. Learning and interacting in human-robot domains. In IEEE Transaction on Systems, Man and Cybernetics, pages 419--430, 2001.
|
 |
12
|
|
 |
13
|
H. S. Seung , M. Opper , H. Sompolinsky, Query by committee, Proceedings of the fifth annual workshop on Computational learning theory, p.287-294, July 27-29, 1992, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/130385.130417]
|
| |
14
|
L. Xu, A. Krzyzak, and C. Suen. Several methods for combining multiple classifiers and their applications in handwritten character recognition. IEEE Transactions on System, Man and Cybernetics, pages 418--435, 1992.
|
|