|
ABSTRACT
As agents begin to perform complex tasks alongside humans as collaborative teammates, it becomes crucial that the resulting human-multiagent teams adapt to time-critical domains. In such domains, adjustable autonomy has proven useful by allowing for a dynamic transfer of control of decision making between human and agents. However, existing adjustable autonomy algorithms commonly discretize time, which not only results in high algorithm runtimes but also translates into inaccurate transfer of control policies. In addition, existing techniques fail to address decision making inconsistencies often encountered in human multiagent decision making. To address these limitations, we present novel approach for Resolving Inconsistencies in Adjustable Autonomy in Continuous Time (RIAACT) that makes three contributions: First, we apply continuous time planning paradigm to adjustable autonomy, resulting in high-accuracy transfer of control policies. Second, our new adjustable autonomy framework both models and plans for the resolving of inconsistencies between human and agent decisions. Third, we introduce a new model, Interruptible Action Time-dependent Markov Decision Problem (IA-TMDP), which allows for actions to be interrupted at any point in continuous time. We show how to solve IA-TMDPs efficiently and leverage them to plan for the resolving of inconsistencies in RIAACT. Furthermore, these contributions have been realized and evaluated in a complex disaster response simulation system.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
J. Boyan and M. Littman. Exact solutions to time-dependent MDPs. In NIPS, pages 1026--1032, 2000.
|
| |
2
|
CALO http://www.ai.sri.com/project/CALO, http://calo.sri.com. CALO: Cognitive Agent that Learns and Organizes, 2003.
|
| |
3
|
|
 |
4
|
Michael A. Goodrich , Timothy W. McLain , Jeffrey D. Anderson , Jisang Sun , Jacob W. Crandall, Managing autonomy in robot teams: observations from four experiments, Proceedings of the ACM/IEEE international conference on Human-robot interaction, March 10-12, 2007, Arlington, Virginia, USA
[doi> 10.1145/1228716.1228721]
|
| |
5
|
L. Li and M. Littman. Lazy approximation for solving continuous finite-horizon MDPs. In AAAI, pages 1175--1180, 2005.
|
| |
6
|
J. Marecki, S. Koenig, and M. Tambe. A fast analytical algorithm for solving markov decision processes with real-valued resources. In IJCAI, January 2007.
|
| |
7
|
R. Nair and M. Tambe. Hybrid bdi-pomdp framework for multiagent teaming. Journal of Artificial Intelligence Research (JAIR), 23:367--420, 2005.
|
| |
8
|
|
| |
9
|
P. Scerri, D. Pynadath, and M. Tambe. Towards adjustable autonomy for the real world. Journal of Artificial Intelligence Research, 17:171--228, 2002.
|
 |
10
|
|
| |
11
|
B. P. Sellner, F. Heger, L. Hiatt, R. Simmons, and S. Singh. Coordinated multi-agent teams and sliding autonomy for large-scale assembly. Proceedings of the IEEE - Special Issue on Multi-Robot Systems, July 2006.
|
 |
12
|
|
| |
13
|
|
|