ACM Home Page
Please provide us with feedback. Feedback
Robot trajectory optimization using approximate inference
Full text PdfPdf (911 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 1049-1056  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Author
Marc Toussaint  TU Berlin, Berlin, Germany
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 6,   Downloads (12 Months): 22,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553508
What is a DOI?

ABSTRACT

The general stochastic optimal control (SOC) problem in robotics scenarios is often too complex to be solved exactly and in near real time. A classical approximate solution is to first compute an optimal (deterministic) trajectory and then solve a local linear-quadratic-gaussian (LQG) perturbation model to handle the system stochasticity. We present a new algorithm for this approach which improves upon previous algorithms like iLQG. We consider a probabilistic model for which the maximum likelihood (ML) trajectory coincides with the optimal trajectory and which, in the LQG case, reproduces the classical SOC solution. The algorithm then utilizes approximate inference methods (similar to expectation propagation) that efficiently generalize to non-LQG systems. We demonstrate the algorithm on a simulated 39-DoF humanoid robot.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Attias, H. (2003). Planning by probabilistic inference. Proc. of the 9th Int. Workshop on Artificial Intelligence and Statistics.
 
2
Bryson, A. E., & Ho, Y.-C. (1969). Applied optimal control. Blaisdell Publishing Company.
 
3
Chen, Y.-C. (1991). Solving robot trajectory planning problems with uniform cubic b-splines. Optimal Control Applications and Methods, 12, 247--262.
 
4
Cooper, G. (1988). A method for using belief networks as influence diagrams. Proc. of the Fourth Workshop on Uncertainty in Artificial Intelligence (pp. 55--63).
 
5
 
6
Kappen, B., Gomez, V., & Opper, M. (2009). Optimal control as a graphical model inference problem.
 
7
 
8
 
9
 
10
Schlemmer, M., & Gruebel, G. (1998). Real-time collision- free trajectory optimization of robot manipulators via semi-infinite parameter optimization. Int. Journal of Robotics Research, 17, 1013--1021.
 
11
 
12
 
13
Stengel, R. F. (1986). Stochastic optimal control. Wiley.
 
14
Todorov, E. (2008). General duality between optimal control and estimation. In proceedings of the 47th ieee conf. on decision and control.
 
15
Todorov, E., & Li, W. (2005). A generalized iterative lqg method for locally-optimal feedback control of constrained nonlinear stochastic systems. In Proc. of the american control conference, 300--306.
 
16
Toussaint, M., & Goerick, C. (2007). Probabilistic inference for structured planning in robotics. Int Conf on Intelligent Robots and Systems (IROS 2007) (pp. 3068--3073).
 
17
Toussaint, M., Harmeling, S., & Storkey, A. (2006). Probabilistic inference for solving (PO)MDPs (Technical Report EDI-INF-RR-0934). University of Edinburgh, School of Informatics.
 
18
Yedidia, J., Freeman, W., & Weiss, Y. (2001). Understanding belief propagation and its generalizations.
 
19
Zhang, J., & Knoll, A. (1995). An enhanced optimization approach for generating smooth robot trajectories in the presence of obstacles. Proc. of the 1995 European Chinese Automation Conference (pp. 263--268). London.