ACM Home Page
Please provide us with feedback. Feedback
High speed obstacle avoidance using monocular vision and reinforcement learning
Full text PdfPdf (1.49 MB)
Source ACM International Conference Proceeding Series; Vol. 119 archive
Proceedings of the 22nd international conference on Machine learning table of contents
Bonn, Germany
Pages: 593 - 600  
Year of Publication: 2005
ISBN:1-59593-180-5
Authors
Jeff Michels  Stanford University, Stanford, CA
Ashutosh Saxena  Stanford University, Stanford, CA
Andrew Y. Ng  Stanford University, Stanford, CA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 0,   Downloads (12 Months): 31,   Citation Count: 4
Additional Information:

abstract   references   cited by   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1102351.1102426
What is a DOI?

ABSTRACT

We consider the task of driving a remote control car at high speeds through unstructured outdoor environments. We present an approach in which supervised learning is first used to estimate depths from single monocular images. The learning algorithm can be trained either on real camera images labeled with ground-truth distances to the closest obstacles, or on a training set consisting of synthetic graphics images. The resulting algorithm is able to learn monocular vision cues that accurately estimate the relative depths of obstacles in a scene. Reinforcement learning/policy search is then applied within a simulator that renders synthetic scenes. This learns a control policy that selects a steering direction as a function of the vision system's output. We present results evaluating the predictive ability of the algorithm both on held out test data, and in actual autonomous driving experiments.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Blthoff, I., Blthoff, H., & Sinha, P. (1998). Top-down influences on stereoscopic depth-perception. Nature Neuroscience, 1, 254--257.
 
3
 
4
 
5
Gini, G., & Marchi, A. (2002). Indoor robot navigation with single camera vision. Proc. Pattern Recognition in Information Systems, PRIS, Spain.
 
6
Honig, J., Heit, B., & Bremont, J. (1996). Visual depth perception based on optical blur. Proc. of Int'l Conf. on Image Processing (pp. 721--724).
 
7
Huber, P. (1981). Robust statistics. New York: Wiley.
 
8
Jahne, B., & Geissler, P. (1994). Depth from focus with one image. Proc. IEEE Conf. on Computer Vision and Pattern Recognition CVPR (pp. 713--717).
 
9
Kardas, E. (2005). Monocular cues in depth perception. {Online}: http://peace.saumag.edu/faculty/Kardas/Courses/GP Weiten/C4SandP/MonoCues.html.
 
10
 
11
Kim, M., & Uther, W. (2003). Automatic gait optimisation for quadruped robots. Proc. Australasian Conf. on Robotics and Automation (pp. 1--9).
 
12
Klarquist, W., Geisler, W., & Bovik, A. (1995). Maximum-likelihood depth-from-defocus for active vision. Proc. Int'l Conf. on Intelligent Robots and Systems (pp. 374--379).
 
13
Kohl, N., & Stone, P. (2004). Policy gradient reinforcement learning for fast quadrupedal locomotion. Proc. IEEE Int'l Conf. Robotics and Automation.
 
14
Kudo, H., Saito, M., Yamamura, T., & Ohnishi, N. (1999). Measurement of the ability in monocular depth perception during gazing at near visual target-effect of the ocular parallax cue. Proc. IEEE Int'l Conf. Systems, Man & Cybernetics (pp. 34--37).
 
15
LeCun, Y. (2003). Presentation at Navigation, Locomotion and Articulation workshop. Washington DC.
 
16
Loomis, J. M. (2001). Looking down is looking up. Nature News and Views, 414, 155--156.
 
17
Nagai, T., Naruse, T., Ikehara, M., & Kurematsu, A. (2002). Hmm-based surface reconstruction from single images. Proc. IEEE Int'l Conf. on Image Processing (pp. II--561 -- 11--564).
 
18
 
19
Pomerleau, D. (1989). An autonomous land vehicle in a neural network. NIPS 1. Morgan Kaufmann.
 
20
 
21
Shao, M., Simchony, T., & Chellappa, R. (1988). New algorithms from reconstruction of a 3-d depth map from one or more images. Proc. IEEE Conf. on Computer Vision and Pattern Recognition CVPR (pp. 530--535).
 
22
 
23
Wu, B., Ooi, T. L., & He, Z. J. (2004). Perceiving distance accurately by a directional process of integrating ground information. Letters to Nature, 428, 73 77.

Collaborative Colleagues:
Jeff Michels: colleagues
Ashutosh Saxena: colleagues
Andrew Y. Ng: colleagues