ACM Home Page
Please provide us with feedback. Feedback
Stochastic search using the natural gradient
Full text PdfPdf (730 KB)
Source ACM International Conference Proceeding Series; Vol. 382 archive
Proceedings of the 26th Annual International Conference on Machine Learning table of contents
Montreal, Quebec, Canada
Pages 1161-1168  
Year of Publication: 2009
ISBN:978-1-60558-516-1
Authors
Sun Yi  IDSIA, Manno, Switzerland
Daan Wierstra  IDSIA, Manno, Switzerland
Tom Schaul  IDSIA, Manno, Switzerland
Jürgen Schmidhuber  IDSIA, Manno, Switzerland
Sponsors
: MITACS
: NSF
Microsoft Research : Microsoft Research
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 8,   Downloads (12 Months): 32,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1553374.1553522
What is a DOI?

ABSTRACT

To optimize unknown 'fitness' functions, we present Natural Evolution Strategies, a novel algorithm that constitutes a principled alternative to standard stochastic search methods. It maintains a multinormal distribution on the set of solution candidates. The Natural Gradient is used to update the distribution's parameters in the direction of higher expected fitness, by efficiently calculating the inverse of the exact Fisher information matrix whereas previous methods had to use approximations. Other novel aspects of our method include optimal fitness baselines and importance mixing, a procedure adjusting batches with minimal numbers of fitness evaluations. The algorithm yields competitive results on a number of benchmarks.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Amari, S., Cichocki, A., & Yang, H. (1995). A new learning algorithm for blind signal separation. Advances in Neural Information Processing Systems (NIPS95), 8, 757--763.
 
3
Amari, S., & Douglas, S. C. (1998). Why natural gradient? Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP98), 2, 1213--1216.
 
4
 
5
Gomez, F., Schmidhuber, J., & Miikkulainen, R. (2006). Efficient non-linear control through neu-roevolution. Proceedings of the 16th European Conference on Machine Learning (ECML06), 4212, 654--662.
 
6
 
7
Kakade, S. (2001). A natural policy gradient. In Advances in neural information processing systems (NIPS01), 12, 1531--1538.
 
8
 
9
 
10
 
11
 
12
Suganthan, P. N., Hansen, N., Liang, J. J., Deb, K., Chen, Y. P., Auger, A., & Tiwari, S. (2005). Problem definitions and evaluation criteria for the cec 2005 special session on real-parameter optimization (Technical Report). Nanyang Technological University, Singapore.
 
13
Sun, Y., Wierstra, D., Schaul, T., & Schmidhuber, J. (2009). Efficient natural evolution strategies. To appear in: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO09).
 
14
Wieland, A. (1991). Evolving neural network controllers for unstable systems. Proceedings of the International Joint Conference on Neural Networks (IJCNN91), 2, 667--673.
 
15
 
16
Wierstra, D., Schaul, T., Peters, J., & Schmidhuber, J. (2008b). Natural evolution strategies. Proceedings of the Congress on Evolutionary Computation (CEC08), Hongkong, 3381--3387.


Collaborative Colleagues:
Sun Yi: colleagues
Daan Wierstra: colleagues
Tom Schaul: colleagues
Jürgen Schmidhuber: colleagues