| Stochastic search using the natural gradient |
| Full text |
Pdf
(730 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 382
archive
Proceedings of the 26th Annual International Conference on Machine Learning
table of contents
Montreal, Quebec, Canada
Pages 1161-1168
Year of Publication: 2009
ISBN:978-1-60558-516-1
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 8, Downloads (12 Months): 32, Citation Count: 1
|
|
|
ABSTRACT
To optimize unknown 'fitness' functions, we present Natural Evolution Strategies, a novel algorithm that constitutes a principled alternative to standard stochastic search methods. It maintains a multinormal distribution on the set of solution candidates. The Natural Gradient is used to update the distribution's parameters in the direction of higher expected fitness, by efficiently calculating the inverse of the exact Fisher information matrix whereas previous methods had to use approximations. Other novel aspects of our method include optimal fitness baselines and importance mixing, a procedure adjusting batches with minimal numbers of fitness evaluations. The algorithm yields competitive results on a number of benchmarks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Amari, S., Cichocki, A., & Yang, H. (1995). A new learning algorithm for blind signal separation. Advances in Neural Information Processing Systems (NIPS95), 8, 757--763.
|
| |
3
|
Amari, S., & Douglas, S. C. (1998). Why natural gradient? Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP98), 2, 1213--1216.
|
| |
4
|
|
| |
5
|
Gomez, F., Schmidhuber, J., & Miikkulainen, R. (2006). Efficient non-linear control through neu-roevolution. Proceedings of the 16th European Conference on Machine Learning (ECML06), 4212, 654--662.
|
| |
6
|
|
| |
7
|
Kakade, S. (2001). A natural policy gradient. In Advances in neural information processing systems (NIPS01), 12, 1531--1538.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
|
| |
12
|
Suganthan, P. N., Hansen, N., Liang, J. J., Deb, K., Chen, Y. P., Auger, A., & Tiwari, S. (2005). Problem definitions and evaluation criteria for the cec 2005 special session on real-parameter optimization (Technical Report). Nanyang Technological University, Singapore.
|
| |
13
|
Sun, Y., Wierstra, D., Schaul, T., & Schmidhuber, J. (2009). Efficient natural evolution strategies. To appear in: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO09).
|
| |
14
|
Wieland, A. (1991). Evolving neural network controllers for unstable systems. Proceedings of the International Joint Conference on Neural Networks (IJCNN91), 2, 667--673.
|
| |
15
|
|
| |
16
|
Wierstra, D., Schaul, T., Peters, J., & Schmidhuber, J. (2008b). Natural evolution strategies. Proceedings of the Congress on Evolutionary Computation (CEC08), Hongkong, 3381--3387.
|
CITED BY
|
|
Yi Sun , Daan Wierstra , Tom Schaul , Juergen Schmidhuber, Efficient natural evolution strategies, Proceedings of the 11th Annual conference on Genetic and evolutionary computation, July 08-12, 2009, Montreal, Québec, Canada
|
|