|
ABSTRACT
From the Publisher: An introduction to the mathematical theory of multistage decision processes, this text takes a functional equation approach to the discovery of optimum policies. Written by a leading developer of such policies, it presents a series of methods, uniqueness and existence theorems, and examples for solving the relevant equations. The text examines existence and uniqueness theorems, the optimal inventory equation, bottleneck problems in multistage production processes, a new formalism in the calculus of variation, strategies behind multistage games, and Markovian decision processes. Each chapter concludes with a problem set that Eric V. Denardo of Yale University, in his informative new introduction, calls a rich lode of applications and research topics. 1957 edition. 37 figures.
|
CITED BY 444
|
|
|
|
|
|
|
|
Jason Cong , Jie Fang , Kei-Yong Khoo, VIA design rule consideration in multi-layer maze routing algorithms, Proceedings of the 1999 international symposium on Physical design, p.214-220, April 12-14, 1999, Monterey, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Amir H. Salek , Jinan Lou , Massoud Pedram, MERLIN: semi-order-independent hierarchical buffered routing tree generation using local neighborhood search, Proceedings of the 36th ACM/IEEE conference on Design automation, p.472-478, June 21-25, 1999, New Orleans, Louisiana, United States
|
|
|
Mihai Barbuceanu , Mark S. Fox, Integrating communicative action, conversations and decision theory to coordinate agents, Proceedings of the first international conference on Autonomous agents, p.49-58, February 05-08, 1997, Marina del Rey, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
James B. Orlin , Andreas S. Schulz , Sudipta Sengupta, &egr;-optimization schemes and L-bit precision (extended abstract): alternative perspectives in combinatorial optimization, Proceedings of the thirty-second annual ACM symposium on Theory of computing, p.565-572, May 21-23, 2000, Portland, Oregon, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
David H. Wolpert , Sergery Kirshner , Chris J. Merz , Kagan Tumer, Adaptivity in agent-based routing for data networks, Proceedings of the fourth international conference on Autonomous agents, p.396-403, June 03-07, 2000, Barcelona, Spain
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Foster Provost , David Jensen , Tim Oates, Efficient progressive sampling, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.23-32, August 15-18, 1999, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Federico Angiolini , Luca Benini , Alberto Caprara, Polynomial-time algorithm for on-chip scratchpad memory partitioning, Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems, October 30-November 01, 2003, San Jose, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Marilyn A. Walker , Jeanne C. Fromer , Shrikanth Narayanan, Learning optimal dialogue strategies: a case study of a spoken dialogue agent for email, Proceedings of the 17th international conference on Computational linguistics, p.1345-1351, August 10-14, 1998, Montreal, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andrew Chou , Jeremy Cooperstock , Ran El-Yaniv , Michael Klugerman , Tom Leighton, The statistical adversary allows optimal money-making trading strategies, Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms, p.467-476, January 22-24, 1995, San Francisco, California, United States
|
|
|
|
|
|
David J. Ward , Alan F. Blackwell , David J. C. MacKay, Dasher—a data entry interface using continuous gestures and language models, Proceedings of the 13th annual ACM symposium on User interface software and technology, p.129-137, November 06-08, 2000, San Diego, California, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Federico Angiolini , Francesco Menichelli , Alberto Ferrero , Luca Benini , Mauro Olivieri, A post-compiler approach to scratchpad mapping of code, Proceedings of the 2004 international conference on Compilers, architecture, and synthesis for embedded systems, September 22-25, 2004, Washington DC, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Kristian Kersting , Martijn Van Otterlo , Luc De Raedt, Bellman goes relational, Proceedings of the twenty-first international conference on Machine learning, p.59, July 04-08, 2004, Banff, Alberta, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
M. Murat Dundar , E. Daniel Hirleman , Arun K. Bhunia , J. Paul Robinson , Bartek Rajwa, Learning with a non-exhaustive training dataset: a case study: detection of bacteria cultures using optical-scattering technology, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, June 28-July 01, 2009, Paris, France
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A. Bazzani , B. Giorgini , S. Rambaldi , M. Brambilla , L. Cattelani, Walking between free will and determinism, Proceedings of the 2007 summer computer simulation conference, July 16-19, 2007, San Diego, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
François Courtemanche , Mehdi Najjar , Blandine Paccoud , André Mayers, Assisting elders via dynamic multi-tasks planning: a Markov decision processes based approach, Proceedings of the 1st international conference on Ambient media and systems, p.1-8, February 11-14, 2008, Quebec, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Sascha Häckel , Marco Fischer , David Zechel , Tobias Teich, A multi-objective ant colony approach for pareto-optimization using dynamic programming, Proceedings of the 10th annual conference on Genetic and evolutionary computation, July 12-16, 2008, Atlanta, GA, USA
|
|
|
|
|
|
|
|
|
Bilge Mutlu , Andreas Krause , Jodi Forlizzi , Carlos Guestrin , Jessica Hodgins, Robust, low-cost, non-intrusive sensing and recognition of seated postures, Proceedings of the 20th annual ACM symposium on User interface software and technology, October 07-10, 2007, Newport, Rhode Island, USA
|
|
|
|
|
|
|
|
|
|
|
|
R. E. Kalman , R. W. Koepcke, The role of digital computers in the dynamic optimization of chemical reactions, Papers presented at the the March 3-5, 1959, western joint computer conference, p.107-116, March 03-05, 1959, San Francisco, California
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Juan M. Corchado , Javier Bajo , Yanira de Paz , Dante I. Tapia, Intelligent environment for monitoring Alzheimer patients, agent technology for health care, Decision Support Systems, v.44 n.2, p.382-396, January, 2008
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jaeyong Lee , Bonjung Koo , Kyungwhan Oh, State space optimization using plan recognition and reinforcement learning on RTS game, Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases, p.165-169, February 20-22, 2008, Cambridge, UK
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Guy Shani , Christopher Meek , Tim Paek , Bo Thiesson , Gina Danielle Venolia, Searching large indexes on tiny devices: optimizing binary search with character pinning, Proceedings of the 13th international conference on Intelligent user interfaces, February 08-11, 2009, Sanibel Island, Florida, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alex Dekhtyar , Judy Goldsmith , Beth Goldstein , Krol Kevin Mathias , Cynthia Isenhour, Planning for success: The interdisciplinary approach to building Bayesian models, International Journal of Approximate Reasoning, v.50 n.3, p.416-428, March, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ralph L. Kodell , Bruce A. Pearce , Songjoon Baek , Hojin Moon , Hongshik Ahn , John F. Young , James J. Chen, A model-free ensemble method for class prediction with application to biomedical decision making, Arificial Intelligence in Medicine, v.46 n.3, p.267-276, July, 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hideki Asoh , Satoru Hayamizu , Isao Hara , Yoichi Motomura , Shotaro Akaho , Toshihiro Matsui, Socially embedded learning of the office-conversant mobile robot Jijo-2, Proceedings of the Fifteenth international joint conference on Artifical intelligence, p.880-885, August 23-29, 1997, Nagoya, Japan
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Damien Ernst , Mevludin Glavic , Florin Capitanescu , Louis Wehenkel, Reinforcement learning versus model predictive control: a comparison on a power system problem, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, v.39 n.2, p.517-529, April 2009
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Chen Cai , Bernhard Hengst , Getian Ye , Enyang Huang , Yang Wang , Carlos Aydos , Glenn Geers, On the performance of adaptive traffic signal control, Proceedings of the Second International Workshop on Computational Transportation Science, p.37-42, November 03-03, 2009, Seattle, Washington
|
|
|
|
|
|
Craig Boutilier , Ray Reiter , Bob Price, Symbolic dynamic programming for first-order MDPs, Proceedings of the 17th international joint conference on Artificial intelligence, p.690-697, August 04-10, 2001, Seattle, WA, USA
|
|
|
Dmitri Dolgov , Edmund Durfee, Stationary deterministic policies for constrained MDPs with multiple rewards, costs, and discount factors, Proceedings of the 19th international joint conference on Artificial intelligence, p.1326-1331, July 30-August 05, 2005, Edinburgh, Scotland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Peng Dai , Mausam Mausam , Daniel S. Weld, Domain-independent, automatic partitioning for probabilistic planning, Proceedings of the 21st international jont conference on Artifical intelligence, p.1677-1683, July 11-17, 2009, Pasadena, California, USA
|
|
|
Andrey Kolobov , Mausam Mausam , Daniel S. Weld, ReTrASE: integrating paradigms for approximate probabilistic planning, Proceedings of the 21st international jont conference on Artifical intelligence, p.1746-1753, July 11-17, 2009, Pasadena, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Christian Dönitz , Iulian Vasile , Christopher Onder , Lino Guzzella, Dynamic programming for hybrid pneumatic vehicles, Proceedings of the 2009 conference on American Control Conference, p.3956-3963, June 10-12, 2009, St. Louis, Missouri, USA
|
|
|
|
|
|
Lieboud Van Den Broeck , Moritz Diehl , Jan Swevers, Performant design of an input shaping prefilter via embedded optimization, Proceedings of the 2009 conference on American Control Conference, p.166-171, June 10-12, 2009, St. Louis, Missouri, USA
|
|
|
Michael Basin , Peng Shi , Dario Calderon-Alvarez, Central suboptimal H∞control design for nonlinear polynomial systems, Proceedings of the 2009 conference on American Control Conference, p.3101-3105, June 10-12, 2009, St. Louis, Missouri, USA
|
|
|
Xingyong Song , Mohd Azrin Mohd Zulkefli , Zongxuan Sun , Hsu-Chiang Miao, Modeling, analysis, and optimal design of the automotive transmission ball capsule system, Proceedings of the 2009 conference on American Control Conference, p.1379-1384, June 10-12, 2009, St. Louis, Missouri, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Shun-Pin Hsu , Dong-Ming Chuang , Ari Arapostathis, On the adaptive control of a class of partially observed Markov decision processes, Proceedings of the 2009 conference on American Control Conference, p.5635-5640, June 10-12, 2009, St. Louis, Missouri, USA
|
|
|
|
|
|
|
|
|
Lei Yang , Jennie Si , Konstantinos S. Tsakalis , Armando A. Rodriguez, Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, v.39 n.6, p.1617-1622, December 2009
|
|