| - | ||||
|
|
Risk-Averse Dynamic Programming for Markov Decision Processes
Andrzej Ruszczynski (rusz Abstract: We introduce the concept of a Markov risk measure and we use it to formulate risk-averse control problems for two Markov decision models: a finite horizon model and a discounted infinite horizon model. For both models we derive risk-averse dynamic programming equations and a value iteration method. For the infinite horizon problem we also develop a risk-averse policy iteration method and we prove its convergence. Finally, we propose a version of the Newton method to solve a nonsmooth equation arising in the policy iteration method and we prove its global convergence. Keywords: Dynamic Risk Measures, Markov Risk Measures, Value Iteration, Policy Iteration, Nonsmooth Newton's Method Category 1: Stochastic Programming Category 2: Other Topics (Dynamic Programming ) Citation: Presented at the 20th International Symposium on Mathematical Programming, Chicago, August 23 - 28 2009; to appear in Mathematical Programming, Series B, 2010. Download: [PDF] Entry Submitted: 12/19/2009 Modify/Update this entry | ||
| Visitors | Authors | More about us | Links | |
|
Subscribe, Unsubscribe Digest Archive Search, Browse the Repository
|
Submit Update Policies |
Coordinator's Board Classification Scheme Credits Give us feedback |
Optimization Journals, Sites, Societies | |
|
||||