Concepedia

Publication | Closed Access

Risk-Sensitive Markov Decision Processes

508

Citations

1

References

1972

Year

Abstract

This paper considers the maximization of certain equivalent reward generated by a Markov decision process with constant risk sensitivity. First, value iteration is used to optimize possibly time-varying processes of finite duration. Then a policy iteration procedure is developed to find the stationary policy with highest certain equivalent gain for the infinite duration case. A simple example demonstrates both procedures.

References

YearCitations

Page 1