Publication | Closed Access
Discounted MDP’s: Distribution Functions and Exponential Utility Maximization
133
Citations
12
References
1987
Year
Mathematical ProgrammingEconomicsEngineeringStochastic GameManagementMarkov KernelMarkov Decision ProcessesExpected UtilityProbability TheoryPresent ValueSequential Decision MakingUtility-driven ModelExponential Utility MaximizationDecision ScienceDecision TheoryStochastic DynamicMarkov Decision ProcessOperations Research
The present value of the rewards associated with a discrete-time Markov process has a probability distribution which depends on the initial state. The first part of the paper applies fixed point theory to a system of equations for the distribution functions of the present value. The second part of the paper expands the model to a Markov decision process (MDP) and considers the maximization of the expected utility of the present value when the utility function is exponential.
| Year | Citations | |
|---|---|---|
Page 1
Page 1