Concepedia

Publication | Closed Access

The variance of discounted Markov decision processes

213

Citations

13

References

1982

Year

Abstract

Formulae are presented for the variance and higher moments of the present value of single-stage rewards in a finite Markov decision process. Similar formulae are exhibited for a semi-Markov decision process. There is a short discussion of the obstacles to using the variance formula in algorithms to maximize the mean minus a multiple of the standard deviation.

References

YearCitations

Page 1