Sensitivity-analysis in discounted Markovian decision problems

Hordijk, A.; Dekker, R.; Kallenberg, L. C. M.

doi:10.1007/BF01721353


Theoretical Papers
Published: September 1985

Volume 7, pages 143–151 (1985)

Operations-Research-Spektrum


Summary

This paper deals with a finite-state, finite-action, discrete-time Markov decision model. A linear programming procedure is developed for the computation of optimal policies over the entire range of the discount factor. Furthermore, a procedure is presented for the computation of a Blackwell optimal policy.

Summary (translated from the German Zusammenfassung)

This paper is concerned with discrete Markov decision models with finite state and action spaces. A linear program is developed for computing optimal policies over the entire range of the discount factor. A procedure for determining a Blackwell-optimal policy is then given.
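
To make the idea of "optimal policies over the entire range of the discount factor" concrete, the sketch below sweeps the discount factor on a toy model and reports which deterministic stationary policy is optimal at each value. This is not the linear programming procedure of the paper: it is a brute-force enumeration, used here only as an illustration. The two-state model (the tables P and R) and the helper names solve, policy_value and best_policy are hypothetical and do not come from the article.

```python
from itertools import product

# Hypothetical toy model (not from the paper).
# P[s][a] is the next-state distribution, R[s][a] the one-step reward.
P = {0: {0: [1.0, 0.0], 1: [0.0, 1.0]},
     1: {0: [0.0, 1.0], 1: [1.0, 0.0]}}
R = {0: {0: 1.0, 1: 0.0},
     1: {0: 2.0, 1: 0.0}}


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [x - f * y for x, y in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def policy_value(policy, alpha):
    """Discounted value of a stationary policy: solve (I - alpha * P_f) v = r_f."""
    n = len(policy)
    A = [[(1.0 if i == j else 0.0) - alpha * P[i][policy[i]][j]
          for j in range(n)] for i in range(n)]
    b = [R[i][policy[i]] for i in range(n)]
    return solve(A, b)


def best_policy(alpha):
    """Enumerate all deterministic stationary policies and keep the best one.

    An optimal policy maximizes the value in every state at once, so it also
    maximizes the sum of the state values, which is used as the criterion.
    """
    policies = product(*[sorted(P[s]) for s in sorted(P)])
    return max(policies, key=lambda f: sum(policy_value(f, alpha)))


if __name__ == "__main__":
    # The grid avoids alpha = 0.5, where two policies tie in this toy model.
    for alpha in (0.1, 0.3, 0.45, 0.55, 0.7, 0.9):
        print(alpha, best_policy(alpha))
```

In this toy model the policy (0, 0) is optimal for discount factors below 0.5 and the policy (1, 0) for discount factors above 0.5. Because (1, 0) stays optimal for all discount factors sufficiently close to 1, it is also Blackwell optimal here. Enumeration grows exponentially with the number of states, so it is only practical for very small models.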


Author information

R. Dekker
Present address: Department of Mathematics and Systems Engineering, Kon./Shell Laboratory Amsterdam, P.O. Box 3003, NL-1003 AA Amsterdam, The Netherlands

Authors and Affiliations

Department of Mathematics and Computer Science, University of Leiden, P.O. Box 9512, NL-2300 RA Leiden, The Netherlands
A. Hordijk, R. Dekker & L. C. M. Kallenberg

Additional information

The research of this author was supported by the Netherlands Foundation for Mathematics (SMC) with financial aid from the Netherlands Organization for the Advancement of Pure Research (ZWO)

About this article

Cite this article

Hordijk, A., Dekker, R. & Kallenberg, L.C.M. Sensitivity-analysis in discounted Markovian decision problems. OR Spektrum 7, 143–151 (1985). https://doi.org/10.1007/BF01721353

Received: 01 October 1984
Accepted: 18 July 1985
Issue Date: September 1985
DOI: https://doi.org/10.1007/BF01721353