Loss bounds for uncertain transition probabilities in Markov decision processes

Andrew Mastin; Patrick Jaillet

doi:10.1109/CDC.2012.6426504

Loss bounds for uncertain transition probabilities in Markov decision processes

Source

2012 IEEE 51st IEEE Conference on Decision and Control (CDC) > 6708 - 6715

Abstract

We analyze losses resulting from uncertain transition probabilities in Markov decision processes with bounded nonnegative rewards. We assume that policies are precomputed using exact dynamic programming with the estimated transition probabilities, but the system evolves according to different, true transition probabilities. Given a bound on the total variation error of estimated transition probability distributions, we derive upper bounds on the loss of expected total reward. The approach analyzes the growth of errors incurred by stepping backwards in time while precomputing value functions, which requires bounding a multilinear program. Loss bounds are given for the finite horizon undiscounted, finite horizon discounted, and infinite horizon discounted cases, and a tight example is shown.

Identifiers

book ISSN :	0743-1546
book e-ISSN :	0743-1546
book ISBN :	978-1-4673-2065-8
book e-ISBN :	978-1-4673-2066-5 , 978-1-4673-2063-4 , 978-1-4673-2064-1
DOI	10.1109/CDC.2012.6426504

Authors

Mastin, Andrew

Laboratory for Information and Decision Systems, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, 02139, USA

Jaillet, Patrick

Laboratory for Information and Decision Systems, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, 02139, USA

Keywords

Markov processes Dynamic programming Vectors Probability distribution Upper bound Approximation methods Linear programming

Additional information

Data set: ieee

Publisher

IEEE

chapter

Read online
Download
Add to read later
Add to collection
Add to followed
Share

Export to bibliography


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Loss bounds for uncertain transition probabilities in Markov decision processes $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Mastin, Andrew

Jaillet, Patrick

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Loss bounds for uncertain transition probabilities in Markov decision processes