Papers
arxiv:1207.1386

Metrics for Markov Decision Processes with Infinite State Spaces

Published on Jul 4, 2012
Authors:
,
,

Abstract

Metrics for measuring state similarity in MDPs with infinite continuous state spaces provide a stable framework for planning and approximation, showing continuous variation of optimal value functions with respect to these metrics.

We present metrics for measuring state similarity in Markov decision processes (MDPs) with infinitely many states, including MDPs with continuous state spaces. Such metrics provide a stable quantitative analogue of the notion of bisimulation for MDPs, and are suitable for use in MDP approximation. We show that the optimal value function associated with a discounted infinite horizon planning task varies continuously with respect to our metric distances.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 1207.1386
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/1207.1386 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/1207.1386 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/1207.1386 in a Space README.md to link it from this page.

Collections including this paper 1