Non-ergodic Markov decision processes with a constraint on the asymptotic failure rate: general class of policies期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Non-ergodic Markov decision processes with a constraint on the asymptotic failure rate: general class of policies

Abstract:	In this paper, we introduce a Markov decision model with absorbing states and a constraint on the asymptotic failure rate. The objective is to find a policy which maximizes the infinite horizon expected average reward, given that the system never fails. First, we show that it is sufficient to consider markovian policies. Second, for solving the problem, we restrict ourselves to find a stationary policy. Finally, we give sufficient conditions for optimality in the Markovian policies class.

Keywords: