首页 | 本学科首页   官方微博 | 高级检索  
     


Non-ergodic Markov decision processes with a constraint on the asymptotic failure rate: general class of policies
Abstract:In this paper, we introduce a Markov decision model with absorbing states and a constraint on the asymptotic failure rate. The objective is to find a policy which maximizes the infinite horizon expected average reward, given that the system never fails. First, we show that it is sufficient to consider markovian policies. Second, for solving the problem, we restrict ourselves to find a stationary policy. Finally, we give sufficient conditions for optimality in the Markovian policies class.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号