Optimal Bayesian strategies for the infinite-armed Bernoulli bandit

Authors:	Ying-Chao Hung

Affiliation:	Department of Statistics, National Chengchi University, Taipei 11605, Taiwan

Abstract:	We consider the bandit problem with an infinite number of Bernoulli arms whose unknown parameters are assumed to be i.i.d. random variables with a common distribution F. Our goal is to construct optimal strategies for choosing arms so that the expected long-run failure rate is minimized. We first review a class of strategies and establish their asymptotic properties when F is known. Based on these results, we propose a new strategy and prove that it is asymptotically optimal when F is unknown. Finally, we show that the proposed strategy performs well in a number of simulation scenarios.

Keywords:	Bandit problem; Bernoulli arms; Bayesian strategy; Prior distribution
This article is indexed in ScienceDirect and other databases.
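
To make the setting in the abstract concrete, the following is a minimal simulation sketch, not the strategy proposed in the paper. It assumes F is Uniform(0, 1) by default and uses a simple illustrative rule: keep pulling the current arm until it has produced a fixed number of failures, then discard it and draw a fresh arm from F. The names `simulate_failure_rate`, `max_failures`, and `sample_p` are hypothetical and chosen only for this example.

```python
import random


def simulate_failure_rate(n_pulls, max_failures=1, sample_p=None, seed=0):
    """Empirical failure rate of a 'discard after max_failures failures' rule.

    Illustrative only: this rule is an assumption made for the sketch,
    not the strategy proposed in the paper.
    """
    rng = random.Random(seed)
    if sample_p is None:
        # Assumption: the prior F is Uniform(0, 1).
        sample_p = lambda r: r.random()

    p = sample_p(rng)      # success probability of the current arm
    arm_failures = 0       # failures observed on the current arm
    total_failures = 0     # failures observed over all pulls

    for _ in range(n_pulls):
        if rng.random() >= p:          # failure occurs with probability 1 - p
            total_failures += 1
            arm_failures += 1
            if arm_failures >= max_failures:
                # Discard this arm; a new arm is always available
                # because the supply of arms is infinite.
                p = sample_p(rng)
                arm_failures = 0

    return total_failures / n_pulls


if __name__ == "__main__":
    for n in (100, 10_000, 1_000_000):
        print(n, simulate_failure_rate(n))
```

Passing a different `sample_p` (for example, one that draws from a Beta distribution via `rng.betavariate`) lets the same sketch be run under other choices of F, and the printed rates give a rough sense of how the failure rate of this simple rule behaves as the number of pulls grows.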