Response adaptive designs that incorporate switching costs and constraints |
| |
Authors: | Janis P. Hardwick Quentin F. Stout |
| |
Affiliation: | University of Michigan, Ann Arbor, MI 48109-2121, USA |
| |
Abstract: | This paper examines the design and performance of sequential experiments where extensive switching is undesirable. Given an objective function to optimize by sampling between Bernoulli populations, two different models are considered. The constraint model restricts the maximum number of switches possible, while the cost model introduces a charge for each switch. Optimal allocation procedures and a new “hyperopic” procedure are discussed and their behavior examined. For the cost model, if one views the costs as control variables then the optimal allocation procedures yield the optimal tradeoff of expected switches vs. expected value of the objective function. |
| |
Keywords: | Response adaptive sampling Switching Bandit problem Hyperopic design Optimal tradeoffs Sequential allocation Dynamic programming Experimental design |
本文献已被 ScienceDirect 等数据库收录! |