ERROR BOUNDS FOR CALCULATION OF THE GITTINS INDICES |
| |
Authors: | You-Gan Wang |
| |
Affiliation: | CSIRO Mathematical and Information Sciences, PO Box 120, Cleveland, Qld 4163 |
| |
Abstract: | For a wide class of semi-Markov decision processes, the optimal policies are expressible in terms of the Gittins indices, which have been found useful in sequential clinical trials and pharmaceutical research planning. In general, the indices can be approximated via calibration based on finite-horizon dynamic programming. This paper provides some results on the accuracy of such approximations and, in particular, gives error bounds for some well-known processes (Bernoulli reward processes, normal reward processes and exponential target processes). |
| |
Keywords: | Bandit process; clinical trials; dynamic programming; stopping time |
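The calibration approach mentioned in the abstract can be illustrated for a Bernoulli reward process. The following is a minimal sketch, not the paper's own algorithm: it assumes a Beta(a, b) prior on the success probability, a discount factor `beta`, and a truncation horizon `horizon`, and it finds by bisection the constant reward `lam` at which retiring to a known arm and continuing with the unknown arm are equally attractive. All function and parameter names are hypothetical.

```python
def continuation_advantage(lam, a, b, beta, horizon):
    """Value of one more pull (then acting optimally) minus retiring now.

    Retiring means collecting the known reward lam for all remaining steps.
    Backward induction over t = number of pulls already observed.
    """
    # At t = horizon no steps remain, so every state has value 0.
    values = [0.0] * (horizon + 1)
    for t in range(horizon - 1, 0, -1):
        remaining = horizon - t
        retire = lam * (1 - beta**remaining) / (1 - beta)
        new_values = []
        for s in range(t + 1):  # s = successes observed among t pulls
            p = (a + s) / (a + b + t)  # posterior mean of success probability
            cont = p * (1 + beta * values[s + 1]) + (1 - p) * beta * values[s]
            new_values.append(max(retire, cont))
        values = new_values
    p0 = a / (a + b)
    cont0 = p0 * (1 + beta * values[1]) + (1 - p0) * beta * values[0]
    retire0 = lam * (1 - beta**horizon) / (1 - beta)
    return cont0 - retire0


def gittins_index_bernoulli(a, b, beta=0.9, horizon=50, tol=1e-6):
    """Finite-horizon approximation to the Gittins index by bisection on lam."""
    lo, hi = 0.0, 1.0  # Bernoulli rewards lie in [0, 1], so the index does too
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if continuation_advantage(mid, a, b, beta, horizon) > 0:
            lo = mid  # continuing still beats retiring: index is above mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for n in (1, 10, 50):
        print(n, gittins_index_bernoulli(1, 1, beta=0.9, horizon=n))
```

In this sketch the approximation with `horizon=1` reduces to the prior mean a/(a+b), and it does not decrease as the horizon grows, because a longer horizon enlarges the set of admissible stopping times. The gap between the truncated value and the infinite-horizon index is the kind of quantity the paper's error bounds address.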