Kernel density estimation for hierarchical data |
| |
Authors: | Christopher M. Wilson Patrick Gerard |
| |
Affiliation: | 1. Moffitt Cancer Center, Tampa, Florida, USA;2. christopher.wilson@moffitt.org;4. Clemson University, Clemson, South Carolina, USA |
| |
Abstract: | AbstractMultistage sampling is a common sampling technique employed in many studies. In this setting, observations are identically distributed but not independent, thus many traditional kernel smoothing techniques, which assume that the data are independent and identically distributed (i.i.d.), may not produce reasonable density estimates. In this paper, we sample repeatedly with replacement from each cluster, create multiple i.i.d. samples containing one observation from each cluster, and then create a kernel density estimate from each i.i.d. sample. These estimates will then be combined to form an estimate of the marginal probability density function of the population. |
| |
Keywords: | Kernel density estimation correlated data resampling bandwidth selection multistage sampling random effects |
|
|