Variance estimation when donor imputation is used to fill in missing values |
| |
Authors: | Jean‐François Beaumont Cynthia Bocci |
| |
Affiliation: | 1. Statistics Canada, Statistical Research and Innovation Division, Tunney's Pasture, R.H.Coats Building, 16th Floor, Ottawa, Ontario, Canada K1A 0T6;2. Statistics Canada, Business Survey Methods Division, Tunney's Pasture, R.H.Coats Building, 11th Floor, Ottawa, Ontario, Canada K1A 0T6 |
| |
Abstract: | Donor imputation is frequently used in surveys. However, very few variance estimation methods that take into account donor imputation have been developed in the literature. This is particularly true for surveys with high sampling fractions using nearest donor imputation, often called nearest‐neighbour imputation. In this paper, the authors develop a variance estimator for donor imputation based on the assumption that the imputed estimator of a domain total is approximately unbiased under an imputation model; that is, a model for the variable requiring imputation. Their variance estimator is valid, irrespective of the magnitude of the sampling fractions and the complexity of the donor imputation method, provided that the imputation model mean and variance are accurately estimated. They evaluate its performance in a simulation study and show that nonparametric estimation of the model mean and variance via smoothing splines brings robustness with respect to imputation model misspecifications. They also apply their variance estimator to real survey data when nearest‐neighbour imputation has been used to fill in the missing values. The Canadian Journal of Statistics 37: 400–416; 2009 © 2009 Statistical Society of Canada |
| |
Keywords: | Hot‐Deck imputation Imputation model Nearest‐neighbour imputation Nonresponse variance component SEVANI Smoothing splines |
|
|