20 similar articles found (search time: 15 ms)
Item non‐response in surveys occurs when some, but not all, variables are missing. Unadjusted estimators tend to exhibit some bias, called the non‐response bias, if the respondents differ from the non‐respondents with respect to the study variables. In this paper, we focus on item non‐response, which is usually treated by some form of single imputation. We examine the properties of doubly robust imputation procedures, which are those that lead to an estimator that remains consistent if either the outcome variable or the non‐response mechanism is adequately modelled. We establish the double robustness property of the imputed estimator of the finite population distribution function under random hot‐deck imputation within classes. We also discuss the links between our approach and that of Chambers and Dunstan. The results of a simulation study support our findings.
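The random hot‐deck imputation within classes mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' procedure: the function name `random_hot_deck`, the use of `None` to mark missing items, and the equal-probability choice of donor are all assumptions made for illustration.

```python
import random


def random_hot_deck(values, classes, rng=None):
    """Fill each missing value (None) with a randomly chosen respondent
    value from the same imputation class (hypothetical helper)."""
    rng = rng or random.Random()

    # Collect the observed (donor) values in each class.
    donors = {}
    for value, cls in zip(values, classes):
        if value is not None:
            donors.setdefault(cls, []).append(value)

    imputed = []
    for value, cls in zip(values, classes):
        if value is None:
            if cls not in donors:
                raise ValueError(f"no donor available in class {cls!r}")
            # Assumption: every donor in the class is equally likely.
            value = rng.choice(donors[cls])
        imputed.append(value)
    return imputed


values = [4.0, None, 6.0, 10.0, None, None]
classes = ["a", "a", "a", "b", "b", "b"]
print(random_hot_deck(values, classes, random.Random(0)))
```

Observed values are left unchanged, and each imputed value is always a value actually reported by a respondent in the same class.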

Donor imputation is frequently used in surveys. However, very few variance estimation methods that take into account donor imputation have been developed in the literature. This is particularly true for surveys with high sampling fractions using nearest donor imputation, often called nearest‐neighbour imputation. In this paper, the authors develop a variance estimator for donor imputation based on the assumption that the imputed estimator of a domain total is approximately unbiased under an imputation model; that is, a model for the variable requiring imputation. Their variance estimator is valid, irrespective of the magnitude of the sampling fractions and the complexity of the donor imputation method, provided that the imputation model mean and variance are accurately estimated. They evaluate its performance in a simulation study and show that nonparametric estimation of the model mean and variance via smoothing splines brings robustness with respect to imputation model misspecifications. They also apply their variance estimator to real survey data when nearest‐neighbour imputation has been used to fill in the missing values. The Canadian Journal of Statistics 37: 400–416; 2009 © 2009 Statistical Society of Canada

We consider the problem of estimation of a finite population variance related to a sensitive character under a randomized response model and prove (i) the admissibility of an estimator for a given sampling design in a class of quadratic unbiased estimators and (ii) the admissibility of a sampling strategy in a class of comparable quadratic unbiased strategies.


This paper deals with the problem of estimating the finite population mean in stratified random sampling by using two auxiliary variables. It proposes a ratio-cum-product exponential type estimator of the population mean under different situations: (i) when non-response and measurement errors are present in the study as well as the auxiliary variables; (ii) when there is non-response on the study and auxiliary variables but no measurement error; (iii) when there is complete response on the study variable but non-response and measurement error on the auxiliary variables; and (iv) when there is complete response but measurement error on the study as well as the auxiliary variables. The expressions for the bias and mean square error of the proposed estimator are obtained up to the first degree of approximation. The proposed estimator is compared with the usual unbiased estimator, the ratio estimator and other existing estimators, and conditions are obtained that show the efficacy of the proposed estimator over the other estimators considered. A simulation study is carried out to support the theoretical findings.

In this paper, the restricted almost unbiased ridge regression estimator and restricted almost unbiased Liu estimator are introduced for the vector of parameters in a multiple linear regression model with linear restrictions. The bias, variance matrices and mean square error (MSE) of the proposed estimators are derived and compared. It is shown that the proposed estimators will have smaller quadratic bias but larger variance than the corresponding competitors in the literature. However, they will respectively outperform the latter according to the MSE criterion under certain conditions. Finally, a simulation study and a numerical example are given to illustrate some of the theoretical results.


This article focuses on reducing the additional variance due to randomization of the responses. The idea of additive scrambling and its inverse is used along with (i) a split sample approach and (ii) a double response approach. Specifically, our proposal is based on the randomized response model of Gupta et al. (2006). We selected this model for improvement because it provides estimators of the mean and sensitivity level of a sensitive variable, it is better than all of the competitors proposed before it, and its sensitivity estimator is better even than that of Gupta et al. (2010). Our suggested estimators are unbiased and perform better than the Gupta et al. (2006) estimator. The issue of privacy protection is also discussed.

The purpose of this paper is to examine the asymptotic properties of the operational almost unbiased estimator of regression coefficients, which includes the almost unbiased ordinary ridge estimator as a special case. The small disturbance approximations for the bias and mean square error matrix of the estimator are derived. As a consequence, it is proved that, under certain conditions, the estimator is more efficient than a general class of estimators given by Vinod and Ullah (1981). It is also shown that, if the ordinary ridge estimator (ORE) dominates the ordinary least squares estimator, then the almost unbiased ordinary ridge estimator does not dominate ORE under the mean square error criterion.

In this paper we study the problem of reducing the bias of the ratio estimator of the population mean in a ranked set sampling (RSS) design. We first propose a jackknifed RSS-ratio estimator and then introduce a class of almost unbiased RSS-ratio estimators of the population mean. We also present an unbiased RSS-ratio estimator of the mean using the idea of Hartley and Ross (Nature 174:270–271, 1954) which performs better than its counterpart with simple random sample data. We show that under certain conditions the proposed unbiased and almost unbiased RSS-ratio estimators perform better than the commonly used (biased) RSS-ratio estimator in estimating the population mean in terms of the mean square error. The theoretical results are augmented by a simulation study using a wheat yield data set from the Iranian Ministry of Agriculture to demonstrate the practical benefits of our proposed ratio-type estimators relative to the RSS-ratio estimator in reducing the bias in estimating the average wheat production.

Under the notion of superpopulation models, the concept of minimum expected variance is adopted as an optimality criterion for design-unbiased estimators, i.e. unbiased under repeated sampling. In this article, it is shown that the Horvitz-Thompson estimator is optimal among such estimators if and only if it is model-unbiased, i.e. unbiased under the model. The family of linear models is considered and a sample design is suggested to preserve the model-unbiasedness (and hence the optimality) of the Horvitz-Thompson estimator. It is also shown that under these models the Horvitz-Thompson estimator together with the suggested sample design is optimal among design-unbiased estimators with any sample design (of fixed size n) having non-zero probabilities of inclusion for all population units.
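As a small illustration of what design-unbiasedness means for the Horvitz-Thompson estimator of a total (the sum of each sampled value divided by its inclusion probability), the sketch below averages the estimator over every possible simple random sample without replacement. The function names and the toy population are assumptions for illustration, and simple random sampling is used only because its inclusion probabilities are easy to write down; it is not the sample design suggested in the article.

```python
from fractions import Fraction
from itertools import combinations


def horvitz_thompson_total(sample_values, inclusion_probs):
    """Horvitz-Thompson estimate of a population total: sum of y_i / pi_i."""
    if len(sample_values) != len(inclusion_probs):
        raise ValueError("one inclusion probability is needed per sampled unit")
    if any(p <= 0 or p > 1 for p in inclusion_probs):
        raise ValueError("inclusion probabilities must lie in (0, 1]")
    return sum(y / p for y, p in zip(sample_values, inclusion_probs))


def average_ht_over_all_srs(population, n):
    """Average the HT estimate over all equally likely samples of size n
    (simple random sampling without replacement, pi_i = n / N)."""
    pi = Fraction(n, len(population))  # exact arithmetic avoids rounding noise
    estimates = [
        horvitz_thompson_total(sample, [pi] * n)
        for sample in combinations(population, n)
    ]
    return sum(estimates) / len(estimates)


population = [3, 7, 1, 9, 5]  # hypothetical toy population, total = 25
print(sum(population), average_ht_over_all_srs(population, 2))
```

Because every sample is equally likely under this design, the average over all samples is the design expectation, and it coincides with the true population total.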

Abstract. A model‐based predictive estimator is proposed for the population proportions of a polychotomous response variable, based on a sample from the population and on auxiliary variables, whose values are known for the entire population. The responses for the non‐sample units are predicted using a multinomial logit model, which is a parametric function of the auxiliary variables. A bootstrap estimator is proposed for the variance of the predictive estimator, its consistency is proved and its small sample performance is compared with that of an analytical estimator. The proposed predictive estimator is compared with other available estimators, including model‐assisted ones, both in a simulation study involving different sampling designs and model mis‐specification, and using real data from an opinion survey. The results indicate that the prediction approach appears to use auxiliary information more efficiently than the model‐assisted approach.


This study concerns semiparametric approaches to estimating discrete multivariate count regression functions. The semiparametric approaches investigated combine discrete multivariate nonparametric kernel and parametric estimation such that (i) prior knowledge of the conditional distribution of the model response may be incorporated and (ii) the bias of the traditional nonparametric kernel regression estimator of Nadaraya-Watson may be reduced. Specifically, we are interested in the combination of the two estimation approaches and in some asymptotic properties of the resulting estimators. Asymptotic normality results are shown for the nonparametric correction terms of the parametric start function of the estimators. The performance of the discrete semiparametric multivariate kernel estimators studied is illustrated using simulations and real count data. In addition, diagnostic checks are performed to test the adequacy of the parametric start model to the true discrete regression model. Finally, using discrete semiparametric multivariate kernel estimators provides a bias reduction when the parametric multivariate regression model used as the start regression function belongs to a neighborhood of the true regression model.

A regression model is considered in which the response variable has a type 1 extreme-value distribution for smallest values. Bias approximations for the maximum likelihood estimators are given and a bias reduction estimator for the scale parameter is proposed. The small sample moment properties of the maximum likelihood estimators are compared with the properties of the ordinary least squares estimators and the best linear unbiased estimators based on order statistics for grouped data.

Abstract. The cross‐validation (CV) criterion is known to be a second‐order unbiased estimator of the risk function measuring the discrepancy between the candidate model and the true model, as are the generalized information criterion (GIC) and the extended information criterion (EIC). In the present article, we show that the 2kth‐order unbiased estimator can be obtained using a linear combination from the leave‐one‐out CV criterion to the leave‐k‐out CV criterion. The proposed scheme is unique in that a bias smaller than that of a jackknife method can be obtained without any analytic calculation, that is, it is not necessary to obtain the explicit form of several terms in an asymptotic expansion of the bias. Furthermore, the proposed criterion can be regarded as a finite correction of a bias‐corrected CV criterion by using scalar coefficients in a bias‐corrected EIC obtained by the bootstrap iteration.

The authors develop jackknife and analytical variance estimators for the estimator of Chambers & Dunstan (1986) and Rao, Kovar & Mantel (1990) of the finite population distribution function, using complete auxiliary information. They also describe the associated model and show the design consistency of the variance estimators, whose small‐sample performance is examined through a limited simulation study. They highlight the operational advantages of the jackknife in the model‐based setting of Chambers & Dunstan (1986) and its better conditional performance in the design‐based setting of Rao, Kovar & Mantel (1990).

Consider estimation of a population mean of a response variable when the observations are missing at random with respect to the covariate. Two common approaches to imputing the missing values are the nonparametric regression weighting method and the Horvitz-Thompson (HT) inverse weighting approach. The regression approach includes kernel regression imputation and nearest neighbor imputation. The HT approach, employing inverse kernel-estimated weights, includes the basic estimator, the ratio estimator and the estimator using inverse kernel-weighted residuals. Asymptotic normality of the nearest neighbor imputation estimators is derived and compared to that of the kernel regression imputation estimator under standard regularity conditions on the regression function and the missing pattern function. A comprehensive simulation study shows that the basic HT estimator is most sensitive to discontinuity in the missing data patterns, and the nearest neighbor estimators can be insensitive to missing data patterns unbalanced with respect to the distribution of the covariate. Empirical studies show that the nearest neighbor imputation method is most effective among these imputation methods for estimating a finite population mean and for classifying the species of the iris flower data.
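To make the nearest neighbor imputation idea in the abstract above concrete, here is a minimal sketch with a single covariate. It is only an illustration under assumptions: the function name `nn_impute_mean`, the `None` marker for missing responses, the absolute-difference distance, and the first-donor tie-break are not taken from the article.

```python
def nn_impute_mean(x, y):
    """Estimate the mean of y after replacing each missing y (None) with the
    response of the respondent whose covariate x is closest (hypothetical helper)."""
    donors = [(xi, yi) for xi, yi in zip(x, y) if yi is not None]
    if not donors:
        raise ValueError("at least one observed response is required")

    completed = []
    for xi, yi in zip(x, y):
        if yi is None:
            # Nearest donor in covariate space; ties go to the first donor listed.
            _, yi = min(donors, key=lambda donor: abs(donor[0] - xi))
        completed.append(yi)
    return sum(completed) / len(completed), completed


x = [1.0, 1.8, 3.0, 4.0]
y = [10.0, None, 30.0, None]
mean, completed = nn_impute_mean(x, y)
print(mean, completed)
```

Here the unit with x = 1.8 borrows the response of the unit with x = 1.0, and the unit with x = 4.0 borrows from the unit with x = 3.0, so the imputed mean depends on how the missing units are spread along the covariate.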

Patient dropout is a common problem in studies that collect repeated binary measurements. Generalized estimating equations (GEE) are often used to analyze such data. The dropout mechanism may be plausibly missing at random (MAR), i.e. unrelated to future measurements given covariates and past measurements. In this case, various authors have recommended weighted GEE with weights based on an assumed dropout model, or an imputation approach, or a doubly robust approach based on weighting and imputation. These approaches provide asymptotically unbiased inference, provided the dropout or imputation model (as appropriate) is correctly specified. Other authors have suggested that, provided the working correlation structure is correctly specified, GEE using an improved estimator of the correlation parameters (‘modified GEE’) show minimal bias. These modified GEE have not been thoroughly examined. In this paper, we study the asymptotic bias under MAR dropout of these modified GEE, the standard GEE, and also GEE using the true correlation. We demonstrate that all three methods are biased in general. The modified GEE may be preferred to the standard GEE and are subject to only minimal bias in many MAR scenarios but in others are substantially biased. Hence, we recommend the modified GEE be used with caution.

We propose a class of estimators of the variance of the systematic sample mean, which is unbiased under the assumption that the population follows a superpopulation model that satisfies some mild conditions. The approach is based on the separate estimation of the portion of the variance due to the systematic component of the model and that due to the stochastic component. In particular, we deal with two estimators belonging to the proposed class that are based on moving averages and local polynomials to estimate the systematic component of the model. The latter estimators are unbiased under the assumption that the population follows a linear trend and the errors are homoscedastic and uncorrelated. Through a simulation study we show that these estimators generally outperform, in terms of bias and mean square error, the usual estimator based on the first differences also when the superpopulation model departs significantly from linearity and the errors are heteroscedastic.

Missing data analysis requires assumptions about an outcome model or a response probability model to adjust for potential bias due to nonresponse. Doubly robust (DR) estimators are consistent if at least one of the models is correctly specified. Multiply robust (MR) estimators extend DR estimators by allowing for multiple models for both the outcome and/or response probability models and are consistent if at least one of the multiple models is correctly specified. We propose a robust quasi-randomization-based model approach to bring more protection against model misspecification than the existing DR and MR estimators, where any multiple semiparametric, nonparametric or machine learning models can be used for the outcome variable. The proposed estimator achieves unbiasedness by using a subsampling Rao–Blackwell method, given cell-homogenous response, regardless of any working models for the outcome. An unbiased variance estimation formula is proposed, which does not use any replicate jackknife or bootstrap methods. A simulation study shows that our proposed method outperforms the existing multiply robust estimators.

In this paper a new class of shrinkage estimators is introduced for the shape parameter in an independent and identically distributed two-parameter Weibull model under censored sampling. The main idea is to incorporate the prior guessed value by correcting the standard estimator, which is essentially an unbiased estimator, with optimally weighted ratios of the guessed value and the standard estimator, instead of considering a convex combination of the standard estimator and the difference of the guessed value and the standard estimator. The resulting estimator dominates the standard estimator in a surprisingly large neighborhood of the guessed value. The suggested estimator has also been compared with the minimum mean squared error estimator and a class of estimators suggested by Singh and Shukla in IAPQR Trans 25(2), 107–118, 2000. It is found that the suggested class of estimators has lesser bias as well as lesser mean squared error than its competitors subject to certain conditions.

This paper considers the problem of estimating the population variance $S_y^2$ of the study variable y using auxiliary information in sample surveys. We suggest (i) a chain ratio-type estimator (on the lines of Kadilar and Cingi (2003)) and (ii) a chain ratio-ratio-type exponential estimator, along with their generalized version (on the lines of Singh and Pal (2015)), and study their properties under large sample approximation. Conditions are obtained under which the proposed estimators are more efficient than the usual unbiased estimator $s_y^2$ and the Isaki (1983) ratio estimator. An improved version of the suggested class of estimators is also given along with its properties. An empirical study is carried out in support of the present study.
