首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Unweighted estimators using data collected in a sample survey can be badly biased, whereas weighted estimators are approximately unbiased for population parameters. We present four examples using data from the 1988 National Maternal and Infant Health Survey to demonstrate that weighted and unweighted estimators can be quite different, and to show the underlying causes of such differences.  相似文献   

2.
The estimation of the variance for the GREG (general regression) estimator by weighted residuals is widely accepted as a method which yields estimators with good conditional properties. Since the optimal (regression) estimator shares the properties of GREG estimators which are used in the construction of weighted variance estimators, we introduce the weighting procedure also for estimating the variance of the optimal estimator. This method of variance estimation was originally presented in a seemingly ad hoc manner, and we shall discuss it from a conditional point of view and also look at an alternative way of utilizing the weights. Examples that stress conditional behaviour of estimators are then given for elementary sampling designs such as simple random sampling, stratified simple random sampling and Poisson sampling, where for the latter design we have conducted a small simulation study.  相似文献   

3.
Simultaneous estimation of the vector of the variance components for mixed and random models under the quadratic loss function is considered. For a large class of such models there are identified classes of admissible biased invariant quadratic estimators that are better than some admissible unbiased estimators. Numerous numerical results presented in the paper show that for many of the commonly used balanced models the improvements in the quadratic risk may be considerable over a large set of the parameter space.  相似文献   

4.
The unweighted means estimators of variance components in the one-way design are shown to equal the limit of the weighted MINQUE estimators as the ratio of the treatment variance weight to the error variance weight approaches infinity  相似文献   

5.
In this paper we present methods for inference on data selected by a complex sampling design for a class of statistical models for the analysis of ordinal variables. Specifically, assuming that the sampling scheme is not ignorable, we derive for the class of cub models (Combination of discrete Uniform and shifted Binomial distributions) variance estimates for a complex two stage stratified sample. Both Taylor linearization and repeated replication variance estimators are presented. We also provide design‐based test diagnostics and goodness‐of‐fit measures. We illustrate by means of real data analysis the differences between survey‐weighted and unweighted point estimates and inferences for cub model parameters.  相似文献   

6.
We consider a modelling approach to longitudinal data that aims at estimating flexible covariate effects in a model where the sampling probabilities are modelled explicitly. The joint modelling yields simple estimators that are easy to compute and analyse, even if the sampling of the longitudinal responses interacts with the response level. An incorrect model for the sampling probabilities results in biased estimates. Non-representative sampling occurs, for example, if patients with an extreme development (based on extreme values of the response) are called in for additional examinations and measurements. We allow covariate effects to be time-varying or time-constant. Estimates of covariate effects are obtained by solving martingale equations locally for the cumulative regression functions. Using Aalen's additive model for the sampling probabilities, we obtain simple expressions for the estimators and their asymptotic variances. The asymptotic distributions for the estimators of the non-parametric components as well as the parametric components of the model are derived drawing on general martingale results. Two applications are presented. We consider the growth of cystic fibrosis patients and the prothrombin index for liver cirrhosis patients. The conclusion about the growth of the cystic fibrosis patients is not altered when adjusting for a possible non-representativeness in the sampling, whereas we reach substantively different conclusions about the treatment effect for the liver cirrhosis patients.  相似文献   

7.
Estimation of the population mean under the regression model with random components is considered. Conditions under which the random components regression estimator is design consistent are given. It is shown that consistency holds when incorrect values are used for the variance components. The regression estimator constructed with model parameters that differ considerably from the true parameters performed well in a Monte Carlo study. Variance estimators for the regression predictor are suggested. A variance estimator appropriate for estimators constructed with a biased estimator for the between-group variance component performed well in the Monte Carlo study.  相似文献   

8.
When the probability of selecting an individual in a population is propor­tional to its lifelength, it is called length biased sampling. A nonparametric maximum likelihood estimator (NPMLE) of survival in a length biased sam­ple is given in Vardi (1982). In this study, we examine the performance of Vardi's NPMLE in estimating the true survival curve when observations are from a length biased sample. We also compute estimators based on a linear combination (LCE) of empirical distribution function (EDF) estimators and weighted estimators. In our simulations, we consider observations from a mix­ture of two different distributions, one from F and the other from G which is a length biased distribution of F. Through a series of simulations with vari­ous proportions of length biasing in a sample, we show that the NPMLE and the LCE closely approximate the true survival curve. Throughout the sur­vival curve, the EDF estimators overestimate the survival. We also consider a case where the observations are from three different weighted distributions, Again, both the NPMLE and the LCE closely approximate the true distribu­tion, indicating that the length biasedness is properly adjusted for. Finally, an efficiency study shows that Vardi's estimators are more efficient than the EDF estimators in the lower percentiles of the survival curves.  相似文献   

9.
Unbiased estimators for restricted adaptive cluster sampling   总被引:2,自引:0,他引:2  
In adaptive cluster sampling the size of the final sample is random, thus creating design problems. To get round this, Brown (1994) and Brown & Manly (1998) proposed a modification of the method, placing a restriction on the size of the sample, and using standard but biased estimators for estimating the population mean. But in this paper a new unbiased estimator and an unbiased variance estimator are proposed, based on estimators proposed by Murthy (1957) and extended to sequential and adaptive sampling designs by Salehi & Seber (2001). The paper also considers a restricted version of the adaptive scheme of Salehi & Seber (1997a) in which the networks are selected without replacement, and obtains unbiased estimators. The method is demonstrated by a simple example. Using simulation from this example, the new estimators are shown to compare very favourably with the standard biased estimators.  相似文献   

10.
ABSTRACT

In a regression model with a random individual and a random time effect explicit representations of the nonnegative quadratic minimum biased estimators of the corresponding variances are deduced. These estimators always exist and are unique. Moreover, under normality assumption of the dependent variable unbiased estimators of the mean squared errors of the variance estimates are derived. Finally, confidence intervals on the variance components are considered.  相似文献   

11.
For the balanced random effects models, when the variance components are correlated either naturally or through common prior structures, by assuming a mixed prior distribution for the variance components, we propose some new Bayesian estimators. To contrast and compare the new estimators with the minimum variance unbiased (MVUE) and restricted maximum likelihood estimators (RMLE), some simulation studies are also carried out. It turns out that the proposed estimators have smaller mean squared errors than the MVUE and RMLE.  相似文献   

12.
The present study deals with three different invarint quadratic unbiased estimators (IQUE) for variance components namely quadratic least squares estimators (QLSE), weighted quadratic least squares estimators (WQLSE) and Mitra type estimators (MTE). The variance and covariances of these three different estimators are presented for unbalanced one-way random model. The relative performances of these estimators are assessed based on different optimality criteria like, D-optimality, T-optimality and M-optimality together with variances of these estimators. As a result, it has been shown that MTE has optimal properties.  相似文献   

13.
《统计学通讯:理论与方法》2012,41(16-17):3278-3300
Under complex survey sampling, in particular when selection probabilities depend on the response variable (informative sampling), the sample and population distributions are different, possibly resulting in selection bias. This article is concerned with this problem by fitting two statistical models, namely: the variance components model (a two-stage model) and the fixed effects model (a single-stage model) for one-way analysis of variance, under complex survey design, for example, two-stage sampling, stratification, and unequal probability of selection, etc. Classical theory underlying the use of the two-stage model involves simple random sampling for each of the two stages. In such cases the model in the sample, after sample selection, is the same as model for the population; before sample selection. When the selection probabilities are related to the values of the response variable, standard estimates of the population model parameters may be severely biased, leading possibly to false inference. The idea behind the approach is to extract the model holding for the sample data as a function of the model in the population and of the first order inclusion probabilities. And then fit the sample model, using analysis of variance, maximum likelihood, and pseudo maximum likelihood methods of estimation. The main feature of the proposed techniques is related to their behavior in terms of the informativeness parameter. We also show that the use of the population model that ignores the informative sampling design, yields biased model fitting.  相似文献   

14.
In stratified sampling, methods for the allocation of effort among strata usually rely on some measure of within-stratum variance. If we do not have enough information about these variances, adaptive allocation can be used. In adaptive allocation designs, surveys are conducted in two phases. Information from the first phase is used to allocate the remaining units among the strata in the second phase. Brown et al. [Adaptive two-stage sequential sampling, Popul. Ecol. 50 (2008), pp. 239–245] introduced an adaptive allocation sampling design – where the final sample size was random – and an unbiased estimator. Here, we derive an unbiased variance estimator for the design, and consider a related design where the final sample size is fixed. Having a fixed final sample size can make survey-planning easier. We introduce a biased Horvitz–Thompson type estimator and a biased sample mean type estimator for the sampling designs. We conduct two simulation studies on honey producers in Kurdistan and synthetic zirconium distribution in a region on the moon. Results show that the introduced estimators are more efficient than the available estimators for both variable and fixed sample size designs, and the conventional unbiased estimator of stratified simple random sampling design. In order to evaluate efficiencies of the introduced designs and their estimator furthermore, we first review some well-known adaptive allocation designs and compare their estimator with the introduced estimators. Simulation results show that the introduced estimators are more efficient than available estimators of these well-known adaptive allocation designs.  相似文献   

15.
Capture–recapture processes are biased samplings of recurrent event processes, which can be modelled by the Andersen–Gill intensity model. The intensity function is assumed to be a function of time, covariates and a parameter. We derive the maximum likelihood estimators of both the parameter and the population size and show the consistency and asymptotic normality of the estimators for both recapture and removal studies. The estimators are asymptotically efficient and their theoretical asymptotic relative efficiencies with respect to the existing estimators of Yip and co-workers can be as large as ∞. The variance estimation and a numerical example are also presented.  相似文献   

16.
Many sampling problems from multiple populations can be considered under the semiparametric framework of the biased, or weighted, sampling model. Included under this framework is logistic regression under case–control sampling. For any model, atypical observations can greatly influence the maximum likelihood estimate of the parameters. Several robust alternatives have been proposed for the special case of logistic regression. However, some current techniques can exhibit poor behavior in many common situations. In this paper a new family of procedures are constructed to estimate the parameters in the semiparametric biased sampling model. The procedures incorporate a minimum distance approach, but are instead based on characteristic functions. The estimators can also be represented as the minimizers of quadratic forms in simple residuals, thus yielding straightforward computation. For the case of logistic regression, the resulting estimators are shown to be competitive with the existing robust approaches in terms of both robustness and efficiency, while maintaining affine equivariance. The approach is developed under the case–control sampling scheme, yet is shown to be applicable under prospective sampling logistic regression as well.  相似文献   

17.
This paper investigates a biased regression approach to the preliminary estimation of the Box-Jenkins transfer function weights. Using statistical simulation to generate time series, 14 estimators (various OLS, ridge and principal components estimators) are compared in terms of MSE and standard error of the weight estimators. The estimators are investigated for different levels of multicollinearity, signal-to-noise ratio, number of independent variables, length of time series and number of lags included in the estimation. The results show that the ridge estimators nearly always give lower MSE than the OLS estimator, and in the computationally difficult cases give much lower MSE than the OLS estimator. The principal components estimators can give lower MSE than the OLS, but also higher values. All biased estimators nearly always give much lower estimated standard error than OLS when estimating the weights.  相似文献   

18.
Nonnegative estimators for the variance components of a linear model are obtained by ignoring the condition for unbiasedness in the principle of the MINQUE. An estimator is derived when the priori weights are proportional to the variance components. The ordinary sample variance is shown to be the nonnegative MINQUE. Efficiencies of the three estimators are examined for some special cases of the model.  相似文献   

19.
Summary.  Multilevel modelling is sometimes used for data from complex surveys involving multistage sampling, unequal sampling probabilities and stratification. We consider generalized linear mixed models and particularly the case of dichotomous responses. A pseudolikelihood approach for accommodating inverse probability weights in multilevel models with an arbitrary number of levels is implemented by using adaptive quadrature. A sandwich estimator is used to obtain standard errors that account for stratification and clustering. When level 1 weights are used that vary between elementary units in clusters, the scaling of the weights becomes important. We point out that not only variance components but also regression coefficients can be severely biased when the response is dichotomous. The pseudolikelihood methodology is applied to complex survey data on reading proficiency from the American sample of the 'Program for international student assessment' 2000 study, using the Stata program gllamm which can estimate a wide range of multilevel and latent variable models. Performance of pseudo-maximum-likelihood with different methods for handling level 1 weights is investigated in a Monte Carlo experiment. Pseudo-maximum-likelihood estimators of (conditional) regression coefficients perform well for large cluster sizes but are biased for small cluster sizes. In contrast, estimators of marginal effects perform well in both situations. We conclude that caution must be exercised in pseudo-maximum-likelihood estimation for small cluster sizes when level 1 weights are used.  相似文献   

20.
This paper explores the asymptotic distribution of the restricted maximum likelihood estimator of the variance components in a general mixed model. Restricting attention to hierarchical models, central limit theorems are obtained using elementary arguments with only mild conditions on the covariates in the fixed part of the model and without having to assume that the data are either normally or spherically symmetrically distributed. Further, the REML and maximum likelihood estimators are shown to be asymptotically equivalent in this general framework, and the asymptotic distribution of the weighted least squares estimator (based on the REML estimator) of the fixed effect parameters is derived.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号