首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Whenever there is auxiliary information available in any form, the researchers want to utilize it in the method of estimation to obtain the most efficient estimator. When there exists enough amount of correlation between the study and the auxiliary variables, and parallel to these associations, the ranks of the auxiliary variables are also correlated with the study variable, which can be used a valuable device for enhancing the precision of an estimator accordingly. This article addresses the problem of estimating the finite population mean that utilizes the complementary information in the presence of (i) the auxiliary variable and (ii) the ranks of the auxiliary variable for non response. We suggest an improved estimator for estimating the finite population mean using the auxiliary information in the presence of non response. Expressions for bias and mean squared error of considered estimators are derived up to the first order of approximation. The performance of estimators is compared theoretically and numerically. A numerical study is carried out to evaluate the performances of estimators. It is observed that the proposed estimator is more efficient than the usual sample mean and the regression estimators, and some other families of ratio and exponential type of estimators.  相似文献   

2.
A regression estimator using two prior values of population mean (μx) of an auxiliary variable (x) is proposed after a preliminary test of closeness of these prior values to the true valueμx. The proposed preliminary test regression estimator has been found to be more efficient in general than the usual regression estimator when prior values are used in place of μxwithout preliminary test of significance. The efficiency of the proposed estimator over the usual regression estimator has also been computed for different values of Δ0, Δ1, n, and ρ, which showed considerable gain in precision.  相似文献   

3.
ABSTRACT

In this article we reconsider an estimator of population size previously advocated for use when sampling from a population subdivided into different types. We show that it may be usefully adopted in the simple equal-catchability model used in mark-recapture. Unlike the commonly used maximum likelihood estimator, this conditionally unbiased estimator is always finite-valued. Except in situations in which the data contain little relevant information, its performance, in terms of bias and precision, is seen to be at least as good as that of the maximum likelihood estimator. Two estimators of the standard deviation of the conditionally unbiased estimator are considered.  相似文献   

4.
When possible values of a response variable are limited, distributional assumptions about random effects may not be checkable. This may cause a distribution-robust estimator, such as the conditional maximum likelihood estimator to be recommended; however, it does not utilize all the information in the data. We show how, with binary matched pairs, the hierarchical likelihood can be used to recover information from concordant pairs, giving an improvement over the conditional maximum likelihood estimator without losing distribution-robustness.  相似文献   

5.
排序集抽样下利用辅助变量中位数构建了总体均值的改进比率估计模型,分析了该比率估计量的偏差和均方误差,并与简单随机抽样下的比率估计比较,证明了改进后的比率估计均方误差更小。以农作物播种面积和产量为研究对象进行实例分析,研究表明,基于排序集样本和辅助变量中位数的比率估计方法可以有效提高估计精度,验证了该构造方法的可行性。  相似文献   

6.
Summary. The validity of the matching estimator in programme evaluation depends on the completeness of the set of variables that are used for matching. When an attitudinal variable is relevant for the decision about participation, but is either unmeasured or measured only after entry to the programme, estimates of effects may be biased or difficult to interpret. This issue was investigated with data from an evaluation study of careers guidance for employed adults which utilized the method of propensity score matching. Job satisfaction, measured shortly after entry to the programme, was found to be strongly associated with participation but might itself have been influenced by the early experience of careers guidance. Estimates of the effects of guidance were considered both including and excluding the job satisfaction measure from the participation model. Data experiments with adjusted values of job satisfaction were also performed. The results illustrate that omitted attitudinal information poses a particular difficulty for the matching estimator.  相似文献   

7.
A new general method of combining estimators is proposed in order to obtain an estimator with “improved” small sample properties. It is based on a specification test statistic and incorporates some well-known methods like preliminary testing. It is used to derive an alternative estimator for the slope in the simple errors-in-variables model, combining OLS and the modified instrumental variable estimator by Fuller. Small sample properties of the new estimator are investigated by means of a Monte Carlo study.  相似文献   

8.
Our article presents a general treatment of the linear regression model, in which the error distribution is modelled nonparametrically and the error variances may be heteroscedastic, thus eliminating the need to transform the dependent variable in many data sets. The mean and variance components of the model may be either parametric or nonparametric, with parsimony achieved through variable selection and model averaging. A Bayesian approach is used for inference with priors that are data-based so that estimation can be carried out automatically with minimal input by the user. A Dirichlet process mixture prior is used to model the error distribution nonparametrically; when there are no regressors in the model, the method reduces to Bayesian density estimation, and we show that in this case the estimator compares favourably with a well-regarded plug-in density estimator. We also consider a method for checking the fit of the full model. The methodology is applied to a number of simulated and real examples and is shown to work well.  相似文献   

9.
Abstract

Many researchers used auxiliary information together with survey variable to improve the efficiency of population parameters like mean, variance, total and proportion. Ratio and regression estimation are the most commonly used methods that utilized auxiliary information in different ways to get the maximum benefits in the form of high precision of the estimators. Thompson first introduced the concept of Adaptive cluster sampling, which is an appropriate technique for collecting the samples from rare and clustered populations. In this article, a generalized exponential type estimator is proposed and its properties have been studied for the estimation of rare and highly clustered population variance using single auxiliary information. A numerical study is carried out on a real and artificial population to judge the performance of the proposed estimator over the competing estimators. It is shown that the proposed generalized exponential type estimator is more efficient than the adaptive and non adaptive estimators under conventional sampling design.  相似文献   

10.
Several estimators, including the classical and the regression estimators of finite population mean, are compared, both theoretically and empirically, under a calibration model, where the dependent variable(y), and not the independent variable(x), can be observed for all units of the finite population. It is shown asymptotically that when conditioned on x, the bias of the classical estimator may be much smaller than that of the regression estimators; whereas when conditioned on y, the regression estimator may have much smaller conditional bias than the classical estimator. Since all the y's(not x's) can be observed, it seems appropriate to make comparison under the conditional distribution of each estimator with y fixed. In this case, the regression estimator has smaller variance, smaller conditional bias, and the conditional coverage probability closer to its nominal level  相似文献   

11.
ABSTRACT

The present work is an attempt to make use of several auxiliary variables at both the occasions for improving the precision of estimates at the current occasion on two occasions of successive sampling. Chain-type ratio estimator has been proposed for estimating the population mean at current occasion in two occasions rotation (successive) sampling. Theoretical properties of the proposed estimator have been investigated. The proposed estimator has been compared with simple mean estimator when there is no matching and with the ratio estimator in successive sampling when information is available on one auxiliary variable on both the occasions. Optimum replacement strategy has also been discussed. Theoretical results have been justified through empirical investigation.  相似文献   

12.
Calibration on the available auxiliary variables is widely used to increase the precision of the estimates of parameters. Singh and Sedory [Two-step calibration of design weights in survey sampling. Commun Stat Theory Methods. 2016;45(12):3510–3523.] considered the problem of calibration of design weights under two-step for single auxiliary variable. For a given sample, design weights and calibrated weights are set proportional to each other, in the first step. While, in the second step, the value of proportionality constant is determined on the basis of objectives of individual investigator/user for, for example, to get minimum mean squared error or reduction of bias. In this paper, we have suggested to use two auxiliary variables for two-step calibration of the design weights and compared the results with single auxiliary variable for different sample sizes based on simulated and real-life data set. The simulated and real-life application results show that two-auxiliary variables based two-step calibration estimator outperforms the estimator under single auxiliary variable in terms of minimum mean squared error.  相似文献   

13.
We examine the risk of a pre-test estimator for regression coefficients after a pre-test for homoskedasticity under the Balanced Loss Function (BLF). We show analytically that the two stage Aitken estimator is dominated by the pre-test estimator with the critical value of unity, even if the BLF is used. We also show numerically that both the two stage Aitken estimator and the pre-test estimator can be dominated by the ordinary least squares estimator when “goodness of fit” is regarded as more important than precision of estimation.  相似文献   

14.
The marginal likelihood can be notoriously difficult to compute, and particularly so in high-dimensional problems. Chib and Jeliazkov employed the local reversibility of the Metropolis–Hastings algorithm to construct an estimator in models where full conditional densities are not available analytically. The estimator is free of distributional assumptions and is directly linked to the simulation algorithm. However, it generally requires a sequence of reduced Markov chain Monte Carlo runs which makes the method computationally demanding especially in cases when the parameter space is large. In this article, we study the implementation of this estimator on latent variable models which embed independence of the responses to the observables given the latent variables (conditional or local independence). This property is employed in the construction of a multi-block Metropolis-within-Gibbs algorithm that allows to compute the estimator in a single run, regardless of the dimensionality of the parameter space. The counterpart one-block algorithm is also considered here, by pointing out the difference between the two approaches. The paper closes with the illustration of the estimator in simulated and real-life data sets.  相似文献   

15.
In RSS, the variance of observations in each ranked set plays an important role in finding an optimal design for unbalanced RSS and in inferring the population mean. The empirical estimator (i.e., the sample variance in a given ranked set) is most commonly used for estimating the variance in the literature. However, the empirical estimator does not use the information in the entire data over different ranked sets. Further, it is highly variable when the sample size is not large enough, as is typical in RSS applications. In this paper, we propose a plug-in estimator for the variance of each set, which is more efficient than the empirical one. The estimator uses a result in order statistics which characterizes the cumulative distribution function (CDF) of the rth order statistics as a function of the population CDF. We analytically prove the asymptotic normality of the proposed estimator. We further apply it to estimate the standard error of the RSS mean estimator. Both our simulation and empirical study show that our estimators consistently outperform existing methods.  相似文献   

16.
One of the standard variable selection procedures in multiple linear regression is to use a penalisation technique in least‐squares (LS) analysis. In this setting, many different types of penalties have been introduced to achieve variable selection. It is well known that LS analysis is sensitive to outliers, and consequently outliers can present serious problems for the classical variable selection procedures. Since rank‐based procedures have desirable robustness properties compared to LS procedures, we propose a rank‐based adaptive lasso‐type penalised regression estimator and a corresponding variable selection procedure for linear regression models. The proposed estimator and variable selection procedure are robust against outliers in both response and predictor space. Furthermore, since rank regression can yield unstable estimators in the presence of multicollinearity, in order to provide inference that is robust against multicollinearity, we adjust the penalty term in the adaptive lasso function by incorporating the standard errors of the rank estimator. The theoretical properties of the proposed procedures are established and their performances are investigated by means of simulations. Finally, the estimator and variable selection procedure are applied to the Plasma Beta‐Carotene Level data set.  相似文献   

17.
Simple nonparametric estimates of the conditional distribution of a response variable given a covariate are often useful for data exploration purposes or to help with the specification or validation of a parametric or semi-parametric regression model. In this paper we propose such an estimator in the case where the response variable is interval-censored and the covariate is continuous. Our approach consists in adding weights that depend on the covariate value in the self-consistency equation proposed by Turnbull (J R Stat Soc Ser B 38:290–295, 1976), which results in an estimator that is no more difficult to implement than Turnbull’s estimator itself. We show the convergence of our algorithm and that our estimator reduces to the generalized Kaplan–Meier estimator (Beran, Nonparametric regression with randomly censored survival data, 1981) when the data are either complete or right-censored. We demonstrate by simulation that the estimator, bootstrap variance estimation and bandwidth selection (by rule of thumb or cross-validation) all perform well in finite samples. We illustrate the method by applying it to a dataset from a study on the incidence of HIV in a group of female sex workers from Kinshasa.  相似文献   

18.
The sampling designs dependent on sample moments of auxiliary variables are well known. Lahiri (Bull Int Stat Inst 33:133–140, 1951) considered a sampling design proportionate to a sample mean of an auxiliary variable. Sing and Srivastava (Biometrika 67(1):205–209, 1980) proposed the sampling design proportionate to a sample variance while Wywiał (J Indian Stat Assoc 37:73–87, 1999) a sampling design proportionate to a sample generalized variance of auxiliary variables. Some other sampling designs dependent on moments of an auxiliary variable were considered e.g. in Wywiał (Some contributions to multivariate methods in, survey sampling. Katowice University of Economics, Katowice, 2003a); Stat Transit 4(5):779–798, 2000) where accuracy of some sampling strategies were compared, too.These sampling designs cannot be useful in the case when there are some censored observations of the auxiliary variable. Moreover, they can be much too sensitive to outliers observations. In these cases the sampling design proportionate to the order statistic of an auxiliary variable can be more useful. That is why such an unequal probability sampling design is proposed here. Its particular cases as well as its conditional version are considered, too. The sampling scheme implementing this sampling design is proposed. The inclusion probabilities of the first and second orders were evaluated. The well known Horvitz–Thompson estimator is taken into account. A ratio estimator dependent on an order statistic is constructed. It is similar to the well known ratio estimator based on the population and sample means. Moreover, it is an unbiased estimator of the population mean when the sample is drawn according to the proposed sampling design dependent on the appropriate order statistic.  相似文献   

19.
Oracle Inequalities for Convex Loss Functions with Nonlinear Targets   总被引:1,自引:1,他引:0  
This article considers penalized empirical loss minimization of convex loss functions with unknown target functions. Using the elastic net penalty, of which the Least Absolute Shrinkage and Selection Operator (Lasso) is a special case, we establish a finite sample oracle inequality which bounds the loss of our estimator from above with high probability. If the unknown target is linear, this inequality also provides an upper bound of the estimation error of the estimated parameter vector. Next, we use the non-asymptotic results to show that the excess loss of our estimator is asymptotically of the same order as that of the oracle. If the target is linear, we give sufficient conditions for consistency of the estimated parameter vector. We briefly discuss how a thresholded version of our estimator can be used to perform consistent variable selection. We give two examples of loss functions covered by our framework.  相似文献   

20.
A polynomial functional relationship with errors in both variables can be consistently estimated by constructing an ordinary least squares estimator for the regression coefficients, assuming hypothetically the latent true regressor variable to be known, and then adjusting for the errors. If normality of the error variables can be assumed, the estimator can be simplified considerably. Only the variance of the errors in the regressor variable and its covariance with the errors of the response variable need to be known. If the variance of the errors in the dependent variable is also known, another estimator can be constructed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号