首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 22 毫秒
1.
Abstract

This article focuses on reducing the additional variance due to randomization of the responses. The idea of additive scrambling and its inverse has been used along with (i) split sample approach and (ii) double response approach. Specifically, our proposal is based on Gupta et al. (2006) randomized response model. We selected this model for improvement because it provides estimator of mean and sensitivity level of a sensitive variable and is better than all of its competitors proposed earlier to it and even Gupta et al. (2006) sensitivity estimator is better than that of Gupta et al. (2010). Our suggested estimators are unbiased estimators and perform better than Gupta et al. (2006) estimator. The issue of privacy protection is also discussed.  相似文献   

2.
ABSTRACT

In this paper, assuming that there exist omitted variables in the specified model, we analytically derive the exact formula for the mean squared error (MSE) of a heterogeneous pre-test (HPT) estimator whose components are the ordinary least squares (OLS) and feasible ridge regression (FRR) estimators. Since we cannot examine the MSE performance analytically, we execute numerical evaluations to investigate small sample properties of the HPT estimator, and compare the MSE performance of the HPT estimator with those of the FRR estimator and the usual OLS estimator. Our numerical results show that (1) the HPT estimator is more efficient when the model misspecification is severe; (2) the HPT estimator with the optimal critical value obtained under the correctly specified model can be safely used even when there exist omitted variables in the specified model.  相似文献   

3.
Randomized response techniques are widely employed in surveys dealing with sensitive questions to ensure interviewee anonymity and reduce nonrespondents rates and biased responses. Since Warner’s (J Am Stat Assoc 60:63–69, 1965) pioneering work, many ingenious devices have been suggested to increase respondent’s privacy protection and to better estimate the proportion of people, π A , bearing a sensitive attribute. In spite of the massive use of auxiliary information in the estimation of non-sensitive parameters, very few attempts have been made to improve randomization strategy performance when auxiliary variables are available. Moving from Zaizai’s (Model Assist Stat Appl 1:125–130, 2006) recent work, in this paper we provide a class of estimators for π A , for a generic randomization scheme, when the mean of a supplementary non-sensitive variable is known. The minimum attainable variance bound of the class is obtained and the best estimator is also identified. We prove that the best estimator acts as a regression-type estimator which is at least as efficient as the corresponding estimator evaluated without allowing for the auxiliary variable. The general results are then applied to Warner and Simmons’ model.  相似文献   

4.
Combining-100 information from multiple samples is often needed in biomedical and economic studies, but differences between these samples must be appropriately taken into account in the analysis of the combined data. We study the estimation for moment restriction models with data combined from two samples under an ignorability-type assumption while allowing for different marginal distributions of variables common to both samples. Suppose that an outcome regression (OR) model and a propensity score (PS) model are specified. By leveraging semi-parametric efficiency theory, we derive an augmented inverse probability-weighted (AIPW) estimator that is locally efficient and doubly robust with respect to these models. Furthermore, we develop calibrated regression and likelihood estimators that are not only locally efficient and doubly robust but also intrinsically efficient in achieving smaller variances than the AIPW estimator when the PS model is correctly specified but the OR model may be mispecified. As an important application, we study the two-sample instrumental variable problem and derive the corresponding estimators while allowing for incompatible distributions of variables common to the two samples. Finally, we provide a simulation study and an econometric application on public housing projects to demonstrate the superior performance of our improved estimators. The Canadian Journal of Statistics 48: 259–284; 2020 © 2019 Statistical Society of Canada  相似文献   

5.
Abstract

In this paper, we consider the estimation of a sensitive character when the population is consisted of several strata; this is undertaken by applying Niharika et al.’s model which is using geometric distribution as a randomization device. A sensitive parameter is estimated for the case in which stratum size is known, and proportional and optimum allocation methods are taken into account. We extended the Niharika et al.’s model to the case of an unknown stratum size; a sensitive parameter is estimated by applying stratified double sampling to the Niharika et al.’s model. Finally, the efficiency of the proposed model is compared with that of Niharika et al. in terms of the estimator variance.  相似文献   

6.
ABSTRACT

The clinical trials are usually designed with the implicit assumption that data analysis will occur only after the trial is completed. It is a challenging problem if the sponsor wishes to evaluate the drug efficacy in the middle of the study without breaking the randomization codes. In this article, the randomized response model and mixture model are introduced to analyze the data, masking the randomization codes of the crossover design. Given the probability of treatment sequence, the test of mixture model provides higher power than the test of randomized response model, which is inadequate in the example. The paired t-test has higher powers than both models if the randomization codes are broken. The sponsor may stop the trial early to claim the effectiveness of the study drug if the mixture model concludes a positive result.  相似文献   

7.
We consider the situation where sample surveys are to be undertaken on sensitive or stigmatizing issues. For such surveys, direct questioning methods usually lead to non-compliance or incorrect responses and so, the randomized response technique, where the responses are collected through some randomization device, is found to be useful. A majority of the literature on these techniques focus on dichotomous sensitive variables, while some techniques are also available for continuous sensitive variables. In this article, we focus on the extent of privacy protection available in sample surveys to respondents for continuous response variables. We also propose two measures of privacy protection. We demonstrate that the parameters of our randomization scheme can be so chosen as to achieve a pre-assigned level of privacy protection while at the same time yielding efficient estimates. We also show some numerical comparisons.  相似文献   

8.
Summary.  A new methodology is developed for estimating unemployment or employment characteristics in small areas, based on the assumption that the sample totals of unemployed and employed individuals follow a multinomial logit model with random area effects. The method is illustrated with UK labour force data aggregated by sex–age groups. For these data, the accuracy of direct estimates is poor in comparison with estimates that are derived from the multinomial logit model. Furthermore, two different estimators of the mean-squared errors are given: an analytical approximation obtained by Taylor linearization and an estimator based on bootstrapping. A simulation study for comparison of the two estimators shows the good performance of the bootstrap estimator.  相似文献   

9.
Randomized response methods for quantitative sensitive data are treated in an unified approach which includes the use of auxiliary information at the estimation stage. A class of estimators for the mean of a sensitive variable is proposed under a generic randomization model and the optimum estimator is obtained. Some special models are discussed in detail. To evaluate the degree of respondents’ confidentiality in models using auxiliary variables, a new measure of privacy protection is introduced. Different models are then compared both from the perspective of efficiency and privacy protection.  相似文献   

10.
ABSTRACT

In this article we evaluate the performance of a randomization test for a subset of regression coefficients in a linear model. This randomization test is based on random permutations of the independent variables. It is shown that the method maintains its level of significance, except for extreme situations, and has power that approximates the power of another randomization test, which is based on the permutation of residuals from the reduced model. We also show, via an example, that the method of permuting independent variables is more valuable than other randomization methods because it can be used in connection with the downweighting of outliers.  相似文献   

11.
Consider a linear regression model with some relevant regressors are unobservable. In such a situation, we estimate the model by using the proxy variables as regressors or by simply omitting the relevant regressors. In this paper, we derive the explicit formula of predictive mean squared error (PMSE) of a general family of shrinkage estimators of regression coefficients. It is shown analytically that the positive-part shrinkage estimator dominates the ordinary shrinkage estimator even when proxy variables are used in place of the unobserved variables. Also, as an example, our result is applied to the double k-class estimator proposed by Ullah and Ullah (Double k-class estimators of coefficients in linear regression. Econometrica. 1978;46:705–722). Our numerical results show that the positive-part double k-class estimator with proxy variables has preferable PMSE performance.  相似文献   

12.
ABSTRACT

In the paper, we consider a natural estimator of the offspring mean of a branching process with non stationary immigration based on observation of population sizes and number of immigrating individuals to each generation. We demonstrate that using a central limit theorem for multiple sums of dependent random variables it is possible to derive asymptotic distributions for the estimator without prior knowledge about the behavior (criticality) of the reproduction process. Before the three cases of criticality have been considered separately. Assuming that the immigration mean and variance vary regularly, conditions guaranteeing the strong consistency of the proposed estimator is also derived.  相似文献   

13.
ABSTRACT

This paper deals with the problem of estimating the finite population mean in stratified random sampling by using two auxiliary variables. This paper proposed a ratio-cum-product exponential type estimator of population mean under different situations: (i) when there is presence of non-response and measurement errors on the study as well as auxiliary variables; (ii) when there is non-response on the study and auxiliary variables but with no measurement error; (iii) when there is complete response on study variable but there is presence of non-response and measurement error on the auxiliary variables and (iv) when there are complete response and measurement error on study as well as auxiliary variables. The expressions of the bias and mean square error of the proposed estimator have been obtained up to the first degree of approximation. The proposed estimator has been compared with usual unbiased estimator, ratio estimator and other existing estimators and the conditions obtained to show the efficacy of the proposed estimator over other considered estimators. Simulation study is carried out to support the theoretical findings.  相似文献   

14.

Cressie et al. (2000; 2003) introduced and studied a new family of statistics, based on the φ-divergence measure, for solving the problem of testing a nested sequence of loglinear models. In that family of test statistics the parameters are estimated using the minimum φ-divergence estimator which is a generalization of the maximum likelihood estimator. In this paper we study the minimum power-divergence estimator (the most important family of minimum φ-divergence estimator) for a nested sequence of loglinear models in three-way contingency tables under assumptions of multinomial sampling. A simulation study illustrates that the minimum chi-squared estimator is simultaneously the most robust and efficient estimator among the family of the minimum power-divergence estimator.  相似文献   

15.
Kalucha et al. (Kalucha G., Gupta S., Dass B. K. (accepted). Ratio estimation of finite population mean using optional randomized response models. Journal of Statistical Theory and Practice) introduced an additive ratio estimator for finite population mean of a sensitive variable in simple random sampling without replacement and showed that this estimator performs better than the ordinary mean estimator based on an optional randomized response technique (RRT). In this paper, we introduce a regression estimator that performs better than the ratio estimator even for the modest correlation between the study and the auxiliary variables. A comparison of the proposed estimator with the corresponding ratio estimator and the ordinary RRT mean estimator is carried out theoretically, and is also illustrated with a simulation study.  相似文献   

16.
Kurt Hoffmann 《Statistics》2013,47(3):302-311
The purpose of this paper consists in deriving estimators which are less sensitive than the least squares estimator, when the assumption that the expectation vector lies in a certain linear subspace is violated. The obtained robust estimators are convex combinations of the least squares estimator and of the random vector Y.  相似文献   

17.
ABSTRACT

In order to investigate the convergence rate of the asymptotic normality for the estimator of the conditional mode function for the left-truncation model, we derive a Berry–Esseen type bound of the estimator when the lifetime observations with multivariate covariates form a stationary α-mixing sequence. The finite sample performance of the estimator of the conditional mode function is explored through simulations.  相似文献   

18.
The randomized-response (RR) technique is an effective survey method when collecting sensitive information. In this technique, a probability mechanism using randomization devices is commonly involved in answering to sensitive questions. In order to evaluate the survey at the most accurate extend, self-protection (SP) is introduced to describe the responses by participants who give the evasive answer without taking the result of the randomization device into account. In this study, we propose a Bayesian approach to modeling RR sum score variables under SP assumption. RR data from a Dutch survey on non-compliance with social security regulation in 2004 is used to demonstrate the proposed models.  相似文献   

19.
ABSTRACT

In this article, we discuss the superiority of r-k class estimator over some estimators in a misspecified linear model. We derive the necessary and sufficient conditions for the superiority of the r-k class estimator over each of these estimators under the Mahalanobis loss function by the average loss criterion in the misspecified linear model.  相似文献   

20.
In this paper, we suggest a new randomized response model useful for collecting information on quantitative sensitive variables such as drug use and income. The resultant estimator has been found to be better than the usual additive randomized response model. An interesting feature of the proposed model is that it is free from the known parameters of the scrambling variable unlike the additive model due to Himmelfarb and Edgell [S. Himmelfarb and S.E. Edgell, Additive constant model: a randomized response technique for eliminating evasiveness to quantitative response questions, Psychol. Bull. 87(1980), 525–530]. Relative efficiency of the proposed model has also been studied with the corresponding competitors. At the end, an application of the proposed model has been discussed.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号