首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
基于回归组合技术的连续性抽样估计方法研究   总被引:1,自引:1,他引:0  
在使用样本轮换的连续性抽样调查中,不仅可以利用前期调查的研究变量的信息,还可使用现期调查的辅助变量信息来建立回归模型进行回归估计,进而构造回归组合估计量,并在此基础上确定最优样本轮换率和最优权重系数,使得回归组合估计量的方差最小,从而更大程度地提高连续性抽样调查的估计精度。  相似文献   

2.
In many biomedical studies, it is common that due to budget constraints, the primary covariate is only collected in a randomly selected subset from the full study cohort. Often, there is an inexpensive auxiliary covariate for the primary exposure variable that is readily available for all the cohort subjects. Valid statistical methods that make use of the auxiliary information to improve study efficiency need to be developed. To this end, we develop an estimated partial likelihood approach for correlated failure time data with auxiliary information. We assume a marginal hazard model with common baseline hazard function. The asymptotic properties for the proposed estimators are developed. The proof of the asymptotic results for the proposed estimators is nontrivial since the moments used in estimating equation are not martingale-based and the classical martingale theory is not sufficient. Instead, our proofs rely on modern empirical process theory. The proposed estimator is evaluated through simulation studies and is shown to have increased efficiency compared to existing methods. The proposed method is illustrated with a data set from the Framingham study.  相似文献   

3.
We consider failure time regression analysis with an auxiliary variable in the presence of a validation sample. We extend the nonparametric inference procedure of Zhou and Pepe to handle a continuous auxiliary or proxy covariate. We estimate the induced relative risk function with a kernel smoother and allow the selection probability of the validation set to depend on the observed covariates. We present some asymptotic properties for the kernel estimator and provide some simulation results. The method proposed is illustrated with a data set from an on-going epidemiologic study.  相似文献   

4.
This paper deals with the analysis of proportional rate model for recurrent event data when covariates are subject to missing. The true covariate is measured only on a randomly chosen validation set, whereas auxiliary information is available for all cohort subjects. To further utilize the auxiliary information to improve study efficiency, we propose an estimated estimating equation for the regression parameters. The resulting estimators are shown to be consistent and asymptotically normal. Both graphical and numerical techniques for checking the adequacy of the model are presented. Simulations are conducted to evaluate the finite sample performance of the proposed estimators. Illustration with a real medical study is provided.  相似文献   

5.
Abstract

Many researchers used auxiliary information together with survey variable to improve the efficiency of population parameters like mean, variance, total and proportion. Ratio and regression estimation are the most commonly used methods that utilized auxiliary information in different ways to get the maximum benefits in the form of high precision of the estimators. Thompson first introduced the concept of Adaptive cluster sampling, which is an appropriate technique for collecting the samples from rare and clustered populations. In this article, a generalized exponential type estimator is proposed and its properties have been studied for the estimation of rare and highly clustered population variance using single auxiliary information. A numerical study is carried out on a real and artificial population to judge the performance of the proposed estimator over the competing estimators. It is shown that the proposed generalized exponential type estimator is more efficient than the adaptive and non adaptive estimators under conventional sampling design.  相似文献   

6.
In this paper we propose a modified version of the estimator of Hansen and Hurwitz [12] in the case of quantitative sensitive variable and consider a randomization mechanism on the second call that provides privacy protection to the respondents to get truthful information. We use variance of the modified estimator as a tool to measure privacy protection and it is observed that the higher is the variance, the lower is the efficiency but the higher is the privacy protection. To overcome this efficiency loss, we consider a linear regression estimator using known non-sensitive auxiliary information. With consideration of four scrambled models, we try to make a trade-off between efficiency and privacy protection. To show this compromise, analytical and numerical comparisons are obtained.  相似文献   

7.
Regression Type Estimators Using Multiple Auxiliary Information   总被引:2,自引:0,他引:2  
In this paper we consider a practical situation where information on two auxiliary variables related to the study variable is available at different levels. Following Kiregyera (1980, 1984) who has obtained a chain ratio-to-regression estimator and regression to regression estimator, we shall study several estimators that arise naturally in this context and compare them under the mean square error criterion. We extend these results to the case when multiple auxiliary information is available.  相似文献   

8.
Abstract. Systematic sampling is frequently used in surveys, because of its ease of implementation and its design efficiency. An important drawback of systematic sampling, however, is that no direct estimator of the design variance is available. We describe a new estimator of the model‐based expectation of the design variance, under a non‐parametric model for the population. The non‐parametric model is sufficiently flexible that it can be expected to hold at least approximately in many situations with continuous auxiliary variables observed at the population level. We prove the model consistency of the estimator for both the anticipated variance and the design variance under a non‐parametric model with a univariate covariate. The broad applicability of the approach is demonstrated on a dataset from a forestry survey.  相似文献   

9.
In recent years, regression models have been shown to be useful for predicting the long-term survival probabilities of patients in clinical trials. The importance of a regression model is that once the regression parameters are estimated information about the regressed quantity is immediate. A simple estimator is proposed for the regression parameters in a model for the long-term survival rate. The proposed estimator is seen to arise from an estimating function that has the missing information principle underlying its construction. When the covariate takes values in a finite set, the proposed estimating function is equivalent to an ad hoc estimating function proposed in the literature. However, in general, the two estimating functions lead to different estimators of the regression parameter. For discrete covariates, the asymptotic covariance matrix of the proposed estimator is simple to calculate using standard techniques involving the predictable covariation process of martingale transforms. An ad hoc extension to the case of a one-dimensional continuous covariate is proposed. Simplicity and generalizability are two attractive features of the proposed approach. The last mentioned feature is not enjoyed by the other estimator.  相似文献   

10.
容越彦  陈光慧 《统计研究》2015,32(12):88-94
在总结现有模型辅助估计方法的基础上,本文通过构造一种半参数超总体模型,同时结合广义差分估计思想提出一种新型的模型辅助估计量。该估计量比传统的非参数和半参数回归估计利用更少、更易得到的辅助信息,即只需利用和广义回归估计相同的辅助信息,但一般会比广义回归估计拥有更高的估计精度。理论证明了该估计量是渐近设计无偏和设计一致的,其渐近设计均方误差为广义差分估计量的方差。模拟结果显示:其至少与广义回归估计一样好;对于线性程度越低的超总体模型,其估计精度比广义回归估计有越明显的提高;就本文模拟而言,光滑参数在0.04~0.12间适当取值时其会取到相对较好的估计效果。  相似文献   

11.
In this article, we study a marginal hazard model with common baseline hazard for correlated failure time data. We assume that the true covariate is measured precisely in a subset of the whole study cohort, whereas an auxiliary information for the true covariate is available for the whole cohort. We first estimate the relative risk function empirically. Then we obtain the estimator for the regression parameter by replacing the relative risk function with its estimator in a generalized estimating equation (GEE) proposed by Cai (1992 Cai , J. ( 1992 ). Generalized estimation equations for censored multivariate failure time data. Ph.D. thesis, University of Washington, Seattle, Washington . [Google Scholar]). A key feature of this method is that it is nonparametric with respect to the association between the missing covariate and the observed auxiliary covariate. The proposed estimator is shown to be consistent and asymptotically normal. Furthermore, we present a corrected Breslow-type estimator for the cumulative hazard function. Simulation studies are conducted to evaluate the proposed method.  相似文献   

12.
In this article, we consider the problem of estimation of population mean using the known median of auxiliary variable. We proposed an estimator and its efficiency is studied analytically as well as empirically for different conditions. The proposed estimator is found to be more efficient than traditional estimators such as sample mean and linear regression estimator.  相似文献   

13.
Summary. The paper considers canonical link generalized linear models with stratum-specific nuisance intercepts and missing covariate data. This family includes the conditional logistic regression model. Existing methods for this problem, each of which uses a conditioning argu- ment to eliminate the nuisance intercept, model either the missing covariate data or the missingness process. The paper compares these methods under a common likelihood framework. The semiparametric efficient estimator is identified, and a new estimator, which reduces dependence on the model for the missing covariate, is proposed. A simulation study compares the methods with respect to efficiency and robustness to model misspecification.  相似文献   

14.
Random coefficient model (RCM) is a powerful statistical tool in analyzing correlated data collected from studies with different clusters or from longitudinal studies. In practice, there is a need for statistical methods that allow biomedical researchers to adjust for the measured and unmeasured covariates that might affect the regression model. This article studies two nonparametric methods dealing with auxiliary covariate data in linear random coefficient models. We demonstrate how to estimate the coefficients of the models and how to predict the random effects when the covariates are missing or mismeasured. We employ empirical estimator and kernel smoother to handle a discrete and continuous auxiliary, respectively. Simulation results show that the proposed methods perform better than an alternative method that only uses data in the validation data set and ignores the random effects in the random coefficient model.  相似文献   

15.
This article presents the calibration procedure of the two-phase randomized response (RR) technique for surveying the sensitive characteristic. When the sampling scheme is two-phase or double sampling, auxiliary information known from the entire population can be used, but the auxiliary information should be information available from both the first and second phases of the sample. If there is auxiliary information available from both the first and second phases, then we can improve the ordinary two-phase RR estimator by incorporating this information in the estimation procedure. In this article, we used the new two-step Newton's method for computing unknown constants in the calibration procedure and compared the efficiency of the proposed estimator through some numerical study.  相似文献   

16.
黄莺  李金昌 《统计研究》2008,25(7):66-69
校正估计法已被大量运用于抽样调查中,它利用辅助信息构造的校正权重提高了对总体总值(或均值)的估计精度。本文提出了分层抽样中的校正组合比率估计量,并推广到分层双重抽样中。同时给出新估计量的近似方差表达式。最后利用计算机随机模拟验证较正估计量对估计精度的改进。  相似文献   

17.
We propose a profile conditional likelihood approach to handle missing covariates in the general semiparametric transformation regression model. The method estimates the marginal survival function by the Kaplan-Meier estimator, and then estimates the parameters of the survival model and the covariate distribution from a conditional likelihood, substituting the Kaplan-Meier estimator for the marginal survival function in the conditional likelihood. This method is simpler than full maximum likelihood approaches, and yields consistent and asymptotically normally distributed estimator of the regression parameter when censoring is independent of the covariates. The estimator demonstrates very high relative efficiency in simulations. When compared with complete-case analysis, the proposed estimator can be more efficient when the missing data are missing completely at random and can correct bias when the missing data are missing at random. The potential application of the proposed method to the generalized probit model with missing continuous covariates is also outlined.  相似文献   

18.
Semiparametric transformation models provide flexible regression models for survival analysis, including the Cox proportional hazards and the proportional odds models as special cases. We consider the application of semiparametric transformation models in case-cohort studies, where the covariate data are observed only on cases and on a subcohort randomly sampled from the full cohort. We first propose an approximate profile likelihood approach with full-cohort data, which amounts to the pseudo-partial likelihood approach of Zucker [2005. A pseudo-partial likelihood method for semiparametric survival regression with covariate errors. J. Amer. Statist. Assoc. 100, 1264–1277]. Simulation results show that our proposal is almost as efficient as the nonparametric maximum likelihood estimator. We then extend this approach to the case-cohort design, applying the Horvitz–Thompson weighting method to the estimating equations from the approximated profile likelihood. Two levels of weights can be utilized to achieve unbiasedness and to gain efficiency. The resulting estimator has a closed-form asymptotic covariance matrix, and is found in simulations to be substantially more efficient than the estimator based on martingale estimating equations. The extension to left-truncated data will be discussed. We illustrate the proposed method on data from a cardiovascular risk factor study conducted in Taiwan.  相似文献   

19.
Whenever there is auxiliary information available in any form, the researchers want to utilize it in the method of estimation to obtain the most efficient estimator. When there exists enough amount of correlation between the study and the auxiliary variables, and parallel to these associations, the ranks of the auxiliary variables are also correlated with the study variable, which can be used a valuable device for enhancing the precision of an estimator accordingly. This article addresses the problem of estimating the finite population mean that utilizes the complementary information in the presence of (i) the auxiliary variable and (ii) the ranks of the auxiliary variable for non response. We suggest an improved estimator for estimating the finite population mean using the auxiliary information in the presence of non response. Expressions for bias and mean squared error of considered estimators are derived up to the first order of approximation. The performance of estimators is compared theoretically and numerically. A numerical study is carried out to evaluate the performances of estimators. It is observed that the proposed estimator is more efficient than the usual sample mean and the regression estimators, and some other families of ratio and exponential type of estimators.  相似文献   

20.
In this note, the asymptotic variance formulas are explicitly derived and compared between the parametric and semiparametric estimators of a regression parameter and survival probability under the additive hazards model. To obtain explicit formulas, it is assumed that the covariate term including a regression coefficient follows a gamma distribution and the baseline hazard function is constant. The results show that the semiparametric estimator of the regression coefficient parameter is fully efficient relative to the parametric counterpart when the survival time and a covariate are independent, as in the proportional hazards model. Relative to a more realistic case of the parametric additive hazards model with a Weibull baseline, the loss of efficiency of the semiparametric estimator of survival probability is moderate.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号