期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Efficient inverse probability weighting method for quantile regression with nonignorable missing data

Pu-Ying Zhao De-Peng Jiang 《Statistics》2017,51(2):363-386

Quantitle regression (QR) is a popular approach to estimate functional relations between variables for all portions of a probability distribution. Parameter estimation in QR with missing data is one of the most challenging issues in statistics. Regression quantiles can be substantially biased when observations are subject to missingness. We study several inverse probability weighting (IPW) estimators for parameters in QR when covariates or responses are subject to missing not at random. Maximum likelihood and semiparametric likelihood methods are employed to estimate the respondent probability function. To achieve nice efficiency properties, we develop an empirical likelihood (EL) approach to QR with the auxiliary information from the calibration constraints. The proposed methods are less sensitive to misspecified missing mechanisms. Asymptotic properties of the proposed IPW estimators are shown under general settings. The efficiency gain of EL-based IPW estimator is quantified theoretically. Simulation studies and a data set on the work limitation of injured workers from Canada are used to illustrated our proposed methodologies. 相似文献

2.

Improved methods for moment restriction models with data combination and an application to two-sample instrumental variable estimation

Heng Shu Zhiqiang Tan 《Revue canadienne de statistique》2020,48(2):259-284

Combining-100 information from multiple samples is often needed in biomedical and economic studies, but differences between these samples must be appropriately taken into account in the analysis of the combined data. We study the estimation for moment restriction models with data combined from two samples under an ignorability-type assumption while allowing for different marginal distributions of variables common to both samples. Suppose that an outcome regression (OR) model and a propensity score (PS) model are specified. By leveraging semi-parametric efficiency theory, we derive an augmented inverse probability-weighted (AIPW) estimator that is locally efficient and doubly robust with respect to these models. Furthermore, we develop calibrated regression and likelihood estimators that are not only locally efficient and doubly robust but also intrinsically efficient in achieving smaller variances than the AIPW estimator when the PS model is correctly specified but the OR model may be mispecified. As an important application, we study the two-sample instrumental variable problem and derive the corresponding estimators while allowing for incompatible distributions of variables common to the two samples. Finally, we provide a simulation study and an econometric application on public housing projects to demonstrate the superior performance of our improved estimators. The Canadian Journal of Statistics 48: 259–284; 2020 © 2019 Statistical Society of Canada 相似文献

3.

Doubly robust empirical likelihood inference in covariate-missing data problems

Biao Zhang 《Statistics》2016,50(5):1173-1194

Missing covariate data occurs often in regression analysis. We study methods for estimating the regression coefficients in an assumed conditional mean function when some covariates are completely observed but other covariates are missing for some subjects. We adopt the semiparametric perspective of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Amer Statist Assoc. 1994;89:846–866] on regression analyses with missing covariates, in which they pioneered the use of two working models, the working propensity score model and the working conditional score model. A recent approach to missing covariate data analysis is the empirical likelihood method of Qin et al. [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503], which effectively combines unbiased estimating equations. In this paper, we consider an alternative likelihood approach based on the full likelihood of the observed data. This full likelihood-based method enables us to generate estimators for the vector of the regression coefficients that are (a) asymptotically equivalent to those of Qin et al. [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503] when the working propensity score model is correctly specified, and (b) doubly robust, like the augmented inverse probability weighting (AIPW) estimators of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Am Statist Assoc. 1994;89:846–866]. Thus, the proposed full likelihood-based estimators improve on the efficiency of the AIPW estimators when the working propensity score model is correct but the working conditional score model is possibly incorrect, and also improve on the empirical likelihood estimators of Qin, Zhang and Leung [Empirical likelihood in missing data problems. J Amer Statist Assoc. 2009;104:1492–1503] when the reverse is true, that is, the working conditional score model is correct but the working propensity score model is possibly incorrect. In addition, we consider a regression method for estimation of the regression coefficients when the working conditional score model is correctly specified; the asymptotic variance of the resulting estimator is no greater than the semiparametric variance bound characterized by the theory of Robins et al. [Estimation of regression coefficients when some regressors are not always observed. J Amer Statist Assoc. 1994;89:846–866]. Finally, we compare the finite-sample performance of various estimators in a simulation study. 相似文献

4.

Estimation of conditional cumulative incidence functions under generalized semiparametric regression models with missing covariates,with application to analysis of biomarker correlates in vaccine trials

Yanqing Sun Fei Heng Unkyung Lee Peter B. Gilbert 《Revue canadienne de statistique》2023,51(1):235-257

This article presents generalized semiparametric regression models for conditional cumulative incidence functions with competing risks data when covariates are missing by sampling design or happenstance. A doubly robust augmented inverse probability weighted (AIPW) complete-case approach to estimation and inference is investigated. This approach modifies IPW complete-case estimating equations by exploiting the key features in the relationship between the missing covariates and the phase-one data to improve efficiency. An iterative numerical procedure is derived to solve the nonlinear estimating equations. The asymptotic properties of the proposed estimators are established. A simulation study examining the finite-sample performances of the proposed estimators shows that the AIPW estimators are more efficient than the IPW estimators. The developed method is applied to the RV144 HIV-1 vaccine efficacy trial to investigate vaccine-induced IgG binding antibodies to HIV-1 as correlates of acquisition of HIV-1 infection while taking account of whether the HIV-1 sequences are near or far from the HIV-1 sequences represented in the vaccine construct. 相似文献

5.

Efficient Robust Estimation for Linear Models with Missing Response at Random

《Scandinavian Journal of Statistics》2018,45(2):366-381

Coefficient estimation in linear regression models with missing data is routinely carried out in the mean regression framework. However, the mean regression theory breaks down if the error variance is infinite. In addition, correct specification of the likelihood function for existing imputation approach is often challenging in practice, especially for skewed data. In this paper, we develop a novel composite quantile regression and a weighted quantile average estimation procedure for parameter estimation in linear regression models when some responses are missing at random. Instead of imputing the missing response by randomly drawing from its conditional distribution, we propose to impute both missing and observed responses by their estimated conditional quantiles given the observed data and to use the parametrically estimated propensity scores to weigh check functions that define a regression parameter. Both estimation procedures are resistant to heavy‐tailed errors or outliers in the response and can achieve nice robustness and efficiency. Moreover, we propose adaptive penalization methods to simultaneously select significant variables and estimate unknown parameters. Asymptotic properties of the proposed estimators are carefully investigated. An efficient algorithm is developed for fast implementation of the proposed methodologies. We also discuss a model selection criterion, which is based on an IC_Q‐type statistic, to select the penalty parameters. The performance of the proposed methods is illustrated via simulated and real data sets. 相似文献

6.

Model averaging procedure for varying-coefficient partially linear models with missing responses

Jie Zeng Weihu Cheng Guozhi Hu Yaohua Rong 《Journal of the Korean Statistical Society》2018,47(3):379-394

This paper is concerned with model averaging procedure for varying-coefficient partially linear models with missing responses. The profile least-squares estimation process and inverse probability weighted method are employed to estimate regression coefficients of the partially restricted models, in which the propensity score is estimated by the covariate balancing propensity score method. The estimators of the linear parameters are shown to be asymptotically normal. Then we develop the focused information criterion, formulate the frequentist model averaging estimators and construct the corresponding confidence intervals. Some simulation studies are conducted to examine the finite sample performance of the proposed methods. We find that the covariate balancing propensity score improves the performance of the inverse probability weighted estimator. We also demonstrate the superiority of the proposed model averaging estimators over those of existing strategies in terms of mean squared error and coverage probability. Finally, our approach is further applied to a real data example. 相似文献

7.

Mean response estimation with missing response in the presence of high-dimensional covariates

Yongjin Li Qihua Wang Liping Zhu 《统计学通讯:理论与方法》2017,46(2):628-643

This paper studies the problem of mean response estimation where missingness occurs to the response but multiple-dimensional covariates are observable. Two main challenges occur in this situation: curse of dimensionality and model specification. The non parametric imputation method relieves model specification but suffers curse of dimensionality, while some model-based methods such as inverse probability weighting (IPW) and augmented inverse probability weighting (AIPW) methods are the opposite. We propose a unified non parametric method to overcome the two challenges with the aiding of sufficient dimension reduction. It imposes no parametric structure on propensity score or conditional mean response, and thus retains the non parametric flavor. Moreover, the estimator achieves the optimal efficiency that a double robust estimator can attain. Simulations were conducted and it demonstrates the excellent performances of our method in various situations. 相似文献

8.

Statistical inference for fixed-effects partially linear regression models with errors in variables

Haibo Zhou Jinhong You Bin Zhou 《Statistical Papers》2010,51(3):629-650

Fixed-effects partially linear regression models are useful tools to analyze data from economic, genetic and other fields. In this paper, we consider estimation and inference procedures when some of the covariates are measured with errors. The previously proposed estimations, including difference-based series estimation (Baltagi and Li in Ann Econ Finan 3:103--116, 2002) and profile least squares estimation (Fan et al. in J Am Stat Assoc 100:781--813, 2005) are no longer consistent because of the attenuation. We propose a new estimation by taking the measurement errors into account. Our proposed estimators are shown to be consistent and asymptotically normal. Consistent estimations of the error variance are also developed. In addition, we propose a variable-selection procedure to variable selection in the parametric part. The procedure is an extension of the nonconcave penalized likelihood (Fan and Li in J Am Stat Assoc 85:1348--1360, 2001), which simultaneously selects the important variables and estimates the unknown parameters. The resulting estimate is shown to possess an oracle property. Extensive simulation studies are conducted to illustrate the finite sample performance of the proposed procedures. 相似文献

9.

Empirical likelihood weighted composite quantile regression with partially missing covariates

Jing Sun Yunyan Ma 《Journal of nonparametric statistics》2017,29(1):137-150

This paper develops a novel weighted composite quantile regression (CQR) method for estimation of a linear model when some covariates are missing at random and the probability for missingness mechanism can be modelled parametrically. By incorporating the unbiased estimating equations of incomplete data into empirical likelihood (EL), we obtain the EL-based weights, and then re-adjust the inverse probability weighted CQR for estimating the vector of regression coefficients. Theoretical results show that the proposed method can achieve semiparametric efficiency if the selection probability function is correctly specified, therefore the EL weighted CQR is more efficient than the inverse probability weighted CQR. Besides, our algorithm is computationally simple and easy to implement. Simulation studies are conducted to examine the finite sample performance of the proposed procedures. Finally, we apply the new method to analyse the US news College data. 相似文献

10.

Cluster-based multivariate outlier identification and re-weighted regression in linear models

Ekele Alih Hong Choon Ong 《Journal of applied statistics》2015,42(5):938-955

A cluster methodology, motivated by a robust similarity matrix is proposed for identifying likely multivariate outlier structure and to estimate weighted least-square (WLS) regression parameters in linear models. The proposed method is an agglomeration of procedures that begins from clustering the n-observations through a test of ‘no-outlier hypothesis’ (TONH) to a weighted least-square regression estimation. The cluster phase partition the n-observations into h-set called main cluster and a minor cluster of size n?h. A robust distance emerge from the main cluster upon which a test of no outlier hypothesis’ is conducted. An initial WLS regression estimation is computed from the robust distance obtained from the main cluster. Until convergence, a re-weighted least-squares (RLS) regression estimate is updated with weights based on the normalized residuals. The proposed procedure blends an agglomerative hierarchical cluster analysis of a complete linkage through the TONH to the Re-weighted regression estimation phase. Hence, we propose to call it cluster-based re-weighted regression (CBRR). The CBRR is compared with three existing procedures using two data sets known to exhibit masking and swamping. The performance of CBRR is further examined through simulation experiment. The results obtained from the data set illustration and the Monte Carlo study shows that the CBRR is effective in detecting multivariate outliers where other methods are susceptible to it. The CBRR does not require enormous computation and is substantially not susceptible to masking and swamping. 相似文献

11.

Generalized empirical likelihood inference in partially linear model for longitudinal data with missing response variables and error-prone covariates

Juanfang Liu Liugen Xue Ruiqin Tian 《统计学通讯:理论与方法》2017,46(19):9743-9762

In this article, we consider statistical inference for longitudinal partial linear models when the response variable is sometimes missing with missingness probability depending on the covariate that is measured with error. A generalized empirical likelihood (GEL) method is proposed by combining correction attenuation and quadratic inference functions. The method that takes into consideration the correlation within groups is used to estimate the regression coefficients. Furthermore, residual-adjusted empirical likelihood (EL) is employed for estimating the baseline function so that undersmoothing is avoided. The empirical log-likelihood ratios are proven to be asymptotically Chi-squared, and the corresponding confidence regions for the parameters of interest are then constructed. Compared with methods based on NAs, the GEL does not require consistent estimators for the asymptotic variance and bias. The numerical study is conducted to compare the performance of the EL and the normal approximation-based method, and a real example is analysed. 相似文献

12.

Two step estimation for Neyman-Scott point process with inhomogeneous cluster centers

T. Mrkvička M. Muška J. Kubečka 《Statistics and Computing》2014,24(1):91-100

This paper is concerned with parameter estimation for the Neyman-Scott point process with inhomogeneous cluster centers. Inhomogeneity depends on spatial covariates. The regression parameters are estimated at the first step using a Poisson likelihood score function. Three estimation procedures (minimum contrast method based on a modified K function, composite likelihood and Bayesian methods) are introduced for estimation of clustering parameters at the second step. The performance of the estimation methods are studied and compared via a simulation study. This work has been motivated and illustrated by ecological studies of fish spatial distribution in an inland reservoir. 相似文献

13.

Dimension-reduced empirical likelihood inference for response mean with data missing at random

Lei Wang 《Journal of nonparametric statistics》2017,29(3):594-614

To make efficient inference for mean of a response variable when the data are missing at random and the dimension of covariate is not low, we construct three bias-corrected empirical likelihood (EL) methods in conjunction with dimension-reduced kernel estimation of propensity or/and conditional mean response function. Consistency and asymptotic normality of the maximum dimension-reduced EL estimators are established. We further study the asymptotic properties of the resulting dimension-reduced EL ratio functions and the corresponding EL confidence intervals for the response mean are constructed. The finite-sample performance of the proposed estimators is studied through simulation, and an application to HIV-CD4 data set is also presented. 相似文献

14.

Optimal estimation in surrogate outcome regression problems

Xiaogang Duan Jing Qin Qihua Wang 《Revue canadienne de statistique》2010,38(4):633-646

The authors consider a double robust estimation of the regression parameter defined by an estimating equation in a surrogate outcome set‐up. Under a correct specification of the propensity score, the proposed estimator has smallest trace of asymptotic covariance matrix whether the “working outcome regression model” involved is specified correct or not, and it is particularly meaningful when it is incorrectly specified. Simulations are conducted to examine the finite sample performance of the proposed procedure. Data on obesity and high blood pressure are analyzed for illustration. The Canadian Journal of Statistics 38: 633–646; 2010 © 2010 Statistical Society of Canada 相似文献

15.

Estimation and inference for varying coefficient partially nonlinear errors-in-variables models

Yanting Xiao Zhanshou Chen 《统计学通讯:模拟与计算》2018,47(7):2023-2039

In this article, we study the varying coefficient partially nonlinear model with measurement errors in the nonparametric part. A local corrected profile nonlinear least-square estimation procedure is proposed and the asymptotic properties of the resulting estimators are established. Further, a generalized likelihood ratio (GLR) statistic is proposed to test whether the varying coefficients are constant. The asymptotic null distribution of the statistic is obtained and a residual-based bootstrap procedure is employed to compute the p-value of the statistic. Some simulations are conducted to evaluate the performance of the proposed methods. The results show that the estimating and testing procedures work well in finite samples. 相似文献

16.

A Baseline-free Procedure for Transformation Models Under Interval Censorship

Gu MG Sun L Zuo G 《Lifetime data analysis》2005,11(4):473-488

An important property of Cox regression model is that the estimation of regression parameters using the partial likelihood procedure does not depend on its baseline survival function. We call such a procedure baseline-free. Using marginal likelihood, we show that an baseline-free procedure can be derived for a class of general transformation models under interval censoring framework. The baseline-free procedure results a simplified and stable computation algorithm for some complicated and important semiparametric models, such as frailty models and heteroscedastic hazard/rank regression models, where the estimation procedures so far available involve estimation of the infinite dimensional baseline function. A detailed computational algorithm using Markov Chain Monte Carlo stochastic approximation is presented. The proposed procedure is demonstrated through extensive simulation studies, showing the validity of asymptotic consistency and normality. We also illustrate the procedure with a real data set from a study of breast cancer. A heuristic argument showing that the score function is a mean zero martingale is provided. 相似文献

17.

Restricted maximum likelihood estimation of joint mean‐covariance models

Georgios Papageorgiou 《Revue canadienne de statistique》2012,40(2):225-242

The class of joint mean‐covariance models uses the modified Cholesky decomposition of the within subject covariance matrix in order to arrive to an unconstrained, statistically meaningful reparameterisation. The new parameterisation of the covariance matrix has two sets of parameters that separately describe the variances and correlations. Thus, with the mean or regression parameters, these models have three sets of distinct parameters. In order to alleviate the problem of inefficient estimation and downward bias in the variance estimates, inherent in the maximum likelihood estimation procedure, the usual REML estimation procedure adjusts for the degrees of freedom lost due to the estimation of the mean parameters. Because of the parameterisation of the joint mean covariance models, it is possible to adapt the usual REML procedure in order to estimate the variance (correlation) parameters by taking into account the degrees of freedom lost by the estimation of both the mean and correlation (variance) parameters. To this end, here we propose adjustments to the estimation procedures based on the modified and adjusted profile likelihoods. The methods are illustrated by an application to a real data set and simulation studies. The Canadian Journal of Statistics 40: 225–242; 2012 © 2012 Statistical Society of Canada 相似文献

18.

Smoothed empirical likelihood for GARCH models with heavy-tailed errors

Jinyu Li Xingtong Chen Song Zhu 《统计学通讯:理论与方法》2013,42(24):7275-7293

ABSTRACT

This paper proposes an empirical likelihood (EL) method for estimating the GARCH(p, q) models with heavy-tailed errors. Using the kernel smoothing method, we derive a smoothed EL ratio statistic, which yields a smoothed EL estimator. Moreover, we derive a profile EL for the partial parameters in the presence of nuisance parameters. Simulations and empirical results are conducted to illustrate our proposed method. 相似文献

19.

Divergences and duality for estimation and test under moment condition models

Michel Broniatowski Amor Keziou 《Journal of statistical planning and inference》2012

We introduce estimation and test procedures through divergence minimization for models satisfying linear constraints with unknown parameter. These procedures extend the empirical likelihood (EL) method and share common features with generalized empirical likelihood approach. We treat the problems of existence and characterization of the divergence projections of probability distributions on sets of signed finite measures. We give a precise characterization of duality, for the proposed class of estimates and test statistics, which is used to derive their limiting distributions (including the EL estimate and the EL ratio statistic) both under the null hypotheses and under alternatives or misspecification. An approximation to the power function is deduced as well as the sample size which ensures a desired power for a given alternative. 相似文献

20.

Estimation in zero-inflated binomial regression with missing covariates

Alpha Oumar Diallo Aliou Diop 《Statistics》2019,53(4):839-865

We investigate inverse-probability-weighted (IPW) maximum likelihood estimation in zero-inflated binomial regression with missing-at-random covariates. Large sample properties (consistency, asymptotic normality) of the IPW estimator are established. Finite sample properties are assessed via simulations. The methodology is illustrated on a real data set. 相似文献