1.

Theoretical evaluation of prediction error in linear regression with a bivariate response variable containing missing data

Lars Erik Gangsei Trygve Almøy Solve Sæbø 《Communications in Statistics - Theory and Methods》2017,46(20):9921-9929

Methods for linear regression with multivariate response variables are well described in the statistical literature. In this study we conduct a theoretical evaluation of the expected squared prediction error in bivariate linear regression where one of the response variables contains missing data. We assume a known covariance structure for the error terms. On this basis, we evaluate three well-known estimators: standard ordinary least squares, generalized least squares, and a James–Stein inspired estimator. Theoretical risk functions are worked out for all three estimators to evaluate under which circumstances it is advantageous to take the error covariance structure into account.

2.

Empirical likelihood for nonlinear regression models with nonignorable missing responses

Zhihuang Yang Niansheng Tang 《Revue canadienne de statistique》2020,48(3):386-416

This article develops three empirical likelihood (EL) approaches to estimate parameters in nonlinear regression models in the presence of nonignorable missing responses. These are based on the inverse probability weighted (IPW) method, the augmented IPW (AIPW) method and the imputation technique. A logistic regression model is adopted to specify the propensity score. Maximum likelihood estimation is used to estimate parameters in the propensity score by combining the idea of importance sampling and imputing estimating equations. Under some regularity conditions, we obtain the asymptotic properties of the maximum EL estimators of these unknown parameters. Simulation studies are conducted to investigate the finite sample performance of our proposed estimation procedures. Empirical results provide evidence that the AIPW procedure exhibits better performance than the other two procedures. Data from a survey conducted in 2002 are used to illustrate the proposed estimation procedure. The Canadian Journal of Statistics 48: 386–416; 2020 © 2020 Statistical Society of Canada

3.

Empirical likelihood-based inference in nonlinear regression models with missing responses at random

Nian-Sheng Tang Pu-Ying Zhao 《Statistics》2013,47(6):1141-1159

This paper investigates the estimation of regression parameters and the response mean in nonlinear regression models when responses are missing with probabilities that depend on covariates. We propose four empirical likelihood (EL)-based estimators for the regression parameters and the response mean. The resulting estimators are shown to be consistent and asymptotically normal under some general assumptions. To construct the confidence regions for the regression parameters as well as the response mean, we develop four EL ratio statistics, which are proven to have the χ² distribution asymptotically. Simulation studies and an artificial data set are used to illustrate the proposed methodologies. Empirical results show that the EL method behaves better than the normal approximation method and that the coverage probabilities and average lengths depend on the selection probability function.

4.

Weighted empirical likelihood for quantile regression with non ignorable missing covariates

Xiaohui Yuan Xiaogang Dong 《Communications in Statistics - Theory and Methods》2019,48(12):3068-3084

In this paper, we propose an empirical likelihood-based weighted estimator of the regression parameter in a quantile regression model with nonignorable missing covariates. The proposed estimator is computationally simple and achieves semiparametric efficiency if the probability of missingness on the fully observed variables is correctly specified. The efficiency gain of the proposed estimator over the complete-case-analysis estimator is quantified theoretically and illustrated via simulation and a real data application.

5.

Logistic regression analysis of randomized response data with missing covariates

S.H. Hsieh S.M. Lee P.S. Shen 《Journal of statistical planning and inference》2010

Randomized response is an interview technique designed to eliminate response bias when sensitive questions are asked. In this paper, we present a logistic regression model on randomized response data when the covariates on some subjects are missing at random. In particular, we propose Horvitz and Thompson (1952)-type weighted estimators by using different estimates of the selection probabilities. We present large sample theory for the proposed estimators and show that they are more efficient than the estimator using the true selection probabilities. Simulation results support theoretical analysis. We also illustrate the approach using data from a survey of cable TV.
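As a rough illustration of the Horvitz and Thompson-type weighting mentioned in this abstract, the sketch below weights each fully observed record by the inverse of its selection probability. It estimates a simple mean rather than logistic regression coefficients, and the function name `ht_weighted_mean` and its arguments are hypothetical, not taken from the paper.

```python
import numpy as np

def ht_weighted_mean(y, observed, pi):
    """Horvitz-Thompson-type estimate of a mean from partially observed data.

    y        : values (entries for unobserved records are ignored)
    observed : boolean indicators, True where the record is observed
    pi       : selection probabilities (true or estimated), one per record
    All names here are illustrative assumptions.
    """
    y = np.asarray(y, dtype=float)
    observed = np.asarray(observed, dtype=bool)
    pi = np.asarray(pi, dtype=float)
    if np.any(pi[observed] <= 0):
        raise ValueError("selection probabilities of observed records must be positive")
    # Each observed record stands in for 1 / pi records of the full sample.
    total = np.sum(y[observed] / pi[observed])
    return total / y.size

# Four records, the second one missing; two records have selection probability 0.5.
estimate = ht_weighted_mean([1.0, float("nan"), 3.0, 4.0],
                            [True, False, True, True],
                            [0.5, 0.5, 1.0, 1.0])
```

In practice the selection probabilities would themselves be estimated, which is the setting the abstract compares against using the true probabilities.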

6.

An alternative algorithm of the empirical likelihood estimation for the parameter of a linear regression model

Şenay Özdemir Olcay Arslan 《Communications in Statistics - Simulation and Computation》2013,42(7):1913-1921

Empirical likelihood (EL) is a nonparametric method based on observations. The EL method is defined as a constrained optimization problem. The solution of this constrained optimization problem is carried out using a duality approach. In this study, we propose an alternative algorithm to solve this constrained optimization problem. The new algorithm is based on a Newton-type algorithm for the Lagrange multipliers of the constrained optimization problem. We provide a simulation study and a real data example to compare the performance of the proposed algorithm with the classical algorithm. The simulation and real data results show that the performance of the proposed algorithm is comparable with that of the existing algorithm in terms of efficiency and CPU time.
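To make the constrained optimization concrete, here is a minimal sketch of the textbook dual formulation for the simplest case, EL weights for a scalar mean, solved by a Newton iteration on the single Lagrange multiplier. It is not the alternative algorithm proposed in the paper; the function name `el_weights_for_mean` and its parameters are assumptions made for illustration.

```python
import numpy as np

def el_weights_for_mean(x, mu, tol=1e-10, max_iter=100):
    """Empirical likelihood weights under the constraint sum(w * x) = mu.

    Maximizes sum(log(w_i)) subject to sum(w_i) = 1 and sum(w_i * (x_i - mu)) = 0.
    The solution has the form w_i = 1 / (n * (1 + lam * (x_i - mu))), where the
    Lagrange multiplier lam solves sum(z_i / (1 + lam * z_i)) = 0 with z = x - mu.
    """
    x = np.asarray(x, dtype=float)
    z = x - mu
    if z.min() >= 0 or z.max() <= 0:
        raise ValueError("mu must lie strictly inside the range of the data")

    def dual(lam):
        # Concave dual objective whose maximizer is the multiplier we need.
        return np.sum(np.log1p(lam * z))

    lam = 0.0
    for _ in range(max_iter):
        denom = 1.0 + lam * z
        g = np.sum(z / denom)              # gradient of the dual objective
        if abs(g) < tol:
            break
        dg = -np.sum((z / denom) ** 2)     # second derivative, always negative
        step = -g / dg                     # Newton step
        # Halve the step until all weights stay positive and the dual does not decrease.
        while np.any(1.0 + (lam + step) * z <= 0) or dual(lam + step) < dual(lam):
            step /= 2.0
        lam += step

    weights = 1.0 / (x.size * (1.0 + lam * z))
    return weights, lam

data = np.array([1.0, 2.0, 3.0, 4.0, 10.0])
w, lam = el_weights_for_mean(data, mu=3.5)
```

For regression parameters the scalar `z` becomes a vector of estimating functions and `lam` a vector, but the structure of the iteration is the same.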

7.

Optimal robust influence functions in semiparametric regression

R. Hable P. Ruckdeschel H. Rieder 《Journal of statistical planning and inference》2010,140(1):1890

Robust statistics allows the distribution of the observations to be any member of a suitable neighborhood about an ideal model distribution. In this paper, the ideal models are semiparametric with finite-dimensional parameter of interest and a possibly infinite-dimensional nuisance parameter. In the asymptotic setup of shrinking neighborhoods, we derive and study the Hampel-type problem and the minmax MSE-problem. We show that, for all common types of neighborhood systems, the optimal influence function can be approximated by the optimal influence functions for certain parametric models. For general semiparametric regression models, we determine in case of error-in-variables and in case of error-free-variables. Finally, the results are applied to Cox regression where we compare our approach to that of Bednarski [1993. Robust estimation in Cox's regression model. Scand. J. Statist. 20, 213–225] in a small simulation study and on a real data set.

8.

Lack-of-fit testing of a regression model with response missing at random

Xiaoyu Li 《Journal of statistical planning and inference》2012,142(1):155-170

This paper proposes a class of lack-of-fit tests for fitting a linear regression model when some response variables are missing at random. These tests are based on a class of minimum integrated square distances between a kernel type estimator of a regression function and the parametric regression function being fitted. These tests are shown to be consistent against a large class of fixed alternatives. The corresponding test statistics are shown to have asymptotic normal distributions under the null hypothesis and a class of nonparametric local alternatives. Some simulation results are also presented.

9.

Empirical Likelihood Inference of the Partial Linear Isotonic Errors-in-variables Regression Models with Missing Data

Zhimeng Sun 《Communications in Statistics - Simulation and Computation》2016,45(2):671-688

This article is concerned with statistical inference for the partial linear isotonic regression model with missing responses and measurement errors in covariates. We propose an empirical likelihood ratio test statistic and show that it has a limiting weighted chi-square distribution. An adjusted empirical likelihood ratio statistic, which is shown to have a limiting standard central chi-square distribution, is then proposed. A maximum empirical likelihood estimator is also developed. A simulation study is conducted to examine the finite-sample properties of the proposed procedure.

10.

Using logistic regression for semiparametric comparison of population means and variances

Shuwen Wan Binrong Xu Biao Zhang 《Communications in Statistics - Theory and Methods》2013,42(9):2485-2503

We propose to compare population means and variances under a semiparametric density ratio model. The proposed method is easy to implement using the logistic regression procedures available in many statistical software packages, and it often works very well when data are not normal. In this paper, we construct semiparametric estimators of the differences of two population means and variances, and derive their asymptotic distributions. We prove that the proposed semiparametric estimators are asymptotically more efficient than the corresponding nonparametric ones. In addition, a simulation study and the analysis of two real data sets are presented. Finally, a short discussion is provided.

11.

Empirical likelihood inference for a semiparametric hazards regression model

Wei Chen Dehui Wang 《Communications in Statistics - Theory and Methods》2013,42(11):3236-3248

We investigated the empirical likelihood inference approach under a general class of semiparametric hazards regression models with survival data subject to right-censoring. An empirical likelihood ratio for the full 2p regression parameters involved in the model is obtained. We showed that it converged weakly to a random variable which could be written as a weighted sum of 2p independent chi-squared variables with one degree of freedom. Using this, we could construct a confidence region for parameters. We also suggested an adjusted version for the preceding statistic, whose limit followed a standard chi-squared distribution with 2p degrees of freedom.

12.

The exact density of nonparametric regression estimators: fixed design case

Aman Ullah 《Communications in Statistics - Theory and Methods》2013,42(5):1251-1254

This paper studies the exact density of a general nonparametric regression estimator when the errors are non-normal. The fixed design case is considered. The density function is derived by an application of the technique of Davis (1976).

13.

Variable importance assessment in sliced inverse regression for variable selection

Ines Jlassi 《Communications in Statistics - Simulation and Computation》2019,48(1):169-199


14.

Methods for missing covariates in logistic regression

Myunghee Cho Paik 《Communications in Statistics - Simulation and Computation》2013,42(1):1-19

Various methods have been suggested in the literature to handle a missing covariate in the presence of surrogate covariates. These methods belong to one of two paradigms. In the imputation paradigm, Pepe and Fleming (1991) and Reilly and Pepe (1995) suggested filling in missing covariates using the empirical distribution of the covariate obtained from the observed data. We can proceed one step further by imputing the missing covariate using nonparametric maximum likelihood estimates (NPMLE) of the density of the covariate. Recently Murphy and Van der Vaart (1998a) showed that such an approach yields a consistent, asymptotically normal, and semiparametric efficient estimate for the logistic regression coefficient. In the weighting paradigm, Zhao and Lipsitz (1992) suggested an estimating function using completely observed records after weighting inversely by the probability of observation. An extension of this weighting approach designed to achieve the semiparametric efficiency bound is considered by Robins, Hsieh and Newey (RHN) (1995). The two ends of each paradigm (NPMLE and RHN) attain the efficiency bound and are asymptotically equivalent. However, both require a substantial amount of computation. A question arises whether and when, in practical situations, this extensive computation is worthwhile. In this paper we investigate the performance of single and multiple imputation estimates, weighting estimates, semiparametric efficient estimates, and two new imputation estimates. Simulation studies suggest that the sample size should be substantially large (e.g. n = 2000) for NPMLE and RHN to be more efficient than simpler imputation estimates. When the sample size is moderately large (n ≤ 1500), simpler imputation estimates have as small a variance as semiparametric efficient estimates.

15.

Semi-empirical pseudo-likelihood for estimating equations in the presence of missing responses

Qihua Wang Ruimiao Luo 《Journal of statistical planning and inference》2011,141(8):2589-2599

Consider a semiparametric model which parameterizes only the conditional distribution of Y given X, f(y|x,β), and allows the marginal distribution of X to be completely arbitrary. Under the semiparametric model, we develop semi-empirical pseudo-likelihood inference with an estimating equation in the presence of missing responses. We define semi-empirical likelihood pseudo-score estimates for both the model parameter and the parameter in the estimating equation simultaneously. Also, we develop semi-empirical pseudo-likelihood ratio inference for them, respectively. A simulation was conducted to evaluate the finite sample properties of the proposed estimators and semi-empirical pseudo-likelihood approach.

16.

Inadmissibility of the maximum likelihood estimator for a multivariate normal distribution when some observations are missing

R. Radhakrishnan 《Communications in Statistics - Theory and Methods》2013,42(8):941-955


17.

Maximum likelihood estimator in a multi-phase random regression model

Gabriela Ciuperca Nicolas Dapzol 《Statistics》2013,47(4):363-381

We consider a random regression model with several-fold change-points. The results for one change-point are generalized. The maximum likelihood estimator of the parameters is shown to be consistent, and the asymptotic distribution for the estimators of the coefficients is shown to be Gaussian. The estimators of the change-points converge, with n⁻¹ rate, to the vector whose components are the left end points of the maximizing interval with respect to each change-point. The likelihood process is asymptotically equivalent to the sum of independent compound Poisson processes.

18.

Improved two-parameter estimators for the negative binomial and Poisson regression models

Merve Kandemir Çetinkaya Selahattin Kaçıranlar 《Journal of Statistical Computation and Simulation》2019,89(14):2645-2660

Negative binomial regression (NBR) and Poisson regression (PR) applications have become very popular in the analysis of count data in recent years. However, if the independent variables are highly correlated, the problem of multicollinearity arises in these models. We introduce new two-parameter estimators (TPEs) for the NBR and the PR models by unifying the two-parameter estimator (TPE) of Özkale and Kaçıranlar [The restricted and unrestricted two-parameter estimators. Commun Stat Theory Methods. 2007;36:2707–2725]. These new estimators are general estimators which include the maximum likelihood (ML) estimator, ridge estimator (RE), Liu estimator (LE) and contraction estimator (CE) as special cases. Furthermore, biasing parameters of these estimators are given and a Monte Carlo simulation is done to evaluate the performance of these estimators using the mean square error (MSE) criterion. The benefits of the new TPEs are also illustrated in an empirical application. The results show that the newly proposed TPEs for the NBR and the PR models are better than the ML estimator, the RE and the LE.

19.

A two-parameter estimator in the negative binomial regression model

《Journal of Statistical Computation and Simulation》2012,82(1):124-134

In this article, a two-parameter estimator is proposed to combat multicollinearity in the negative binomial regression model. The proposed two-parameter estimator is a general estimator which includes the maximum likelihood (ML) estimator, the ridge estimator (RE) and the Liu estimator as special cases. Some properties on the asymptotic mean-squared error (MSE) are derived and necessary and sufficient conditions for the superiority of the two-parameter estimator over the ML estimator and sufficient conditions for the superiority of the two-parameter estimator over the RE and the Liu estimator in the asymptotic mean-squared error (MSE) matrix sense are obtained. Furthermore, several methods and three rules for choosing appropriate shrinkage parameters are proposed. Finally, a Monte Carlo simulation study is given to illustrate some of the theoretical results.
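The way one estimator can contain the others as special cases is easiest to see in the ordinary linear model. The sketch below assumes the linear-model form (X'X + kI)⁻¹(X'y + k·d·β̂_OLS) for a two-parameter estimator; it is an analog chosen for illustration and does not implement the negative binomial version studied in the article. The function name `two_parameter_estimate` is hypothetical.

```python
import numpy as np

def two_parameter_estimate(X, y, k, d):
    """Linear-model two-parameter estimator (illustrative assumption).

    beta(k, d) = (X'X + k I)^{-1} (X'y + k d beta_OLS)

    Special cases of this form:
      k = 0 or d = 1 -> ordinary least squares
      d = 0          -> ridge estimator with ridge parameter k
      k = 1          -> Liu estimator with parameter d
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    xtx = X.T @ X
    xty = X.T @ y
    beta_ols = np.linalg.solve(xtx, xty)
    identity = np.eye(X.shape[1])
    return np.linalg.solve(xtx + k * identity, xty + k * d * beta_ols)

X_demo = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y_demo = np.array([1.0, 2.0, 2.0, 4.0])

beta_ols = two_parameter_estimate(X_demo, y_demo, k=0.0, d=0.5)    # reduces to OLS
beta_ridge = two_parameter_estimate(X_demo, y_demo, k=2.0, d=0.0)  # reduces to ridge
beta_liu = two_parameter_estimate(X_demo, y_demo, k=1.0, d=0.3)    # reduces to Liu
```

Setting both `k > 0` and `0 < d < 1` interpolates between the ridge and least squares solutions, which is the shrinkage that the rules for choosing the two parameters are meant to tune.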

20.

A new biased estimator in logistic regression model

M. Revan Özkale Engin Arıcan 《Statistics》2016,50(2):233-253
