期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Using auxiliary data for parameter estimation with non-ignorably missing outcomes

Joseph G. Ibrahim Stuart R. Lipsitz & Nick Horton 《Journal of the Royal Statistical Society. Series C, Applied statistics》2001,50(3):361-373

We propose a method for estimating parameters in generalized linear models when the outcome variable is missing for some subjects and the missing data mechanism is non-ignorable. We assume throughout that the covariates are fully observed. One possible method for estimating the parameters is maximum likelihood with a non-ignorable missing data model. However, caution must be used when fitting non-ignorable missing data models because certain parameters may be inestimable for some models. Instead of fitting a non-ignorable model, we propose the use of auxiliary information in a likelihood approach to reduce the bias, without having to specify a non-ignorable model. The method is applied to a mental health study. 相似文献

2.

Doubly robust estimation of partially linear models for longitudinal data with dropouts and measurement error in covariates

Huiming Lin Jiajia Zhang Wing K. Fung 《Statistics》2018,52(1):84-98

In longitudinal studies, missing responses and mismeasured covariates are commonly seen due to the data collection process. Without cautiousness in data analysis, inferences from the standard statistical approaches may lead to wrong conclusions. In order to improve the estimation for longitudinal data analysis, a doubly robust estimation method for partially linear models, which can simultaneously account for the missing responses and mismeasured covariates, is proposed. Imprecisions of covariates are corrected by taking advantage of the independence between replicate measurement errors, and missing responses are handled by the doubly robust estimation under the mechanism of missing at random. The asymptotic properties of the proposed estimators are established under regularity conditions, and simulation studies demonstrate desired properties. Finally, the proposed method is applied to data from the Lifestyle Education for Activity and Nutrition study. 相似文献

3.

INCOMPLETE DATA IN GENERALIZED LINEAR MODELS WITH CONTINUOUS COVARIATES

Joseph G. Brahim Sanford Weisberg 《Australian & New Zealand Journal of Statistics》1992,34(3):461-470

This paper proposes a method for estimating the parameters in a generalized linear model with missing covariates. The missing covariates are assumed to come from a continuous distribution, and are assumed to be missing at random. In particular, Gaussian quadrature methods are used on the E-step of the EM algorithm, leading to an approximate EM algorithm. The parameters are then estimated using the weighted EM procedure given in Ibrahim (1990). This approximate EM procedure leads to approximate maximum likelihood estimates, whose standard errors and asymptotic properties are given. The proposed procedure is illustrated on a data set. 相似文献

4.

A cautionary note on the analysis of randomized block designs with a few missing values

Devan V. Mehrotra 《Statistical Papers》2004,45(1):51-66

The randomized block design is routinely employed in the social and biopharmaceutical sciences. With no missing values, analysis of variance (AOV) can be used to analyze such experiments. However, if some data are missing, the AOV formulae are no longer applicable, and iterative methods such as restricted maximum likelihood (REML) are recommended, assuming block effects are treated as random. Despite the well-known advantages of REML, methods like AOV based on complete cases (blocks) only (CC-AOV) continue to be used by researchers, particularly in situations where routinely only a few missing values are encountered. Reasons for this appear to include a natural proclivity for non-iterative, summary-statistic-based methods, and a presumption that CC-AOV is only trivially less efficient than REML with only a few missing values (say≤10%). The purpose of this note is two-fold. First, to caution that CC-AOV can be considerably less powerful than REML even with only a few missing values. Second, to offer a summary-statistic-based, pairwise-available-case-estimation (PACE) alternative to CC-AOV. PACE, which is identical to AOV (and REML) with no missing values, outperforms CC-AOV in terms of statistical power. However, it is recommended in lieu of REMLonly if software to implement the latter is unavailable, or the use of a “transparent” formula-based approach is deemed necessary. An example using real data is provided for illustration. 相似文献

5.

Multiple imputation for gamma outcome variable using generalized linear model

Vinay K. Gupta Gurprit Grover 《Journal of Statistical Computation and Simulation》2017,87(10):1980-1988

We used a proper multiple imputation (MI) through Gibbs sampling approach to impute missing values of a gamma distributed outcome variable which were missing at random, using generalized linear model (GLM) with identity link function. The missing values of the outcome variable were multiply imputed using GLM and then the complete data sets obtained after MI were analysed through GLM again for the estimation purpose. We examined the performance of the proposed technique through a simulation study with the data sets having four moderate and large proportions of missing values, 10%, 20%, 30% and 50%. We also applied this technique on a real life data and compared the results with those obtained by applying GLM only on observed cases. The results showed that the proposed technique gave better results for moderate proportions of missing values. 相似文献

6.

Gaussian Scale Mixture Models for Robust Linear Multivariate Regression with Missing Data

Juha Ala-Luhtala Robert Piché 《统计学通讯:模拟与计算》2016,45(3):791-813

We present an algorithm for multivariate robust Bayesian linear regression with missing data. The iterative algorithm computes an approximative posterior for the model parameters based on the variational Bayes (VB) method. Compared to the EM algorithm, the VB method has the advantage that the variance for the model parameters is also computed directly by the algorithm. We consider three families of Gaussian scale mixture models for the measurements, which include as special cases the multivariate t distribution, the multivariate Laplace distribution, and the contaminated normal model. The observations can contain missing values, assuming that the missing data mechanism can be ignored. A Matlab/Octave implementation of the algorithm is presented and applied to solve three reference examples from the literature. 相似文献

7.

Distribution of the biased hypothesis sum of squares in linear models with missing observations

Anant M. Kshirsagar Sheela Deo 《统计学通讯:理论与方法》2013,42(8):2747-2754

In a linear model with missing observations, one can substitute algebraic quantities and then minimize the error sum of squares for the augmented model. This gives the correct error sum of squares. But this method does not produce the correct hypothesis sum of squares for testing a linear hypothesis about the parameters. The sum of squares obtained is biased but practitioners still use it. The distribution of this biased sum of squares is derived in this paper and the consequences of using this biased sum of squares on the type I and II errors is examined. 相似文献

8.

On the information content of partial observations

Da-Hsiang Donald Lien David Rearden 《统计学通讯:理论与方法》2013,42(10):2869-2880

The modified zero order approach to estimating coefficients in the face of missing observations treats them as parameters to be estimated simultaneously with the missing observations. The paper then investigates (in the context of Han's generalized regression model)(i) when parameter estimators don't vary between using the partial data points and using only the complete ones (the informationless result), and (ii) large sample properties of the modified zero order estimator. It's found the sequential cut property is crucial to the informationless result for coefficient estimators; consistency of the modified zero order estimator depends on the percentage of observations with missing elements for large sample sizes or the sequential cut property. 相似文献

9.

Analysis of repeated categorical responses from fully and partially cross-classified data

Michael Haber Catherine C. H. Chen C. David Williamson 《统计学通讯:理论与方法》2013,42(10):3293-3313

Many follow-up studies involve categorical data measured on the same individual at different times. Frequently, some of the individuals are missing one or more of the measurements. This results in a contingency table with both fully and partially cross-classified data. Two models can be used to analyze data of this type: (i) The multiple-sample model, in which all the study subjects with the same configuration of missing observations are considered a separate sample. (ii) The single-sample model, which assumes that the missing observations are the result of a mechanism causing subjects to lose the informtion from one or some of the measurements. In this work we compare the two approaches and show that under certain conditions, the two models yield the same maximum likelihood estimates of the cell probabilities in the underlying contingency table. 相似文献

10.

Skew-mixed effects model for multivariate longitudinal data with categorical outcomes and missingness

S. Eftekhari Mahabadi E. Rahimi Jafari 《Journal of applied statistics》2018,45(12):2182-2201

A longitudinal study commonly follows a set of variables, measured for each individual repeatedly over time, and usually suffers from incomplete data problem. A common approach for dealing with longitudinal categorical responses is to use the Generalized Linear Mixed Model (GLMM). This model induces the potential relation between response variables over time via a vector of random effects, assumed to be shared parameters in the non-ignorable missing mechanism. Most GLMMs assume that the random-effects parameters follow a normal or symmetric distribution and this leads to serious problems in real applications. In this paper, we propose GLMMs for the analysis of incomplete multivariate longitudinal categorical responses with a non-ignorable missing mechanism based on a shared parameter framework with the less restrictive assumption of skew-normality for the random effects. These models may contain incomplete data with monotone and non-monotone missing patterns. The performance of the model is evaluated using simulation studies and a well-known longitudinal data set extracted from a fluvoxamine trial is analyzed to determine the profile of fluvoxamine in ambulatory clinical psychiatric practice. 相似文献

11.

Estimation of regression models with equi-correlated responses when some observations on the response variable are missing

H. Toutenburg Shalabh 《Statistical Papers》2003,44(2):217-232

The present article deals with the problem of estimation of parameters in a linear regression model when some data on response variable is missing and the responses are equi-correlated. The ordinary least squares and optimal homogeneous predictors are employed to find the imputed values of missing observations. Their efficiency properties are analyzed using the small disturbances asymptotic theory. The estimation of regression coefficients using these imputed values is also considered and a comparison of estimators is presented. 相似文献

12.

Estimation and selection in linear mixed models with missing data under compound symmetric structure

Yi-Ching Lee Junfeng Shang 《Journal of applied statistics》2022,49(15):4003

It is quite appealing to extend existing theories in classical linear models to correlated responses where linear mixed-effects models are utilized and the dependency in the data is modeled by random effects. In the mixed modeling framework, missing values occur naturally due to dropouts or non-responses, which is frequently encountered when dealing with real data. Motivated by such problems, we aim to investigate the estimation and model selection performance in linear mixed models when missing data are present. Inspired by the property of the indicator function for missingness and its relation to missing rates, we propose an approach that records missingness in an indicator-based matrix and derive the likelihood-based estimators for all parameters involved in the linear mixed-effects models. Based on the proposed method for estimation, we explore the relationship between estimation and selection behavior over missing rates. Simulations and a real data application are conducted for illustrating the effectiveness of the proposed method in selecting the most appropriate model and in estimating parameters. 相似文献

13.

缺失偏态数据下线性回归模型的统计推断

吴刘仓张家茂邱贻涛《统计与信息论坛》2013,28(9):22-26

研究缺失偏态数据下线性回归模型的参数估计问题,针对缺失偏态数据,为克服样本分布扭曲缺点和提高模型的回归系数、尺度参数和偏度参数的估计效果,提出了一种适合偏态数据下线性回归模型中缺失数据的修正回归插补方法.通过随机模拟和实例研究,并与均值插补、回归插补、随机回归插补方法比较,结果表明所提出的修正回归插补方法是有效可行的. 相似文献

14.

Impact of missing data on type 1 error rates in non‐inferiority trials

Bongin Yoo 《Pharmaceutical statistics》2010,9(2):87-99

In this paper, a simulation study is conducted to systematically investigate the impact of different types of missing data on six different statistical analyses: four different likelihood‐based linear mixed effects models and analysis of covariance (ANCOVA) using two different data sets, in non‐inferiority trial settings for the analysis of longitudinal continuous data. ANCOVA is valid when the missing data are completely at random. Likelihood‐based linear mixed effects model approaches are valid when the missing data are at random. Pattern‐mixture model (PMM) was developed to incorporate non‐random missing mechanism. Our simulations suggest that two linear mixed effects models using unstructured covariance matrix for within‐subject correlation with no random effects or first‐order autoregressive covariance matrix for within‐subject correlation with random coefficient effects provide well control of type 1 error (T1E) rate when the missing data are completely at random or at random. ANCOVA using last observation carried forward imputed data set is the worst method in terms of bias and T1E rate. PMM does not show much improvement on controlling T1E rate compared with other linear mixed effects models when the missing data are not at random but is markedly inferior when the missing data are at random. Copyright © 2009 John Wiley & Sons, Ltd. 相似文献

15.

The impact of dichotomization in longitudinal data analysis: a simulation study

Bongin Yoo 《Pharmaceutical statistics》2010,9(4):298-312

In this paper, a simulation study is conducted to systematically investigate the impact of dichotomizing longitudinal continuous outcome variables under various types of missing data mechanisms. Generalized linear models (GLM) with standard generalized estimating equations (GEE) are widely used for longitudinal outcome analysis, but these semi‐parametric approaches are only valid under missing data completely at random (MCAR). Alternatively, weighted GEE (WGEE) and multiple imputation GEE (MI‐GEE) were developed to ensure validity under missing at random (MAR). Using a simulation study, the performance of standard GEE, WGEE and MI‐GEE on incomplete longitudinal dichotomized outcome analysis is evaluated. For comparisons, likelihood‐based linear mixed effects models (LMM) are used for incomplete longitudinal original continuous outcome analysis. Focusing on dichotomized outcome analysis, MI‐GEE with original continuous missing data imputation procedure provides well controlled test sizes and more stable power estimates compared with any other GEE‐based approaches. It is also shown that dichotomizing longitudinal continuous outcome will result in substantial loss of power compared with LMM. Copyright © 2009 John Wiley & Sons, Ltd. 相似文献

16.

Best linear unbiased interpolation of order statistics

Isaac Almasi Adel Mohammadpour Mohammad Mohammadi 《统计学通讯:模拟与计算》2017,46(5):4161-4171

We derive the best linear unbiased interpolation for the missing order statistics of a random sample using the well-known projection theorem. The proposed interpolation method only needs the first two moments on both sides of a missing order statistic. A simulation study is performed to compare the proposed method with a few interpolation methods for exponential and Lévy distributions. 相似文献

17.

EMPIRICAL LIKELIHOOD‐BASED INFERENCES FOR PARTIALLY LINEAR MODELS WITH MISSING COVARIATES

Hua Liang Yongsong Qin 《Australian & New Zealand Journal of Statistics》2008,50(4):347-359

This paper considers statistical inference for partially linear models Y = X ^? β +ν(Z) +? when the linear covariate X is missing with missing probability π depending upon (Y, Z). We propose empirical likelihood‐based statistics to construct confidence regions for β and ν(z). The resulting empirical likelihood ratio statistics are shown to be asymptotically chi‐squared‐distributed. The finite‐sample performance of the proposed statistics is assessed by simulation experiments. The proposed methods are applied to a dataset from an AIDS clinical trial. 相似文献

18.

Imputation of missing variance data using non-linear mixed effects modelling to enable an inverse variance weighted meta-analysis of summary-level longitudinal data: a case study

Boucher M 《Pharmaceutical statistics》2012,11(4):318-324

Missing variances, on the basis of the summary-level data, can be a problem when an inverse variance weighted meta-analysis is undertaken. A wide range of approaches in dealing with this issue exist, such as excluding data without a variance measure, using a function of sample size as a weight and imputing the missing standard errors/deviations. A non-linear mixed effects modelling approach was taken to describe the time-course of standard deviations across 14 studies. The model was then used to make predictions of the missing standard deviations, thus, enabling a precision weighted model-based meta-analysis of a mean pain endpoint over time. Maximum likelihood and Bayesian approaches were implemented with example code to illustrate how this imputation can be carried out and to compare the output from each method. The resultant imputations were nearly identical for the two approaches. This modelling approach acknowledges the fact that standard deviations are not necessarily constant over time and can differ between treatments and across studies in a predictable way. 相似文献

19.

Some properties of clasical multi-dimesional scaling

K.V. Mardia 《统计学通讯:理论与方法》2013,42(13):1233-1241

The paper gives a new optimal property of the classical method of multi-dimensional scaling when the distance matrix is non-Euclidean. We also examine robustness of the method under a linear model. A technique to estimate missing values is also given. 相似文献

20.

Approximate maximum likelihood predictors of future failure times of shifted exponential distributions under multiple type II censoring

Mohammad?Z.?Raqab Email author 《Statistical Methods and Applications》2004,13(1):43-54

Suppose we consider a general multiple type II censored sample (some middle observations being censored) from a shifted exponential distribution. The maximum likelihood prediction method does not admit explicit solutions. We introduce a simple approximation to one of prediction likelihood equations and derive approximate predictors of missing failure times. We compute their mean square prediction errors by simulation and compare them with the best linear predictors. Further, we present two real examples to illustrate this method of prediction.AMS Subject Classification (2000): 62G30, 62M20, 62F99 相似文献