期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

multiple and conditional deletion diagnostics for general linear models

Ingrid A. Baade Anthony N. Pettitt 《统计学通讯:理论与方法》2013,42(8):1899-1910

In this paper we develop multiple case deletion statistics for the general linear model so that a residual vector and a leverage matrix are identified which have roles analogous to residuals and leverage for ordinary least squares models. We extend the notion of the conditional deletion diagnostic to general linear models. The residuals, leverage and deletion diagnostics are illustrated with data modelled by a linear growth curve. 相似文献

2.

Mean shift and influence measures in linear measurement error models with stochastic linear restrictions

F. Ghapani B. Babadi 《统计学通讯:模拟与计算》2017,46(6):4499-4512

We present influence diagnostics for linear measurement error models with stochastic linear restrictions using the corrected likelihood of Nakamura in 1990. The case deletion and mean shift outlier models are developed to identify outlying and influential observations. We derive a corrected score test statistic for outlier detection based on mean shift outlier models. The analogs of Cook's distance and likelihood distance are proposed to determine influential observations based on case deletion models. A parametric bootstrap procedure is used to obtain empirical distributions of the test statistics and a simulation study has been used to evaluate the performance of the proposed estimators based on the mean squares error criterion and the score test statistic. Finally, a numerical example is given to illustrate the theoretical results. 相似文献

3.

Local influence diagnostics for incomplete overdispersed longitudinal counts

Trias Wahyuni Rakhmawati Geert Verbeke Christel Faes 《Journal of applied statistics》2016,43(9):1722-1737

We develop local influence diagnostics to detect influential subjects when generalized linear mixed models are fitted to incomplete longitudinal overdispersed count data. The focus is on the influence stemming from the dropout model specification. In particular, the effect of small perturbations around an MAR specification are examined. The method is applied to data from a longitudinal clinical trial in epileptic patients. The effect on models allowing for overdispersion is contrasted with that on models that do not. 相似文献

4.

Influence Diagnostics on Testing Linear Hypothesis in Linear Models with Correlated Errors

Hadi Emami 《统计学通讯:理论与方法》2014,43(6):1050-1060

To assess the influence of observations on the parameter estimates, case deletion diagnostics are commonly used in linear regression models. For linear models with correlated errors we study the influence of observations on testing a linear hypothesis using single and multiple case deletions. The change in likelihood ratio test and F test theoretically is derived and it is shown these tests to be completely determined by two proposed generalized externally studentized residuals. An illustrative example of a real data set is also reported. 相似文献

5.

Influence on tests with focus on linear models

Christian Ritz Ib.M. Skovgaard 《Journal of statistical planning and inference》2007

To assess the influence of single observations on the parameter estimates, case-deletion diagnostics are commonly used in linear regression models; one example is Cook's distance. For nested parametric models we consider a deletion diagnostic for evaluating the influence of a single observation on the likelihood ratio (LR) test. In order to have a common scale as reference, the asymptotic distribution of the diagnostic is derived and the values of the diagnostic are converted to percentiles. We focus on linear models and general linear models, and in these cases explicit results are derived. The performance of the diagnostic is explored in two small bench mark examples from linear regression and in a larger linear mixed model example. 相似文献

6.

Estimated estimating equations: semiparametric inference for clustered and longitudinal data

Jeng-Min Chiou Hans-Georg Müller 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2005,67(4):531-553

Summary. We introduce a flexible marginal modelling approach for statistical inference for clustered and longitudinal data under minimal assumptions. This estimated estimating equations approach is semiparametric and the proposed models are fitted by quasi-likelihood regression, where the unknown marginal means are a function of the fixed effects linear predictor with unknown smooth link, and variance–covariance is an unknown smooth function of the marginal means. We propose to estimate the nonparametric link and variance–covariance functions via smoothing methods, whereas the regression parameters are obtained via the estimated estimating equations. These are score equations that contain nonparametric function estimates. The proposed estimated estimating equations approach is motivated by its flexibility and easy implementation. Moreover, if data follow a generalized linear mixed model, with either a specified or an unspecified distribution of random effects and link function, the model proposed emerges as the corresponding marginal (population-average) version and can be used to obtain inference for the fixed effects in the underlying generalized linear mixed model, without the need to specify any other components of this generalized linear mixed model. Among marginal models, the estimated estimating equations approach provides a flexible alternative to modelling with generalized estimating equations. Applications of estimated estimating equations include diagnostics and link selection. The asymptotic distribution of the proposed estimators for the model parameters is derived, enabling statistical inference. Practical illustrations include Poisson modelling of repeated epileptic seizure counts and simulations for clustered binomial responses. 相似文献

7.

Mixed effects smoothing spline analysis of variance

Yuedong Wang 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1998,60(1):159-174

We propose a general family of nonparametric mixed effects models. Smoothing splines are used to model the fixed effects and are estimated by maximizing the penalized likelihood function. The random effects are generic and are modelled parametrically by assuming that the covariance function depends on a parsimonious set of parameters. These parameters and the smoothing parameter are estimated simultaneously by the generalized maximum likelihood method. We derive a connection between a nonparametric mixed effects model and a linear mixed effects model. This connection suggests a way of fitting a nonparametric mixed effects model by using existing programs. The classical two-way mixed models and growth curve models are used as examples to demonstrate how to use smoothing spline analysis-of-variance decompositions to build nonparametric mixed effects models. Similarly to the classical analysis of variance, components of these nonparametric mixed effects models can be interpreted as main effects and interactions. The penalized likelihood estimates of the fixed effects in a two-way mixed model are extensions of James–Stein shrinkage estimates to correlated observations. In an example three nested nonparametric mixed effects models are fitted to a longitudinal data set. 相似文献

8.

Local Influence Analysis for Penalized Gaussian Likelihood Estimators in Partially Linear Models 总被引：1，自引：0，他引：1

Zhong-Yi Zhu Xuming He Wing-Kam Fung 《Scandinavian Journal of Statistics》2003,30(4):767-780

Abstract. Partially linear models are extensions of linear models to include a non-parametric function of some covariate. They have been found to be useful in both cross-sectional and longitudinal studies. This paper provides a convenient means to extend Cook's local influence analysis to the penalized Gaussian likelihood estimator that uses a smoothing spline as a solution to its non-parametric component. Insight is also provided into the interplay of the influence or leverage measures between the linear and the non-parametric components in the model. The diagnostics are applied to a mouthwash data set and a longitudinal hormone study with informative results. 相似文献

9.

Omitted variables in longitudinal data models

Edward W. Frees 《Revue canadienne de statistique》2001,29(4):573-595

The omission of important variables is a well‐known model specification issue in regression analysis and mixed linear models. The author considers longitudinal data models that are special cases of the mixed linear models; in particular, they are linear models of repeated observations on a subject. Models of omitted variables have origins in both the econometrics and biostatistics literatures. The author describes regression coefficient estimators that are robust to and that provide the basis for detecting the influence of certain types of omitted variables. New robust estimators and omitted variable tests are introduced and illustrated with a case study that investigates the determinants of tax liability. 相似文献

10.

Bayesian modeling of autoregressive partial linear models with scale mixture of normal errors

Guillermo Ferreira Luis M. Castro Ronaldo Dias 《Journal of applied statistics》2013,40(8):1796-1816

Normality and independence of error terms are typical assumptions for partial linear models. However, these assumptions may be unrealistic in many fields, such as economics, finance and biostatistics. In this paper, a Bayesian analysis for partial linear model with first-order autoregressive errors belonging to the class of the scale mixtures of normal distributions is studied in detail. The proposed model provides a useful generalization of the symmetrical linear regression model with independent errors, since the distribution of the error term covers both correlated and thick-tailed distributions, and has a convenient hierarchical representation allowing easy implementation of a Markov chain Monte Carlo scheme. In order to examine the robustness of the model against outlying and influential observations, a Bayesian case deletion influence diagnostics based on the Kullback–Leibler (K–L) divergence is presented. The proposed method is applied to monthly and daily returns of two Chilean companies. 相似文献

11.

Predictive comparison of joint longitudinal-survival modeling: a case study illustrating competing approaches

Timothy E. Hanson Adam J. Branscum Wesley O. Johnson 《Lifetime data analysis》2011,17(1):3-28

The joint modeling of longitudinal and survival data has received extraordinary attention in the statistics literature recently, with models and methods becoming increasingly more complex. Most of these approaches pair a proportional hazards survival with longitudinal trajectory modeling through parametric or nonparametric specifications. In this paper we closely examine one data set previously analyzed using a two parameter parametric model for Mediterranean fruit fly (medfly) egg-laying trajectories paired with accelerated failure time and proportional hazards survival models. We consider parametric and nonparametric versions of these two models, as well as a proportional odds rate model paired with a wide variety of longitudinal trajectory assumptions reflecting the types of analyses seen in the literature. In addition to developing novel nonparametric Bayesian methods for joint models, we emphasize the importance of model selection from among joint and non joint models. The default in the literature is to omit at the outset non joint models from consideration. For the medfly data, a predictive diagnostic criterion suggests that both the choice of survival model and longitudinal assumptions can grossly affect model adequacy and prediction. Specifically for these data, the simple joint model used in by Tseng et al. (Biometrika 92:587–603, 2005) and models with much more flexibility in their longitudinal components are predictively outperformed by simpler analyses. This case study underscores the need for data analysts to compare on the basis of predictive performance different joint models and to include non joint models in the pool of candidates under consideration. 相似文献

12.

A NONPARAMETRIC MIXED‐EFFECTS MODEL FOR CANCER MORTALITY

Megu Ohtaki 《Australian & New Zealand Journal of Statistics》2011,53(2):247-256

There are several ways to handle within‐subject correlations with a longitudinal discrete outcome, such as mortality. The most frequently used models are either marginal or random‐effects types. This paper deals with a random‐effects‐based approach. We propose a nonparametric regression model having time‐varying mixed effects for longitudinal cancer mortality data. The time‐varying mixed effects in the proposed model are estimated by combining kernel‐smoothing techniques and a growth‐curve model. As an illustration based on real data, we apply the proposed method to a set of prefecture‐specific data on mortality from large‐bowel cancer in Japan. 相似文献

13.

Posterior consistency of random effects models for binary data

Yongdai Kim Dohyun Kim 《Journal of statistical planning and inference》2011,141(11):3391-3399

In longitudinal studies or clustered designs, observations for each subject or cluster are dependent and exhibit intra-correlation. To account for this dependency, we consider Bayesian analysis for conditionally specified models, so-called generalized linear mixed model. In nonlinear mixed models, the maximum likelihood estimator of the regression coefficients is typically a function of the distribution of random effects, and so the misspecified choice of the distribution of random effects can cause bias in the estimator. To avoid the problem of the misspecification of the distribution of random effects, one can resort in nonparametric approaches. We give sufficient conditions for posterior consistency of the distribution of random effects as well as regression coefficients. 相似文献

14.

Semiparametric model average prediction in panel data analysis

Tao Huang 《Journal of nonparametric statistics》2018,30(1):125-144

Forecasting in economic data analysis is dominated by linear prediction methods where the predicted values are calculated from a fitted linear regression model. With multiple predictor variables, multivariate nonparametric models were proposed in the literature. However, empirical studies indicate the prediction performance of multi-dimensional nonparametric models may be unsatisfactory. We propose a new semiparametric model average prediction (SMAP) approach to analyse panel data and investigate its prediction performance with numerical examples. Estimation of individual covariate effect only requires univariate smoothing and thus may be more stable than previous multivariate smoothing approaches. The estimation of optimal weight parameters incorporates the longitudinal correlation and the asymptotic properties of the estimated results are carefully studied in this paper. 相似文献

15.

Deletion diagnostics for marginal mean and correlation model parameters in estimating equations

John S. Preisser Jamie Perin 《Statistics and Computing》2007,17(4):381-393

Regression diagnostics are introduced for parameters in marginal association models for clustered binary outcomes in an implementation of generalized estimating equations. Estimating equations for intracluster correlations facilitate computational formulae for one-step deletion diagnostics in an extension of earlier work on diagnostics for parameters in the marginal mean model. The proposed diagnostics measure the influence of an observation or a cluster of observations on the estimated regression parameters and on the overall fit of the model. The diagnostics are applied to data from four research studies from public health and medicine. 相似文献

16.

Efficient estimation in partially linear single‐index models for longitudinal data

Quan Cai Suojin Wang 《Scandinavian Journal of Statistics》2019,46(1):116-141

In this paper, we consider the estimation of both the parameters and the nonparametric link function in partially linear single‐index models for longitudinal data that may be unbalanced. In particular, a new three‐stage approach is proposed to estimate the nonparametric link function using marginal kernel regression and the parametric components with generalized estimating equations. The resulting estimators properly account for the within‐subject correlation. We show that the parameter estimators are asymptotically semiparametrically efficient. We also show that the asymptotic variance of the link function estimator is minimized when the working error covariance matrices are correctly specified. The new estimators are more efficient than estimators in the existing literature. These asymptotic results are obtained without assuming normality. The finite‐sample performance of the proposed method is demonstrated by simulation studies. In addition, two real‐data examples are analyzed to illustrate the methodology. 相似文献

17.

Some algebra and geometry for hierarchical models, applied to diagnostics

J. S. Hodges 《Journal of the Royal Statistical Society. Series B, Statistical methodology》1998,60(3):497-536

Recent advances in computing make it practical to use complex hierarchical models. However, the complexity makes it difficult to see how features of the data determine the fitted model. This paper describes an approach to diagnostics for hierarchical models, specifically linear hierarchical models with additive normal or t -errors. The key is to express hierarchical models in the form of ordinary linear models by adding artificial `cases' to the data set corresponding to the higher levels of the hierarchy. The error term of this linear model is not homoscedastic, but its covariance structure is much simpler than that usually used in variance component or random effects models. The re-expression has several advantages. First, it is extremely general, covering dynamic linear models, random effect and mixed effect models, and pairwise difference models, among others. Second, it makes more explicit the geometry of hierarchical models, by analogy with the geometry of linear models. Third, the analogy with linear models provides a rich source of ideas for diagnostics for all the parts of hierarchical models. This paper gives diagnostics to examine candidate added variables, transformations, collinearity, case influence and residuals. 相似文献

18.

Efficient estimation for time series following generalized linear models

下载免费PDF全文

T. Thomson S. Hossain M. Ghahramani 《Australian & New Zealand Journal of Statistics》2016,58(4):493-513

In this paper, we consider James–Stein shrinkage and pretest estimation methods for time series following generalized linear models when it is conjectured that some of the regression parameters may be restricted to a subspace. Efficient estimation strategies are developed when there are many covariates in the model and some of them are not statistically significant. Statistical properties of the pretest and shrinkage estimation methods including asymptotic distributional bias and risk are developed. We investigate the relative performances of shrinkage and pretest estimators with respect to the unrestricted maximum partial likelihood estimator (MPLE). We show that the shrinkage estimators have a lower relative mean squared error as compared to the unrestricted MPLE when the number of significant covariates exceeds two. Monte Carlo simulation experiments were conducted for different combinations of inactive covariates and the performance of each estimator was evaluated in terms of its mean squared error. The practical benefits of the proposed methods are illustrated using two real data sets. 相似文献

19.

The geometry of case deletion and the assessment of influence in nonlinear regression

William H. Ross 《Revue canadienne de statistique》1987,15(2):91-103

Numerous influence measures are available for use in linear regression. By contrast, very little has been done for nonlinear models. A notable exception is Chapter 4 of Cook and Weisberg (1982). The extension of measures based on case deletion from the linear to the nonlinear model usually involve linear approximation. In this paper, the geometry of case deletion is studied with a view to assessing the adequacy of linear approximation in the construction of influence measures for nonlinear regression. Of particular interest is the adequacy of the one-step estimator for the jack-knifed pseudoestimates of the unknown parameter vector. 相似文献

20.

On estimation and influence diagnostics for log-Birnbaum–Saunders Student-t regression models: Full Bayesian analysis

Vicente G. Cancho Edwin M.M. Ortega Gilberto A. Paula 《Journal of statistical planning and inference》2010

The purpose of this paper is to develop a Bayesian approach for log-Birnbaum–Saunders Student-t regression models under right-censored survival data. Markov chain Monte Carlo (MCMC) methods are used to develop a Bayesian procedure for the considered model. In order to attenuate the influence of the outlying observations on the parameter estimates, we present in this paper Birnbaum–Saunders models in which a Student-t distribution is assumed to explain the cumulative damage. Also, some discussions on the model selection to compare the fitted models are given and case deletion influence diagnostics are developed for the joint posterior distribution based on the Kullback–Leibler divergence. The developed procedures are illustrated with a real data set. 相似文献