1.

Bayesian estimation and influence diagnostics of generalized partially linear mixed-effects models for longitudinal data

Xing-De Duan 《Statistics》2016,50(3):525-539

This paper develops a Bayesian approach to obtain the joint estimates of unknown parameters, nonparametric functions and random effects in generalized partially linear mixed models (GPLMMs), and presents three case deletion influence measures to identify influential observations based on the φ-divergence, Cook's posterior mean distance and Cook's posterior mode distance of parameters. Fisher's iterative scoring algorithm is developed to evaluate the posterior modes of parameters in GPLMMs. The first-order approximation to Cook's posterior mode distance is presented. The computationally feasible formulae for the φ-divergence diagnostic and Cook's posterior mean distance are given. Several simulation studies and an example are presented to illustrate our proposed methodologies.

2.

Cook's distance in linear longitudinal models

Mousumi Banerjee 《Communications in Statistics - Theory and Methods》2013,42(12):2973-2983

Cook's distance (1977) has become the standard influence diagnostic tool for analyzing cross-sectional regression studies. This paper introduces an analogue of Cook's distance in fixed effects models for longitudinal data. We demonstrate that this statistic is dominated by the effects of nuisance parameters, and hence its effectiveness as an influence measure in the longitudinal data setting is limited.

3.

Procedures for the identification of multiple influential observations in linear regression

A.A.M. Nurunnabi Ali S. Hadi A.H.M.R. Imon 《Journal of applied statistics》2014,41(6):1315-1331

Since the seminal paper by Cook (1977) in which he introduced Cook's distance, the identification of influential observations has received a great deal of interest and extensive investigation in linear regression. It is well documented that most of the popular diagnostic measures that are based on single-case deletion can mislead the analysis in the presence of multiple influential observations because of the well-known masking and/or swamping phenomena. Atkinson (1981) proposed a modification of Cook's distance. In this paper we propose a further modification of Cook's distance for the identification of a single influential observation. We then propose new measures for the identification of multiple influential observations, which are not affected by the masking and swamping problems. The efficiency of the new statistics is presented through several well-known data sets and a simulation study.
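
For orientation, the single-case deletion measure that this abstract builds on can be sketched for the simplest setting, a straight-line fit with p = 2 coefficients. The sketch uses the standard closed form D_i = e_i^2 h_i / (p s^2 (1 - h_i)^2), where e_i is the residual, h_i the leverage and s^2 the residual variance estimate. The function names `fit_line` and `cooks_distance` and the data are illustrative assumptions, not taken from the paper, and the modified measures proposed by the authors are not reproduced here.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares fit of y = a + b*x; returns (a, b)."""
    xbar, ybar = mean(x), mean(y)
    sxx = sum((xi - xbar) ** 2 for xi in x)
    b = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    return ybar - b * xbar, b


def cooks_distance(x, y):
    """Cook's distance of every case in a simple linear regression (p = 2)."""
    n, p = len(x), 2
    a, b = fit_line(x, y)
    xbar = mean(x)
    sxx = sum((xi - xbar) ** 2 for xi in x)
    resid = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    # Unbiased estimate of the error variance from the full fit.
    s2 = sum(e * e for e in resid) / (n - p)
    # Leverage (diagonal of the hat matrix) for a one-predictor model.
    lev = [1 / n + (xi - xbar) ** 2 / sxx for xi in x]
    return [e * e * h / (p * s2 * (1 - h) ** 2) for e, h in zip(resid, lev)]


# Hypothetical data: the last case has high leverage and lies off the trend.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 10.0]
y = [1.1, 1.9, 3.2, 3.9, 5.1, 4.0]
for i, d in enumerate(cooks_distance(x, y), start=1):
    print(f"case {i}: D = {d:.3f}")
```

The closed form avoids refitting the model n times; it agrees with the definition based on the change in all fitted values when case i is deleted.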

4.

Exploring Cook's Statistic Graphically

R. J. O'Hara Hines W. G. S. Hines 《The American statistician》2013,67(4):389-394

An index plot of Cook's statistic is frequently used to highlight influential observations. In this article we illustrate how enhanced higher-dimensional plots of Cook's statistic can provide further useful information about sets of influential observations. We provide examples using normal and generalized linear models.

5.

Graphical and Numerical Methods for Detecting Influential Observations in Complex Bingham Data

Getulio Jose Amorim Amaral Olga Patricia Reyes Floréz Francisco José A Cysneiros 《Communications in Statistics - Simulation and Computation》2013,42(8):1801-1814

We describe methods to detect influential observations in a sample of pre-shapes when the underlying distribution is assumed to be complex Bingham. One of these methods is based on Cook's distance, which is derived from the likelihood of the complex Bingham distribution. Another method is related to the tangent space and is based on the local influence for the multivariate normal distribution. A method to detect outliers is also explained. The application of the methods is illustrated on both a real dataset and a simulated sample.

6.

Relative local influence and the condition number

R W Farebrother 《Communications in Statistics - Simulation and Computation》2013,42(3):707-710

In this paper we show that the condition number traditionally used by numerical analysts is closely related to a variant of Cook's (1986) absolute measure of local influence. The nature of the assumptions made in our derivation of this result raises several questions concerning the relevance of the traditional condition number as a measure of computational accuracy in statistical studies.

7.

Identification of multiple influential observations in logistic regression

A. A.M. Nurunnabi A. H.M. Rahmatullah Imon M. Nasser 《Journal of applied statistics》2010,37(10):1605-1624

The identification of influential observations in logistic regression has drawn a great deal of attention in recent years. Most of the available techniques, such as Cook's distance and difference of fits (DFFITS), are based on single-case deletion. But there is evidence that these techniques suffer from masking and swamping problems and consequently fail to detect multiple influential observations. In this paper, we have developed a new measure for the identification of multiple influential observations in logistic regression based on a generalized version of DFFITS. The advantage of the proposed method is then investigated through several well-known data sets and a simulation study.
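
As a point of reference for the single-case DFFITS mentioned above, the following is a minimal sketch for an ordinary straight-line fit, not for logistic regression and not the generalized version proposed in the paper. It assumes the standard linear-model form DFFITS_i = e_i sqrt(h_i) / (s_(i) (1 - h_i)), where s_(i) is the residual standard deviation estimated without case i. The names `ols_line` and `dffits` and the data are hypothetical.

```python
from math import sqrt
from statistics import mean


def ols_line(x, y):
    """Ordinary least squares fit of y = a + b*x; returns (a, b)."""
    xbar, ybar = mean(x), mean(y)
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
    return ybar - slope * xbar, slope


def dffits(x, y):
    """Single-case DFFITS for a simple linear regression (p = 2, needs n >= 4)."""
    n, p = len(x), 2
    a, b = ols_line(x, y)
    xbar = mean(x)
    sxx = sum((xi - xbar) ** 2 for xi in x)
    rss = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y))
    out = []
    for xi, yi in zip(x, y):
        e = yi - (a + b * xi)                # ordinary residual
        h = 1 / n + (xi - xbar) ** 2 / sxx   # leverage of this case
        # Residual standard deviation with this case deleted (no refit needed).
        s_del = sqrt((rss - e * e / (1 - h)) / (n - p - 1))
        out.append(e * sqrt(h) / (s_del * (1 - h)))
    return out


# Hypothetical data: the last case has high leverage and lies off the trend.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 10.0]
y = [1.1, 1.9, 3.2, 3.9, 5.1, 4.0]
for i, v in enumerate(dffits(x, y), start=1):
    print(f"case {i}: DFFITS = {v:+.3f}")
```

Because each value is computed by deleting one case at a time, two influential cases that sit close together can each look unremarkable, which is the masking problem the abstract refers to.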

8.

Model diagnostics for marginal regression analysis of correlated binary data

Ming Tan Yingsheng Qu Michael H. Kutner 《Communications in Statistics - Simulation and Computation》2013,42(2):539-558

We propose several diagnostic methods for checking the adequacy of marginal regression models for analyzing correlated binary data. We use a parametric marginal model based on latent variables and derive the projection (hat) matrix, Cook's distance, various residuals and Mahalanobis distance between the observed binary responses and the estimated probabilities for a cluster. Emphasized are several graphical methods including the simulated Q-Q plot, the half-normal probability plot with a simulated envelope, and the partial residual plot. The methods are illustrated with a real-life example.

9.

Sufficient jackknife-after-bootstrap method for detection of influential observations in linear regression models

Ufuk Beyaztas Aylin Alin 《Statistical Papers》2014,55(4):1001-1018

In this study, we adapt the sufficient bootstrap into the jackknife-after-bootstrap (JaB) algorithm. The performances of the sufficient and conventional JaB methods are compared for detecting influential observations in linear regression. The comparison is based on two real-world examples and an extensive designed simulation study. The design includes different sample sizes and various modeling scenarios. The results reveal that the proposed method is a good competitor to the conventional JaB method, with smaller standard error and less computation.

10.

A comparative study on detection of influential observations in linear regression

A. Hossain D. N. Naik 《Statistical Papers》1991,32(1):55-69

A large number of statistics are used in the literature to detect outliers and influential observations in the linear regression model. In this paper comparison studies have been made for determining a statistic which performs better than the others. This includes: (i) a detailed simulation study, and (ii) analyses of several data sets studied by different authors. Different choices of the design matrix of the regression model are considered. Design A studies the performance of the various statistics for detecting scale shift type outliers, and designs B and C provide information on the performance of the statistics for identifying influential observations. We have used cutoff points based on the exact distributions and Bonferroni's inequality for each statistic. The results show that the studentized residual, which is used for detection of mean shift outliers, is also appropriate for detection of scale shift outliers, and that Welsch's statistic and Cook's distance are appropriate for detection of influential observations.

11.

On the asymptotic distribution of Cook's distance in logistic regression models

Nirian Martín Leandro Pardo 《Journal of applied statistics》2009,36(10):1119-1146

It sometimes occurs that one or more components of the data exert a disproportionate influence on the model estimation. We need a reliable tool for identifying such troublesome cases in order to decide either to eliminate them from the sample, when the data collection was badly carried out, or otherwise to take care in using the model because the results could be affected by such components. Since a measure for detecting influential cases in the linear regression setting was proposed by Cook [Detection of influential observations in linear regression, Technometrics 19 (1977), pp. 15–18.], apart from the same measure for other models, several new measures have been suggested as single-case diagnostics. For most of them some cutoff values have been recommended (see [D.A. Belsley, E. Kuh, and R.E. Welsch, Regression Diagnostics: Identifying Influential Data and Sources of Collinearity, 2nd ed., John Wiley & Sons, New York, Chichester, Brisbane, (2004).], for instance); however, the lack of a quantile-type cutoff for Cook's statistics has induced the analyst to deal only with index plots as worthy diagnostic tools. Focusing on logistic regression, the aim of this paper is to provide the asymptotic distribution of Cook's distance in order to look for a meaningful cutoff point for detecting influential and leverage observations.

12.

Mean shift and influence measures in linear measurement error models with stochastic linear restrictions

F. Ghapani B. Babadi 《Communications in Statistics - Simulation and Computation》2017,46(6):4499-4512

We present influence diagnostics for linear measurement error models with stochastic linear restrictions using the corrected likelihood of Nakamura (1990). The case deletion and mean shift outlier models are developed to identify outlying and influential observations. We derive a corrected score test statistic for outlier detection based on mean shift outlier models. The analogs of Cook's distance and likelihood distance are proposed to determine influential observations based on case deletion models. A parametric bootstrap procedure is used to obtain empirical distributions of the test statistics, and a simulation study is used to evaluate the performance of the proposed estimators based on the mean squared error criterion and the score test statistic. Finally, a numerical example is given to illustrate the theoretical results.

13.

Residual analysis and outliers in loglinear models based on phi-divergence statistics

A.K. Gupta T. Nguyen L. Pardo 《Journal of statistical planning and inference》2007

In this paper we consider new families of residuals and influential measures, under the assumption of multinomial sampling, for loglinear models. These new families are based on the φ-divergence test statistic. The asymptotic normality of the standardized residuals is obtained as well as the relation of the new family of influential measures with the appropriate Cook's distance in this context. The expression of the new family of residuals is obtained in two important problems: independence and symmetry in two-dimensional contingency tables. A numerical example illustrates the results obtained.

14.

On the potential in the estimation of linear functions in regression

Manuel del Rio 《Communications in Statistics - Theory and Methods》2013,42(3):729-738

New aspects of potential in Cook's influence measure for linear combinations are explored. It is shown that this potential can be considered as a case influence measure in the scatter of estimated combinations. The potential is related to precise estimation directions and multicollinearity concepts; it is also used as a basis for selection of new cases.

15.

Influential observations in view of design and inference

Subir Ghosh 《Communications in Statistics - Theory and Methods》2013,42(14):1675-1683

In this paper we consider measures for detecting influential observations with respect to one or several parameters of interest at the design stage. We also consider Cook's measure for detecting influential observations at the inference stage. We study the interrelationship between the two kinds of measures.

16.

Influence on tests with focus on linear models

Christian Ritz Ib M. Skovgaard 《Journal of statistical planning and inference》2007

To assess the influence of single observations on the parameter estimates, case-deletion diagnostics are commonly used in linear regression models; one example is Cook's distance. For nested parametric models we consider a deletion diagnostic for evaluating the influence of a single observation on the likelihood ratio (LR) test. In order to have a common scale as reference, the asymptotic distribution of the diagnostic is derived and the values of the diagnostic are converted to percentiles. We focus on linear models and general linear models, and in these cases explicit results are derived. The performance of the diagnostic is explored in two small benchmark examples from linear regression and in a larger linear mixed model example.

17.

Inference and diagnostics in skew scale mixtures of normal regression models

《Journal of Statistical Computation and Simulation》2012,82(3):517-537

Skew scale mixtures of normal distributions are often used for statistical procedures involving asymmetric and heavy-tailed data. The main virtue of the members of this family of distributions is that they are easy to simulate from and they also supply genuine expectation-maximization (EM) algorithms for maximum likelihood estimation. In this paper, we extend the EM algorithm for linear regression models and we develop diagnostic analyses via local influence and generalized leverage, following Zhu and Lee's approach. This is because Cook's well-known approach cannot be used to obtain measures of local influence. The EM-type algorithm has been discussed with an emphasis on the skew Student-t-normal, skew slash, skew-contaminated normal and skew power-exponential distributions. Finally, results obtained for a real data set are reported, illustrating the usefulness of the proposed method.

18.

Estimation and diagnostic analysis in skew-generalized-normal regression models

Clécio S. Ferreira Reinaldo B. Arellano-Valle 《Journal of Statistical Computation and Simulation》2018,88(6):1039-1059

The skew-generalized-normal distribution [Arellano-Valle, RB, Gómez, HW, Quintana, FA. A new class of skew-normal distributions. Comm Statist Theory Methods 2004;33(7):1465–1480] is a class of asymmetric normal distributions, which contains the normal and skew-normal distributions as special cases. The main virtues of this distribution are that it is easy to simulate from and that it supplies a genuine expectation-maximization (EM) algorithm for maximum likelihood estimation. In this paper, we extend the EM algorithm for linear regression models assuming skew-generalized-normal random errors and we develop diagnostic analyses via local influence and generalized leverage, following Zhu and Lee's approach. This is because Cook's well-known approach would be more complicated to use to obtain measures of local influence. Finally, results obtained for a real data set are reported, illustrating the usefulness of the proposed method.

19.

Diagnostics for identifying influential cases in manova

Charles W. Kish Jr Vernon M. Chinchilli 《Communications in Statistics - Theory and Methods》2013,42(7):2667-2681

Measures of influence of multivariate cases on the estimated parameter matrix in MANOVA are developed. The development is based on the confidence region resulting from the likelihood ratio criterion, and is patterned after the development of Cook's univariate D_I measure. Influence measures corresponding to the other 3 common multivariate test criteria are presented. Relationships to the measures of Caroni (1987), and Barrett and Ling (1988) are noted. A numerical example is included.

20.

Local influence: a new approach

N. Billor R.M. Loynes 《Communications in Statistics - Theory and Methods》2013,42(6):1595-1611

The concept of local influence was introduced by Cook (1986). Closer study of the idea of perturbations suggests that it is important to distinguish between those of the data and those of the model, and that in the latter case Cook's definition has a theoretical difficulty. Here a new measure is proposed, which has the incidental benefit of being simpler to compute.