期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Comparisons of various types of normality tests

《Journal of Statistical Computation and Simulation》2012,82(12):2141-2155

Normality tests can be classified into tests based on chi-squared, moments, empirical distribution, spacings, regression and correlation and other special tests. This paper studies and compares the power of eight selected normality tests: the Shapiro–Wilk test, the Kolmogorov–Smirnov test, the Lilliefors test, the Cramer–von Mises test, the Anderson–Darling test, the D'Agostino–Pearson test, the Jarque–Bera test and chi-squared test. Power comparisons of these eight tests were obtained via the Monte Carlo simulation of sample data generated from alternative distributions that follow symmetric short-tailed, symmetric long-tailed and asymmetric distributions. Our simulation results show that for symmetric short-tailed distributions, D'Agostino and Shapiro–Wilk tests have better power. For symmetric long-tailed distributions, the power of Jarque–Bera and D'Agostino tests is quite comparable with the Shapiro–Wilk test. As for asymmetric distributions, the Shapiro–Wilk test is the most powerful test followed by the Anderson–Darling test. 相似文献

2.

Comparing the performance of normality tests with ROC analysis and confidence intervals

Miguel Patrício Fábio Ferreira Bárbara Oliveiros Francisco Caramelo 《统计学通讯:模拟与计算》2017,46(10):7535-7551

There are several statistical hypothesis tests available for assessing normality assumptions, which is an a priori requirement for most parametric statistical procedures. The usual method for comparing the performances of normality tests is to use Monte Carlo simulations to obtain point estimates for the corresponding powers. The aim of this work is to improve the assessment of 9 normality hypothesis tests. For that purpose, random samples were drawn from several symmetric and asymmetric nonnormal distributions and Monte Carlo simulations were carried out to compute confidence intervals for the power achieved, for each distribution, by two of the most usual normality tests, Kolmogorov–Smirnov with Lilliefors correction and Shapiro–Wilk. In addition, the specificity was computed for each test, again resorting to Monte Carlo simulations, taking samples from standard normal distributions. The analysis was then additionally extended to the Anderson–Darling, Cramer-Von Mises, Pearson chi-square Shapiro–Francia, Jarque–Bera, D'Agostino and uncorrected Kolmogorov–Smirnov tests by determining confidence intervals for the areas under the receiver operating characteristic curves. Simulations were performed to this end, wherein for each sample from a nonnormal distribution an equal-sized sample was taken from a normal distribution. The Shapiro–Wilk test was seen to have the best global performance overall, though in some circumstances the Shapiro–Francia or the D'Agostino tests offered better results. The differences between the tests were not as clear for smaller sample sizes. Also to be noted, the SW and KS tests performed generally quite poorly in distinguishing between samples drawn from normal distributions and t Student distributions. 相似文献

3.

Testing for normality in linear regression models

《Journal of Statistical Computation and Simulation》2012,82(10):1101-1113

The importance of the normal distribution for fitting continuous data is well known. However, in many practical situations data distribution departs from normality. For example, the sample skewness and the sample kurtosis are far away from 0 and 3, respectively, which are nice properties of normal distributions. So, it is important to have formal tests of normality against any alternative. D'Agostino et al. [A suggestion for using powerful and informative tests of normality, Am. Statist. 44 (1990), pp. 316–321] review four procedures Z ²(g ₁), Z ²(g ₂), D and K ² for testing departure from normality. The first two of these procedures are tests of normality against departure due to skewness and kurtosis, respectively. The other two tests are omnibus tests. An alternative to the normal distribution is a class of skew-normal distributions (see [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178]). In this paper, we obtain a score test (W) and a likelihood ratio test (LR) of goodness of fit of the normal regression model against the skew-normal family of regression models. It turns out that the score test is based on the sample skewness and is of very simple form. The performance of these six procedures, in terms of size and power, are compared using simulations. The level properties of the three statistics LR, W and Z ²(g ₁) are similar and close to the nominal level for moderate to large sample sizes. Also, their power properties are similar for small departure from normality due to skewness (γ₁≤0.4). Of these, the score test statistic has a very simple form and computationally much simpler than the other two statistics. The LR statistic, in general, has highest power, although it is computationally much complex as it requires estimates of the parameters under the normal model as well as those under the skew-normal model. So, the score test may be used to test for normality against small departure from normality due to skewness. Otherwise, the likelihood ratio statistic LR should be used as it detects general departure from normality (due to both skewness and kurtosis) with, in general, largest power. 相似文献

4.

Shapiro–Francia test compared to other normality test using expected p-value

《Journal of Statistical Computation and Simulation》2012,82(15):3002-3016

The Shapiro–Francia (SF) normality test is an important test in statistical modelling. However, little has been done by researchers to compare the performance of this test to other normality tests. This paper therefore measures the performance of the SF and other normality tests by studying the distribution of their p-values. For the purpose of this study, we selected eight well-known normality tests to compare with the SF test: (i) Kolmogorov–Smirnov (KS), (ii) Anderson–Darling (AD), (iii) Cramer von Mises (CM), (iv) Lilliefors (LF), (v) Shapiro–Wilk (SW), (vi) Pearson chi-square (PC), (vii) Jarque– Bera (JB) and (viii) D'Agostino (DA). The distribution of p-values of these normality tests were obtained by generating data from normal distribution and well-known symmetric non-normal distribution at various sample sizes (small, medium and large). Our simulation results showed that the SF normality test was the best test statistic in detecting deviation from normality among the nine tests considered at all sample sizes. 相似文献

5.

An Empirical Analysis of Some Nonparametric Goodness-of-Fit Tests for Censored Data

N. Balakrishnan M. Vedernikova 《统计学通讯:模拟与计算》2015,44(4):1101-1115

In this article, we consider some nonparametric goodness-of-fit tests for right censored samples, viz., the modified Kolmogorov, Cramer–von Mises–Smirnov, Anderson–Darling, and Nikulin–Rao–Robson χ² tests. We also consider an approach based on a transformation of the original censored sample to a complete one and the subsequent application of classical goodness-of-fit tests to the pseudo-complete sample. We then compare these tests in terms of power in the case of Type II censored data along with the power of the Neyman–Pearson test, and draw some conclusions. Finally, we present an illustrative example. 相似文献

6.

Was Quetelet’s Average Man Normal?

Eugene D. Gallagher 《The American statistician》2020,74(3):301-306

Abstract

Quetelet’s data on Scottish chest girths are analyzed with eight normality tests. In contrast to Quetelet’s conclusion that the data are fit well by what is now known as the normal distribution, six of eight normality tests provide strong evidence that the chest circumferences are not normally distributed. Using corrected chest circumferences from Stigler, the χ² test no longer provides strong evidence against normality, but five commonly used normality tests do. The D’Agostino–Pearson K² and Jarque–Bera tests, based only on skewness and kurtosis, find that both Quetelet’s original data and the Stigler-corrected data are consistent with the hypothesis of normality. The major reason causing most normality tests to produce low p-values, indicating that Quetelet’s data are not normally distributed, is that the chest circumferences were reported in whole inches and rounding of large numbers of observations can produce many tied values that strongly affect most normality tests. Users should be cautious using many standard normality tests if data have ties, are rounded, and the ratio of the standard deviation to rounding interval is small. 相似文献

7.

A powerful and interpretable alternative to the Jarque–Bera test of normality based on 2nd-power skewness and kurtosis,using the Rao's score test on the APD family

A. Desgagné 《Journal of applied statistics》2018,45(13):2307-2327

We introduce the 2nd-power skewness and kurtosis, which are interesting alternatives to the classical Pearson's skewness and kurtosis, called 3rd-power skewness and 4th-power kurtosis in our terminology. We use the sample 2nd-power skewness and kurtosis to build a powerful test of normality. This test can also be derived as Rao's score test on the asymmetric power distribution, which combines the large range of exponential tail behavior provided by the exponential power distribution family with various levels of asymmetry. We find that our test statistic is asymptotically chi-squared distributed. We also propose a modified test statistic, for which we show numerically that the distribution can be approximated for finite sample sizes with very high precision by a chi-square. Similarly, we propose a directional test based on sample 2nd-power kurtosis only, for the situations where the true distribution is known to be symmetric. Our tests are very similar in spirit to the famous Jarque–Bera test, and as such are also locally optimal. They offer the same nice interpretation, with in addition the gold standard power of the regression and correlation tests. An extensive empirical power analysis is performed, which shows that our tests are among the most powerful normality tests. Our test is implemented in an R package called PoweR. 相似文献

8.

A Monte Carlo comparison of Jarque–Bera type tests and Henze–Zirkler test of multivariate normality

Zofia Hanusz Rie Enomoto Takashi Seo Kazuyuki Koizumi 《统计学通讯:模拟与计算》2018,47(5):1439-1452

In the paper, tests for multivariate normality (MVN) of Jarque-Bera type, based on skewness and kurtosis, have been considered. Tests proposed by Mardia and Srivastava, and the combined tests based on skewness and kurtosis defined by Jarque and Bera have been taken into account. In the Monte Carlo simulations, for each combination of p = 2, 3, 4, 5 number of traits and n = 10(5)50(10)100 sample sizes 10,000 runs have been done to calculate empirical Type I errors of tests under consideration, and empirical power against different alternative distributions. Simulation results have been compared to the Henze–Zirkler’s test. It should be stressed that no test yet proposed is uniformly better than all the others in every combination of conditions examined. 相似文献

9.

Jarque–Bera Test and its Competitors for Testing Normality – A Power Comparison

Thorsten Thadewald 《Journal of applied statistics》2007,34(1):87-105

For testing normality we investigate the power of several tests, first of all, the well-known test of Jarque & Bera (1980) and furthermore the tests of Kuiper (1960) and Shapiro & Wilk (1965) as well as tests of Kolmogorov–Smirnov and Cramér-von Mises type. The tests on normality are based, first, on independent random variables (model I) and, second, on the residuals in the classical linear regression (model II). We investigate the exact critical values of the Jarque–Bera test and the Kolmogorov–Smirnov and Cramér-von Mises tests, in the latter case for the original and standardized observations where the unknown parameters μ and σ have to be estimated. The power comparison is carried out via Monte Carlo simulation assuming the model of contaminated normal distributions with varying parameters μ and σ and different proportions of contamination. It turns out that for the Jarque–Bera test the approximation of critical values by the chi-square distribution does not work very well. The test is superior in power to its competitors for symmetric distributions with medium up to long tails and for slightly skewed distributions with long tails. The power of the Jarque–Bera test is poor for distributions with short tails, especially if the shape is bimodal – sometimes the test is even biased. In this case a modification of the Cramér-von Mises test or the Shapiro–Wilk test may be recommended. 相似文献

10.

Behaviour of skewness, kurtosis and normality tests in long memory data

Mohamed Boutahar 《Statistical Methods and Applications》2010,19(2):193-215

We establish the limiting distributions for empirical estimators of the coefficient of skewness, kurtosis, and the Jarque–Bera normality test statistic for long memory linear processes. We show that these estimators, contrary to the case of short memory, are neither ${\sqrt{n}}We establish the limiting distributions for empirical estimators of the coefficient of skewness, kurtosis, and the Jarque–Bera normality test statistic for long memory linear processes. We show that these estimators, contrary to the case of short memory, are neither ?n{\sqrt{n}}-consistent nor asymptotically normal. The normalizations needed to obtain the limiting distributions depend on the long memory parameter d. A direct consequence is that if data are long memory then testing normality with the Jarque–Bera test by using the chi-squared critical values is not valid. Therefore, statistical inference based on skewness, kurtosis, and the Jarque–Bera normality test, needs a rescaling of the corresponding statistics and computing new critical values of their nonstandard limiting distributions. 相似文献

11.

The Sensitivity of Chi-Squared Goodness-of-Fit Tests to the Partitioning of Data

《Econometric Reviews》2013,32(4):341-370

Abstract

The power of Pearson's overall goodness-of-fit test and the components-of-chi-squared or “Pearson analog” tests of Anderson [Anderson, G. (1994). Simple tests of distributional form. J. Econometrics 62:265–276] to detect rejections due to shifts in location, scale, skewness and kurtosis is studied, as the number and position of the partition points is varied. Simulations are conducted for small and moderate sample sizes. It is found that smaller numbers of classes than are used in practice may be appropriate, and that the choice of non-equiprobable classes can result in substantial gains in power. 相似文献

12.

Testing high-dimensional normality based on classical skewness and Kurtosis with a possible small sample size

Jiajuan Liang Man-Lai Tang Xuejing Zhao 《统计学通讯:理论与方法》2013,42(23):5719-5732

Abstract

By using the idea of principal component analysis, we propose an approach to applying the classical skewness and kurtosis statistics for detecting univariate normality to testing high-dimensional normality. High-dimensional sample data are projected to the principal component directions on which the classical skewness and kurtosis statistics can be constructed. The theory of spherical distributions is employed to derive the null distributions of the combined statistics constructed from the principal component directions. A Monte Carlo study is carried out to demonstrate the performance of the statistics on controlling type I error rates and a simple power comparison with some existing statistics. The effectiveness of the proposed statistics is illustrated by two real-data examples. 相似文献

13.

Tests for multivariate normality based on canonical correlations

Måns Thulin 《Statistical Methods and Applications》2014,23(2):189-208

We propose new affine invariant tests for multivariate normality, based on independence characterizations of the sample moments of the normal distribution. The test statistics are obtained using canonical correlations between sets of sample moments in a way that resembles the construction of Mardia’s skewness measure and generalizes the Lin–Mudholkar test for univariate normality. The tests are compared to some popular tests based on Mardia’s skewness and kurtosis measures in an extensive simulation power study and are found to offer higher power against many of the alternatives. 相似文献

14.

Testing residual normality in the ANOVA model

Douglas G. Bonett J. Arthur Woodward 《Journal of applied statistics》1990,17(3):383-387

The use of single group skewness and kurtosis critical values for the assessment of residual normality in the ANOVA model is examined. Using single group critical values gives a conservative test of residual normality in multiple group designs. As the sample size per group increases, the empirical Type I error rates for the skewness and kurtosis tests of residual normality approach a. These results supplement previous work which has focused on testing residual normality in the linear regression model. 相似文献

15.

Implementation of Kolmogorov–Smirnov P-value computation in Visual Basic®: implication for Microsoft Excel® library function

《Journal of Statistical Computation and Simulation》2012,82(12):1727-1741

This paper investigates methodologies for evaluating the probabilistic value (P-value) of the Kolmogorov–Smirnov (K–S) goodness-of-fit test using algorithmic program development implemented in Microsoft® Visual Basic® (VB). Six methods were examined for the one-sided one-sample and two methods for the two-sided one-sample cumulative sampling distributions in the investigative software implementation that was based on machine-precision arithmetic. For sample sizes n≤2000 considered, results from the Smirnov iterative method found optimal accuracy for K–S P-values≥0.02, while those from the SmirnovD were more accurate for lower P-values for the one-sided one-sample distribution statistics. Also, the Durbin matrix method sustained better P-value results than the Durbin recursion method for the two-sided one-sample tests up to n≤700 sample sizes. Based on these results, an algorithm for Microsoft Excel® function was proposed from which a model function was developed and its implementation was used to test the performance of engineering students in a general engineering course across seven departments. 相似文献

16.

The robustness of the two—sample t—test over the Pearson system

《Journal of Statistical Computation and Simulation》2012,82(3-4):295-311

The present paper has as its objective an accurate quantification of the robustness of the two–sample t-test over an extensive practical range of distributions. The method is that of a major Monte Carlo study over the Pearson system of distributions and the details indicate that the results are quite accurate. The study was conducted over the range β ₁ =0.0(0.4)2.0 (negative and positive skewness) and β ₂ =1.4 (0.4)7.8 with equal sample sizes and for both the one-and two-tail t-tests. The significance level and power levels (for nominal values of 0.05, 0.50, and 0.95, respectively) were evaluated for each underlying distribution and for each sample size, with each probability evaluated from 100,000 generated values of the test-statistic. The results precisely quantify the degree of robustness inherent in the two-sample t-test and indicate to a user the degree of confidence one can have in this procedure over various regions of the Pearson system. The results indicate that the equal-sample size two-sample t-test is quite robust with respect to departures from normality, perhaps even more so than most people realize. 相似文献

17.

On the distributions of multivariate sample skewness

Naoya Okamoto Takashi Seo 《Journal of statistical planning and inference》2010

In this paper, we consider the multivariate normality test based on measure of multivariate sample skewness defined by Srivastava (1984). Srivastava derived asymptotic expectation up to the order N⁻¹ for the multivariate sample skewness and approximate χ²

χ^{2}

test statistic, where N is sample size. Under normality, we derive another expectation and variance for Srivastava's multivariate sample skewness in order to obtain a better test statistic. From this result, improved approximate χ²

χ^{2}

test statistic using the multivariate sample skewness is also given for assessing multivariate normality. Finally, the numerical result by Monte Carlo simulation is shown in order to evaluate accuracy of the obtained expectation, variance and improved approximate χ²

χ^{2}

test statistic. Furthermore, upper and lower percentiles of χ²

χ^{2}

test statistic derived in this paper are compared with those of χ²

χ^{2}

test statistic derived by Mardia (1974) which is used multivariate sample skewness defined by Mardia (1970). 相似文献

18.

Graphical comparison of normality tests for unimodal distribution data

José A. Sánchez-Espigares Lluís Marco-Almagro 《Journal of Statistical Computation and Simulation》2019,89(1):145-154

A methodology is proposed to compare the power of normality tests with a wide variety of alternative unimodal distributions. It is based on the representation of a distribution mosaic in which kurtosis varies vertically and skewness horizontally. The mosaic includes distributions such as exponential, Laplace or uniform, with normal occupying the centre. Simulation is used to determine the probability of a sample from each distribution in the mosaic being accepted as normal. We demonstrate our proposal by applying it to the analysis and comparison of some of the most well-known tests. 相似文献

19.

The Distribution of the Kolmogorov–Smirnov,Cramer–von Mises,and Anderson–Darling Test Statistics for Exponential Populations with Estimated Parameters

Diane L. Evans John H. Drew 《统计学通讯:模拟与计算》2013,42(7):1396-1421

This article presents a derivation of the distribution of the Kolmogorov–Smirnov, Cramer–von Mises, and Anderson–Darling test statistics in the case of exponential sampling when the parameters are unknown and estimated from sample data for small sample sizes via maximum likelihood. 相似文献

20.

On a goodness-of-fit test for normality with unknown parameters and type-II censored data

Claudia Castro-Kuriss Diana M. Kelmansky Elena J. Martínez 《Journal of applied statistics》2010,37(7):1193-1211

We propose a new goodness-of-fit test for normal and lognormal distributions with unknown parameters and type-II censored data. This test is a generalization of Michael's test for censored samples, which is based on the empirical distribution and a variance stabilizing transformation. We estimate the parameters of the model by using maximum likelihood and Gupta's methods. The quantiles of the distribution of the test statistic under the null hypothesis are obtained through Monte Carlo simulations. The power of the proposed test is estimated and compared to that of the Kolmogorov–Smirnov test also using simulations. The new test is more powerful than the Kolmogorov–Smirnov test in most of the studied cases. Acceptance regions for the PP, QQ and Michael's stabilized probability plots are derived, making it possible to visualize which data contribute to the decision of rejecting the null hypothesis. Finally, an illustrative example is presented. 相似文献