首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The use of the correlation coefficient is suggested as a technique for summarizing and objectively evaluating the information contained in probability plots. Goodness-of-fit tests are constructed using this technique for several commonly used plotting positions for the normal distribution. Empirical sampling methods are used to construct the null distribution for these tests, which are then compared on the basis of power against certain nonnormal alternatives. Commonly used regression tests of fit are also included in the comparisons. The results indicate that use of the plotting position pi = (i - .375)/(n + .25) yields a competitive regression test of fit for normality.  相似文献   

2.
The family of symmetric generalized exponential power (GEP) densities offers a wide range of tail behaviors, which may be exponential, polynomial, and/or logarithmic. In this article, a test of normality based on Rao's score statistic and this family of GEP alternatives is proposed. This test is tailored to detect departures from normality in the tails of the distribution. The main interest of this approach is that it provides a test with a large family of symmetric alternatives having non-normal tails. In addition, the test's statistic consists of a combination of three quantities that can be interpreted as new measures of tail thickness. In a Monte-Carlo simulation study, the proposed test is shown to perform well in terms of power when compared to its competitors.  相似文献   

3.
In statistical modeling, we strive to specify models that resemble data collected in studies or observed from processes. Consequently, distributional specification and parameter estimation are central to parametric models. Graphical procedures, such as the quantile–quantile (QQ) plot, are arguably the most widely used method of distributional assessment, though critics find their interpretation to be overly subjective. Formal goodness of fit tests are available and are quite powerful, but only indicate whether there is a lack of fit, not why there is lack of fit. In this article, we explore the use of the lineup protocol to inject rigor into graphical distributional assessment and compare its power to that of formal distributional tests. We find that lineup tests are considerably more powerful than traditional tests of normality. A further investigation into the design of QQ plots shows that de-trended QQ plots are more powerful than the standard approach as long as the plot preserves distances in x and y to be the same. While we focus on diagnosing nonnormality, our approach is general and can be directly extended to the assessment of other distributions.  相似文献   

4.
Sample kurtosis is a member of the large class of absolute moment tests of normality. We compare kurtosis to other absolute moment tests to determine which are the most powerful at detecting long‐tailed symmetric departures from normality for large samples. The large sample power of the tests is calculated using Geary's (1947) approximations of the moments of the test statistics. Using the system of Gram-Charlier symmetric distributions as alternatives, the most power is obtained using a moment in the range 2.5 ‐ 3.5.  相似文献   

5.
For location–scale families, we consider a random distance between the sample order statistics and the quasi sample order statistics derived from the null distribution as a measure of discrepancy. The conditional qth quantile and expectation of the random discrepancy on the given sample are chosen as test statistics. Simulation results of powers against various alternatives are illustrated under the normal and exponential hypotheses for moderate sample size. The proposed tests, especially the qth quantile tests with a small or large q, are shown to be more powerful than other prominent goodness-of-fit tests in most cases.  相似文献   

6.
Abstract. We consider the problem of testing the equality of J quantile curves from independent samples. A test statistic based on an L2‐distance between non‐crossing non‐parametric estimates of the quantile curves from the individual samples is proposed. Asymptotic normality of this statistic is established under the null hypothesis, local and fixed alternatives, and the finite sample properties of a bootstrap‐based version of this test statistic are investigated by means of a simulation study.  相似文献   

7.
We introduce the 2nd-power skewness and kurtosis, which are interesting alternatives to the classical Pearson's skewness and kurtosis, called 3rd-power skewness and 4th-power kurtosis in our terminology. We use the sample 2nd-power skewness and kurtosis to build a powerful test of normality. This test can also be derived as Rao's score test on the asymmetric power distribution, which combines the large range of exponential tail behavior provided by the exponential power distribution family with various levels of asymmetry. We find that our test statistic is asymptotically chi-squared distributed. We also propose a modified test statistic, for which we show numerically that the distribution can be approximated for finite sample sizes with very high precision by a chi-square. Similarly, we propose a directional test based on sample 2nd-power kurtosis only, for the situations where the true distribution is known to be symmetric. Our tests are very similar in spirit to the famous Jarque–Bera test, and as such are also locally optimal. They offer the same nice interpretation, with in addition the gold standard power of the regression and correlation tests. An extensive empirical power analysis is performed, which shows that our tests are among the most powerful normality tests. Our test is implemented in an R package called PoweR.  相似文献   

8.
The Shapiro–Francia (SF) normality test is an important test in statistical modelling. However, little has been done by researchers to compare the performance of this test to other normality tests. This paper therefore measures the performance of the SF and other normality tests by studying the distribution of their p-values. For the purpose of this study, we selected eight well-known normality tests to compare with the SF test: (i) Kolmogorov–Smirnov (KS), (ii) Anderson–Darling (AD), (iii) Cramer von Mises (CM), (iv) Lilliefors (LF), (v) Shapiro–Wilk (SW), (vi) Pearson chi-square (PC), (vii) Jarque– Bera (JB) and (viii) D'Agostino (DA). The distribution of p-values of these normality tests were obtained by generating data from normal distribution and well-known symmetric non-normal distribution at various sample sizes (small, medium and large). Our simulation results showed that the SF normality test was the best test statistic in detecting deviation from normality among the nine tests considered at all sample sizes.  相似文献   

9.
Probability plots allow us to determine whether a set of sample observations is distributed according to a theoretical distribution. Plotting positions are fundamental elements in statistics and, in particular, for the construction of probability plots. In this paper, a new plotting position to construct different probability plots, such as Q–Q Plot, P–P Plot and S–P Plot, is proposed. The proposed definition is based on the median of the ith order statistic of the theoretical distribution considered. The main feature of this plotting position formula is that it is independent of the theoretical distribution selected. Moreover, the procedure developed is ‘almost’ exact, reaching, without a high cost of time, an accuracy as great as we want, which avoids using approximations (proposed by other authors).  相似文献   

10.
A goodness-of-fit test for multivariate normality is proposed which is based on Shapiro–Wilk's statistic for univariate normality and on an empirical standardization of the observations. The critical values can be approximated by using a transformation of the univariate standard normal distribution. A Monte Carlo study reveals that this test has a better power performance than some of the best known tests for multinormality against a wide range of alternatives.  相似文献   

11.
In the last few years, two adaptive tests for paired data have been proposed. One test proposed by Freidlin et al. [On the use of the Shapiro–Wilk test in two-stage adaptive inference for paired data from moderate to very heavy tailed distributions, Biom. J. 45 (2003), pp. 887–900] is a two-stage procedure that uses a selection statistic to determine which of three rank scores to use in the computation of the test statistic. Another statistic, proposed by O'Gorman [Applied Adaptive Statistical Methods: Tests of Significance and Confidence Intervals, Society for Industrial and Applied Mathematics, Philadelphia, 2004], uses a weighted t-test with the weights determined by the data. These two methods, and an earlier rank-based adaptive test proposed by Randles and Hogg [Adaptive Distribution-free Tests, Commun. Stat. 2 (1973), pp. 337–356], are compared with the t-test and to Wilcoxon's signed-rank test. For sample sizes between 15 and 50, the results show that the adaptive test proposed by Freidlin et al. and the adaptive test proposed by O'Gorman have higher power than the other tests over a range of moderate to long-tailed symmetric distributions. The results also show that the test proposed by O'Gorman has greater power than the other tests for short-tailed distributions. For sample sizes greater than 50 and for small sample sizes the adaptive test proposed by O'Gorman has the highest power for most distributions.  相似文献   

12.
A new approach of randomization is proposed to construct goodness of fit tests generally. Some new test statistics are derived, which are based on the stochastic empirical distribution function (EDF). Note that the stochastic EDF for a set of given sample observations is a randomized distribution function. By substituting the stochastic EDF for the classical EDF in the Kolmogorov–Smirnov, Cramér–von Mises, Anderson–Darling, Berk–Jones, and Einmahl–Mckeague statistics, randomized statistics are derived, of which the qth quantile and the expectation are chosen as test statistics. In comparison to existing tests, it is shown, by a simulation study, that the new test statistics are generally more powerful than the corresponding ones based on the classical EDF or modified EDF in most cases.  相似文献   

13.
In this paper, we introduce a new estimator of entropy of a continuous random variable. We compare the proposed estimator with the existing estimators, namely, Vasicek [A test for normality based on sample entropy, J. Roy. Statist. Soc. Ser. B 38 (1976), pp. 54–59], van Es [Estimating functionals related to a density by class of statistics based on spacings, Scand. J. Statist. 19 (1992), pp. 61–72], Correa [A new estimator of entropy, Commun. Statist. Theory and Methods 24 (1995), pp. 2439–2449] and Wieczorkowski-Grzegorewski [Entropy estimators improvements and comparisons, Commun. Statist. Simulation and Computation 28 (1999), pp. 541–567]. We next introduce a new test for normality. By simulation, the powers of the proposed test under various alternatives are compared with normality tests proposed by Vasicek (1976) and Esteban et al. [Monte Carlo comparison of four normality tests using different entropy estimates, Commun. Statist.–Simulation and Computation 30(4) (2001), pp. 761–785].  相似文献   

14.
In the paper, tests for multivariate normality (MVN) of Jarque-Bera type, based on skewness and kurtosis, have been considered. Tests proposed by Mardia and Srivastava, and the combined tests based on skewness and kurtosis defined by Jarque and Bera have been taken into account. In the Monte Carlo simulations, for each combination of p = 2, 3, 4, 5 number of traits and n = 10(5)50(10)100 sample sizes 10,000 runs have been done to calculate empirical Type I errors of tests under consideration, and empirical power against different alternative distributions. Simulation results have been compared to the Henze–Zirkler’s test. It should be stressed that no test yet proposed is uniformly better than all the others in every combination of conditions examined.  相似文献   

15.
Most multivariate statistical techniques rely on the assumption of multivariate normality. The effects of nonnormality on multivariate tests are assumed to be negligible when variance–covariance matrices and sample sizes are equal. Therefore, in practice, investigators usually do not attempt to assess multivariate normality. In this simulation study, the effects of skewed and leptokurtic multivariate data on the Type I error and power of Hotelling's T 2 were examined by manipulating distribution, sample size, and variance–covariance matrix. The empirical Type I error rate and power of Hotelling's T 2 were calculated before and after the application of generalized Box–Cox transformation. The findings demonstrated that even when variance–covariance matrices and sample sizes are equal, small to moderate changes in power still can be observed.  相似文献   

16.
An omnibus test of uniformity based upon the ratios of sample moments and population moments is introduced. Results of a monte carlo power study show that for two types of alternatives considered, the proposed test has good power in comparison with Neyman's test N 2Greenwood's test, Kolmogorov-Smirnov test, and Chi-squared test.  相似文献   

17.
One of the most basic topics in many introductory statistical methods texts is inference for a population mean, μ. The primary tool for confidence intervals and tests is the Student t sampling distribution. Although the derivation requires independent identically distributed normal random variables with constant variance, σ2, most authors reassure the readers about some robustness to the normality and constant variance assumptions. Some point out that if one is concerned about assumptions, one may statistically test these prior to reliance on the Student t. Most software packages provide optional test results for both (a) the Gaussian assumption and (b) homogeneity of variance. Many textbooks advise only informal graphical assessments, such as certain scatterplots for independence, others for constant variance, and normal quantile–quantile plots for the adequacy of the Gaussian model. We concur with this recommendation. As convincing evidence against formal tests of (a), such as the Shapiro–Wilk, we offer a simulation study of the tails of the resulting conditional sampling distributions of the Studentized mean. We analyze the results of systematically screening all samples from normal, uniform, exponential, and Cauchy populations. This pretest does not correct the erroneous significance levels and makes matters worse for the exponential. In practice, we conclude that graphical diagnostics are better than a formal pretest. Furthermore, rank or permutation methods are recommended for exact validity in the symmetric case.  相似文献   

18.
The importance of the normal distribution for fitting continuous data is well known. However, in many practical situations data distribution departs from normality. For example, the sample skewness and the sample kurtosis are far away from 0 and 3, respectively, which are nice properties of normal distributions. So, it is important to have formal tests of normality against any alternative. D'Agostino et al. [A suggestion for using powerful and informative tests of normality, Am. Statist. 44 (1990), pp. 316–321] review four procedures Z 2(g 1), Z 2(g 2), D and K 2 for testing departure from normality. The first two of these procedures are tests of normality against departure due to skewness and kurtosis, respectively. The other two tests are omnibus tests. An alternative to the normal distribution is a class of skew-normal distributions (see [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178]). In this paper, we obtain a score test (W) and a likelihood ratio test (LR) of goodness of fit of the normal regression model against the skew-normal family of regression models. It turns out that the score test is based on the sample skewness and is of very simple form. The performance of these six procedures, in terms of size and power, are compared using simulations. The level properties of the three statistics LR, W and Z 2(g 1) are similar and close to the nominal level for moderate to large sample sizes. Also, their power properties are similar for small departure from normality due to skewness (γ1≤0.4). Of these, the score test statistic has a very simple form and computationally much simpler than the other two statistics. The LR statistic, in general, has highest power, although it is computationally much complex as it requires estimates of the parameters under the normal model as well as those under the skew-normal model. So, the score test may be used to test for normality against small departure from normality due to skewness. Otherwise, the likelihood ratio statistic LR should be used as it detects general departure from normality (due to both skewness and kurtosis) with, in general, largest power.  相似文献   

19.
ABSTRACT

In many real life problems one assumes a normal model because the sample histogram looks unimodal, symmetric, and/or the standard tests like the Shapiro-Wilk test favor such a model. However, in reality, the assumption of normality may be misplaced since the normality tests often fail to detect departure from normality (especially for small sample sizes) when the data actually comes from slightly heavier tail symmetric unimodal distributions. For this reason it is important to see how the existing normal variance estimators perform when the actual distribution is a t-distribution with k degrees of freedom (d.f.) (t k -distribution). This note deals with the performance of standard normal variance estimators under the t k -distributions. It is shown that the relative ordering of the estimators is preserved for both the quadratic loss as well as the entropy loss irrespective of the d.f. and the sample size (provided the risks exist).  相似文献   

20.
Most biomedical research is carried out using longitudinal studies. The method of generalized estimating equations (GEEs) introduced by Liang and Zeger [Longitudinal data analysis using generalized linear models, Biometrika 73 (1986), pp. 13–22] and Zeger and Liang [Longitudinal data analysis for discrete and continuous outcomes, Biometrics 42 (1986), pp. 121–130] has become a standard method for analyzing non-normal longitudinal data. Since then, a large variety of GEEs have been proposed. However, the model diagnostic problem has not been explored intensively. Oh et al. [Modeldiagnostic plots for repeated measures data using the generalized estimating equations approach, Comput. Statist. Data Anal. 53 (2008), pp. 222–232] proposed residual plots based on the quantile–quantile (Q–Q) plots of the χ2-distribution for repeated-measures data using the GEE methodology. They considered the Pearson, Anscombe and deviance residuals. In this work, we propose to extend this graphical diagnostic using a generalized residual. A simulation study is presented as well as two examples illustrating the proposed generalized Q–Q plots.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号