期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Two separate effects of variance heterogeneity on the validity and power of significance tests of location

Donald W. Zimmerman 《Statistical Methodology》2006,3(4):351-374

Heterogeneity of variances of treatment groups influences the validity and power of significance tests of location in two distinct ways. First, if sample sizes are unequal, the Type I error rate and power are depressed if a larger variance is associated with a larger sample size, and elevated if a larger variance is associated with a smaller sample size. This well-established effect, which occurs in t and F tests, and to a lesser degree in nonparametric rank tests, results from unequal contributions of pooled estimates of error variance in the computation of test statistics. It is observed in samples from normal distributions, as well as non-normal distributions of various shapes. Second, transformation of scores from skewed distributions with unequal variances to ranks produces differences in the means of the ranks assigned to the respective groups, even if the means of the initial groups are equal, and a subsequent inflation of Type I error rates and power. This effect occurs for all sample sizes, equal and unequal. For the t test, the discrepancy diminishes, and for the Wilcoxon–Mann–Whitney test, it becomes larger, as sample size increases. The Welch separate-variance t test overcomes the first effect but not the second. Because of interaction of these separate effects, the validity and power of both parametric and nonparametric tests performed on samples of any size from unknown distributions with possibly unequal variances can be distorted in unpredictable ways. 相似文献

2.

Measures of multivariate skewness and kurtosis for tests of nonnormality

H. Lütkepohl B. Theilen 《Statistical Papers》1991,32(1):179-193

Measures of multivariate skewness and kurtosis are proposed that are based on the skewness and kurtosis of individual components of standardized sample vectors. Asymptotic properties and small sample critical values of tests for nonnormality based on these measures are provided. It is demonstrated that the tests have favorable power properties. Extensions to time series data are pointed out. 相似文献

3.

Use of reference distributions when dealing with unknown regression errors

《Journal of Statistical Computation and Simulation》2012,82(10):1195-1204

A problem of estimating regression coefficients is considered when the distribution of error terms is unknown but symmetric. We propose the use of reference distributions having various kurtosis values. It is assumed that the true error distribution is one of the reference distributions, but the indicator variable for the true distribution is missing. The generalized expectation–maximization algorithm combined with a line search is developed for estimating regression coefficients. Simulation experiments are carried out to compare the performance of the proposed approach with some existing robust regression methods including least absolute deviation, Lp, Huber M regression and an approximation using normal mixtures under various error distributions. As the error distribution is far from a normal distribution, the proposed method is observed to show better performance than other methods. 相似文献

4.

Multiple Linear Regression Model Under Nonnormality

《统计学通讯:理论与方法》2013,42(10):2443-2467

Abstract

We consider multiple linear regression models under nonnormality. We derive modified maximum likelihood estimators (MMLEs) of the parameters and show that they are efficient and robust. We show that the least squares esimators are considerably less efficient. We compare the efficiencies of the MMLEs and the M estimators for symmetric distributions and show that, for plausible alternatives to an assumed distribution, the former are more efficient. We provide real-life examples. 相似文献

5.

Minimum distance lack-of-fit tests for fixed design

Hira L. Koul 《Journal of statistical planning and inference》2011,141(1):65-79

This paper discusses a class of tests of lack-of-fit of a parametric regression model when design is non-random and uniform on [0,1]. These tests are based on certain minimized distances between a nonparametric regression function estimator and the parametric model being fitted. We investigate asymptotic null distributions of the proposed tests, their consistency and asymptotic power against a large class of fixed and sequences of local nonparametric alternatives, respectively. The best fitted parameter estimate is seen to be n^1/2-consistent and asymptotically normal. A crucial result needed for proving these results is a central limit lemma for weighted degenerate U statistics where the weights are arrays of some non-random real numbers. This result is of an independent interest and an extension of a result of Hall for non-weighted degenerate U statistics. 相似文献

6.

Effects of the generalized Box–Cox transformation on Type I error rate and power of Hotelling's T 2

《Journal of Statistical Computation and Simulation》2012,82(3):199-206

Most multivariate statistical techniques rely on the assumption of multivariate normality. The effects of nonnormality on multivariate tests are assumed to be negligible when variance–covariance matrices and sample sizes are equal. Therefore, in practice, investigators usually do not attempt to assess multivariate normality. In this simulation study, the effects of skewed and leptokurtic multivariate data on the Type I error and power of Hotelling's T ² were examined by manipulating distribution, sample size, and variance–covariance matrix. The empirical Type I error rate and power of Hotelling's T ² were calculated before and after the application of generalized Box–Cox transformation. The findings demonstrated that even when variance–covariance matrices and sample sizes are equal, small to moderate changes in power still can be observed. 相似文献

7.

Approximate testing in two-stage nonlinear mixed models

J.H. Burton J. Volaufova 《Journal of Statistical Computation and Simulation》2015,85(13):2656-2665

We investigate here small sample properties of approximate F-tests about fixed effects parameters in nonlinear mixed models. For estimation of population fixed effects parameters as well as variance components, we apply the two-stage approach. This method is useful and popular when the number of observations per sampling unit is large enough. The approximate F-test is constructed based on large-sample approximation to the distribution of nonlinear least-squares estimates of subject-specific parameters. We recommend a modified test statistic that takes into consideration approximation to the large-sample Fisher information matrix (See [Volaufova J, Burton JH. Note on hypothesis testing in mixed models. Oral presentation at: LINSTAT 2012/21st IWMS; 2012; Bedlewo, Poland]). Our main focus is on comparing finite sample properties of broadly used approximate tests (Wald test and likelihood ratio test) and the modified F-test under the null hypothesis, especially accuracy of p-values (See [Volaufova J, LaMotte L. Comparison of approximate tests of fixed effects in linear repeated measures design models with covariates. Tatra Mountains. 2008;39:17–25]). For that purpose two extensive simulation studies are conducted based on pharmacokinetic models (See [Hartford A, Davidian M. Consequences of misspecifying assumptions in nonlinear mixed effects models. Comput Stat and Data Anal. 2000;34:139–164; Pinheiro J, Bates D. Approximations to the log-likelihood function in the non-linear mixed-effects model. J Comput Graph Stat. 1995;4(1):12–35]). 相似文献

8.

Small-Sample Methods for Cluster-Robust Variance Estimation and Hypothesis Testing in Fixed Effects Models

James E. Pustejovsky Elizabeth Tipton 《商业与经济统计学杂志》2013,31(4):672-683

ABSTRACT

In panel data models and other regressions with unobserved effects, fixed effects estimation is often paired with cluster-robust variance estimation (CRVE) to account for heteroscedasticity and un-modeled dependence among the errors. Although asymptotically consistent, CRVE can be biased downward when the number of clusters is small, leading to hypothesis tests with rejection rates that are too high. More accurate tests can be constructed using bias-reduced linearization (BRL), which corrects the CRVE based on a working model, in conjunction with a Satterthwaite approximation for t-tests. We propose a generalization of BRL that can be applied in models with arbitrary sets of fixed effects, where the original BRL method is undefined, and describe how to apply the method when the regression is estimated after absorbing the fixed effects. We also propose a small-sample test for multiple-parameter hypotheses, which generalizes the Satterthwaite approximation for t-tests. In simulations covering a wide range of scenarios, we find that the conventional cluster-robust Wald test can severely over-reject while the proposed small-sample test maintains Type I error close to nominal levels. The proposed methods are implemented in an R package called clubSandwich. This article has online supplementary materials. 相似文献

9.

Using principal components to test normality of high-dimensional data

Rashid Mansoor 《统计学通讯:模拟与计算》2017,46(5):3396-3405

Many multivariate statistical procedures are based on the assumption of normality and different approaches have been proposed for testing this assumption. The vast majority of these tests, however, are exclusively designed for cases when the sample size n is larger than the dimension of the variable p, and the null distributions of their test statistics are usually derived under the asymptotic case when p is fixed and n increases. In this article, a test that utilizes principal components to test for nonnormality is proposed for cases when p/n → c. The power and size of the test are examined through Monte Carlo simulations, and it is argued that the test remains well behaved and consistent against most nonnormal distributions under this type of asymptotics. 相似文献

10.

THE RELATIONSHIPS BETWEEN SKEWNESS AND KURTOSIS

H.L. MacGillivray K.P. Balanda 《Australian & New Zealand Journal of Statistics》1988,30(3):319-337

Theoretical considerations of kurtosis, whether of partial orderings of distributions with respect to kurtosis or of measures of kurtosis, have tended to focus only on symmetric distributions. With reference to historical points and recent work on skewness and kurtosis, this paper defines anti-skewness and uses it as a tool to discuss the concept of kurtosis in asymmetric univariate distributions. The discussion indicates that while kurtosis is best considered as a property of symmetrised versions of distributions, symmetrisation does not simply remove skewness. Skewness, anti-skewness and kurtosis are all inter-related aspects of shape. The Tukey g and h family and the Johnson Su family are considered as examples. 相似文献

11.

Testing for structural breaks in the presence of data perturbations: impacts and wavelet-based improvements

《Journal of Statistical Computation and Simulation》2012,82(17):3468-3479

This paper investigates how classical measurement error and additive outliers (AO) influence tests for structural change based on F-statistics. We derive theoretically the impact of general additive disturbances in the regressors on the asymptotic distribution of these tests for structural change. The small sample properties in the case of classical measurement error and AO are investigated via Monte Carlo simulations, revealing that sizes are biased upwards and that powers are reduced. Two-wavelet-based denoising methods are used to reduce these distortions. We show that these two methods can significantly improve the performance of structural break tests. 相似文献

12.

Testing for normality in linear regression models

《Journal of Statistical Computation and Simulation》2012,82(10):1101-1113

The importance of the normal distribution for fitting continuous data is well known. However, in many practical situations data distribution departs from normality. For example, the sample skewness and the sample kurtosis are far away from 0 and 3, respectively, which are nice properties of normal distributions. So, it is important to have formal tests of normality against any alternative. D'Agostino et al. [A suggestion for using powerful and informative tests of normality, Am. Statist. 44 (1990), pp. 316–321] review four procedures Z ²(g ₁), Z ²(g ₂), D and K ² for testing departure from normality. The first two of these procedures are tests of normality against departure due to skewness and kurtosis, respectively. The other two tests are omnibus tests. An alternative to the normal distribution is a class of skew-normal distributions (see [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178]). In this paper, we obtain a score test (W) and a likelihood ratio test (LR) of goodness of fit of the normal regression model against the skew-normal family of regression models. It turns out that the score test is based on the sample skewness and is of very simple form. The performance of these six procedures, in terms of size and power, are compared using simulations. The level properties of the three statistics LR, W and Z ²(g ₁) are similar and close to the nominal level for moderate to large sample sizes. Also, their power properties are similar for small departure from normality due to skewness (γ₁≤0.4). Of these, the score test statistic has a very simple form and computationally much simpler than the other two statistics. The LR statistic, in general, has highest power, although it is computationally much complex as it requires estimates of the parameters under the normal model as well as those under the skew-normal model. So, the score test may be used to test for normality against small departure from normality due to skewness. Otherwise, the likelihood ratio statistic LR should be used as it detects general departure from normality (due to both skewness and kurtosis) with, in general, largest power. 相似文献

13.

ON UMP INVARIANT F-TEST PROCEDURES IN A GENERAL LINEAR MODEL

《统计学通讯:理论与方法》2013,42(10):2099-2115

A simple method of setting linear hypotheses for a split mean vector testable by F-tests in a general linear model, when the covariance matrix has a general form and is completely unknown, is provided by extending the method discussed in Ukita et al. The critical functions in these F-tests are constructed as UMP invariants, when the covariance matrix has a known structure. Further critical functions in F-tests of linear hypotheses for the other split mean vector in the model are shown to be UMP invariant if the same known structure of the covariance matrix is assumed. 相似文献

14.

Asymptotic Expansion and Conditional Robustness for the Sample Multiple Correlation Coefficient Under Nonnormality

Haruhiko Ogasawara 《统计学通讯:模拟与计算》2013,42(1):177-199

ABSTRACT

Asymptotic distributions of the standardized estimators of the squared and non squared multiple correlation coefficients under nonnormality were obtained using Edgeworth expansion up to O(1/n). Conditions for the normal-theory asymptotic biases and variances to hold under nonnormality were derived with respect to the parameter values and the weighted sum of the cumulants of associated variables. The condition for the cumulants indicates a compensatory effect to yield the robust normal-theory lower-order cumulants. Simulations were performed to see the usefulness of the formulas of the asymptotic expansions using the model with the asymptotic robustness under nonnormality, which showed that the approximations by Edgeworth expansions were satisfactory. 相似文献

15.

A Parametric Bootstrap Test for Two-Way ANOVA Model Without Interaction Under Heteroscedasticity

Liwen Xu Fangqin Yang Ranran Chen 《统计学通讯:模拟与计算》2015,44(5):1264-1272

In this article we consider the two-way ANOVA model without interaction under heteroscedasticity. For the problem of testing equal effects of factors, we propose a parametric bootstrap (PB) approach and compare it with existing the generalized F (GF) test. The Type I error rates and powers of the tests are evaluated using Monte Carlo simulation. Our studies show that the PB test performs better than the GF test. The PB test performs very satisfactorily even for small samples while the GF test exhibits poor Type I error properties when the number of factorial combinations or treatments goes up. It is also noted that the same tests can be used to test the significance of random effect variance component in a two-way mixed-effects model under unequal error variances. 相似文献

16.

ESTIMATING THE MAXIMUM (MINIMUM) OF A QUADRATIC FUNCTION UNDER CERTAIN CORRELATION PATTERNS

Stephen M. Scariano 《Australian & New Zealand Journal of Statistics》1986,28(1):46-51

May nonidentity error correlation patterns encountered in regression theory allow computationally efficient techniques for modifying the usual procedures for constructing confidence intervals and performing F-tests so that valid inferences can be drawn. Such techniques are explored in light of the general theme of this note. 相似文献

17.

Parametric bootstrap tests for unbalanced nested designs under heteroscedasticity

《Journal of Statistical Computation and Simulation》2012,82(9):2059-2070

In this article, we consider the two-factor unbalanced nested design model without the assumption of equal error variance. For the problem of testing ‘main effects’ of both factors, we propose a parametric bootstrap (PB) approach and compare it with the existing generalized F (GF) test. The Type I error rates of the tests are evaluated using Monte Carlo simulation. Our studies show that the PB test performs better than the GF test. The PB test performs very satisfactorily even for small samples while the GF test exhibit poor Type I error properties when the number of factorial combinations or treatments goes up. It is also noted that the same tests can be used to test the significance of the random effect variance component in a two-factor mixed effects nested model under unequal error variances. 相似文献

18.

The split sample permutation t-tests

Shunpu Zhang 《Journal of statistical planning and inference》2009

Without the exchangeability assumption, permutation tests for comparing two population means do not provide exact control of the probability of making a Type I error. Another drawback of permutation tests is that it cannot be used to test hypothesis about one population. In this paper, we propose a new type of permutation tests for testing the difference between two population means: the split sample permutation t-tests. We show that the split sample permutation t-tests do not require the exchangeability assumption, are asymptotically exact and can be easily extended to testing hypothesis about one population. Extensive simulations were carried out to evaluate the performance of two specific split sample permutation t-tests: the split in the middle permutation t-test and the split in the end permutation t-test. The simulation results show that the split in the middle permutation t-test has comparable performance to the permutation test if the population distributions are symmetric and satisfy the exchangeability assumption. Otherwise, the split in the end permutation t-test has significantly more accurate control of level of significance than the split in the middle permutation t-test and other existing permutation tests. 相似文献

19.

Uniform robustness against nonnormality of the t and f tests

Hanfeng Chen Wei-Yin Loh 《统计学通讯:理论与方法》2013,42(10):3707-3723

The size of the two-sample t test is generally thought to be robust against nonnormal distributions if the sample sizes are large. This belief is based on central limit theory, and asymptotic expansions of the moments of the t statistic suggest that robustness may be improved for moderate sample sizes if the variance, skewness, and kurtosis of the distributions are matched, particularly if the sample sizes are also equal.

It is shown that asymptotic arguments such as these can be misleading and that, in fact, the size of the t test can be as large as unity if the distributions are allowed to be completely arbitrary. Restricting the distributions to be identical or symmetric (but otherwise arbitrary) does not guarantee that the size can be controlled either, but controlling the tail-heaviness of the distributions does. The last result is proved more generally for the k-sample F test. 相似文献

20.

Tests of regression coefficients under ridge regression models

《Journal of Statistical Computation and Simulation》2012,82(1-4):341-356

This paper investigates two “non-exact” t-type tests, t( k₂) and t(k₂), of the individual coefficients of a linear regression model, based on two ordinary ridge estimators. The reported results are built on a simulation study covering 84 different models. For models with large standard errors, the ridge-based t-tests have correct levels with considerable gain in powers over those of the least squares t-test, t(0). For models with small standard errors, t(k₁) is found to be liberal and is not safe to use while, t(k₂) is found to slightly exceed the nominal level in few cases. When tie two ridge tests art: not winners, the results indicate that they don't loose much against t(0). 相似文献