首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 562 毫秒
1.
A method is proposed for calculating the small sample powers of rank tests which are based on the method of n rankings. A class of normal shift alternative hypotheses is considered, and Hodges–Lehmann efficiencies are calculated for the Friedman test.  相似文献   

2.
Testing for ordered alternatives in randomized block designs has been a problem of interest for almost three decades (Jonckheere (1954)). Three classes of rank tests have evolved—tests based on “within-blocks” rankings (W-tests), tests based on “ranking after alignment” within blocks (RAA-tests), and tests based on “among-blocks” rankings (A-Tests). This paper focuses on the latter. A simplified version of the Skillings-Wolfe generalized Purl test (1977) is suggested and two very useful A-tests—a generalized Johnson-Mehrotra “Optimal contrast” procedure and a generalized Tryon-Hettmansperger rank test—are developed. These procedures are compared and contrasted with other recent competitors presented by Skllllngs and Wolfe (1978) and by Salama and Quade (1981).  相似文献   

3.
In this paper, we revisit the problem of testing of the hypothesis of circular symmetry of a bivariate distribution. We propose some nonparametric tests based on sector counts. These include tests based on chi-square goodness-of-fit test, the classical likelihood ratio, mean deviation, and the range. The proposed tests are easy to implement and the exact null distributions for small sample sizes of the test statistics are obtained. Two examples with small and large data sets are given to illustrate the application of the tests proposed. For small and moderate sample sizes, the performances of the proposed tests are evaluated using empirical powers (empirical sizes are also reported). Also, we evaluate the performance of these count-based tests with adaptations of several well-known tests such as the Kolmogorov–Smirnov-type tests, tests based on kernel density estimator, and the Wilcoxon-type tests. It is observed that among the count-based tests the likelihood ratio test performs better.  相似文献   

4.
A class of tests is proposed for testing Exponentiality against the Decreasing Mean Residual Life (DMRL) class of non-exponential probability distributions. These tests are consistent and asymptotically unbiased against all continuous DMRL alternatives. They are U - statistics and hence asymptotically normally distributed. The asymptotic relative efficiency (ARE) with respect to other tests for DMRL are quite high. Small sample powers are also comparable with small sample powers of the competitors.  相似文献   

5.
We revisit the well-known Behrens–Fisher problem and apply a newly developed ‘Computational Approach Test’ (CAT) to test the equality of two population means where the populations are assumed to be normal with unknown and possibly unequal variances. An advantage of the CAT is that it does not require the explicit knowledge of the sampling distribution of the test statistic. The CAT is then compared with three widely accepted tests—Welch–Satterthwaite test (WST), Cochran–Cox test (CCT), ‘Generalized p-value’ test (GPT)—and a recently suggested test based on the jackknife procedure, called Singh–Saxena–Srivastava test (SSST). Further, model robustness of these five tests are studied when the data actually came from t-distributions, but wrongly perceived as normal ones. Our detailed study based on a comprehensive simulation indicate some interesting results including the facts that the GPT is quite conservative, and the SSST is not as good as it has been claimed in the literature. To the best of our knowledge, the trends observed in our study have not been reported earlier in the existing literature.  相似文献   

6.
Cui  Ruifei  Groot  Perry  Heskes  Tom 《Statistics and Computing》2019,29(2):311-333

We consider the problem of causal structure learning from data with missing values, assumed to be drawn from a Gaussian copula model. First, we extend the ‘Rank PC’ algorithm, designed for Gaussian copula models with purely continuous data (so-called nonparanormal models), to incomplete data by applying rank correlation to pairwise complete observations and replacing the sample size with an effective sample size in the conditional independence tests to account for the information loss from missing values. When the data are missing completely at random (MCAR), we provide an error bound on the accuracy of ‘Rank PC’ and show its high-dimensional consistency. However, when the data are missing at random (MAR), ‘Rank PC’ fails dramatically. Therefore, we propose a Gibbs sampling procedure to draw correlation matrix samples from mixed data that still works correctly under MAR. These samples are translated into an average correlation matrix and an effective sample size, resulting in the ‘Copula PC’ algorithm for incomplete data. Simulation study shows that: (1) ‘Copula PC’ estimates a more accurate correlation matrix and causal structure than ‘Rank PC’ under MCAR and, even more so, under MAR and (2) the usage of the effective sample size significantly improves the performance of ‘Rank PC’ and ‘Copula PC.’ We illustrate our methods on two real-world datasets: riboflavin production data and chronic fatigue syndrome data.

  相似文献   

7.
Likelihood ratio tests are considered for two testing situations; testing for the homogeneity of k normal means against the alternative restricted by a simple tree ordering trend and testing the null hypothesis that the means satisfy the trend against all alternatives. Exact expressions are given for the power functions for k = 3 and 4 and unequal sample sizes, both for the case of known and unknown population variances, and approximations are discussed for larger k. Also, Bartholomew’s conjectures concerning minimal and maximal powers are investigated for the case of equal and unequal sample sizes. The power formulas are used to compute powers for a numerical example.  相似文献   

8.
Likelihood ratios (LRs) are used to characterize the efficiency of diagnostic tests. In this paper, we use the classical weighted least squares (CWLS) test procedure, which was originally used for testing the homogeneity of relative risks, for comparing the LRs of two or more binary diagnostic tests. We compare the performance of this method with the relative diagnostic likelihood ratio (rDLR) method and the diagnostic likelihood ratio regression (DLRReg) approach in terms of size and power, and we observe that the performances of CWLS and rDLR are the same when used to compare two diagnostic tests, while DLRReg method has higher type I error rates and powers. We also examine the performances of the CWLS and DLRReg methods for comparing three diagnostic tests in various sample size and prevalence combinations. On the basis of Monte Carlo simulations, we conclude that all of the tests are generally conservative and have low power, especially in settings of small sample size and low prevalence.  相似文献   

9.
Quade (1972, 1979) proposed a family of nonparametric tests based on weighted within-block rankings, for testing the hypothesis of no treatment effects in a complete randomized blocks layout. In this paper we give a table of the exact null distribution of these tests when the number of treatments is 3, the number of blocks is less than or equal to 14 and the block scores are linear. Moreover, a Monte Carlo study was performed to compare the powers of these tests with parametric and nonparametric competitors  相似文献   

10.
Eight goodness of fit tests are compared with respect to their simulated small sample power of detecting an inbreeding alternative to the Hardy-Weinberg null hypothesis. The Pearson's x 2 test is found to be most powerful, and the small rample levels of this test are close to the nominal (x 2) significance levels. The use of conditional expectations, rather than expected frequencies based on ML estimates, increases the power and improves thc x 2 fit to the true significance level. The small sample powers are also compared to the asymptotic (Pitman) pourer, based on the noncenlral x 2 distribution.  相似文献   

11.
A rank test based on the number of ‘near-matches’ among within-block rankings is proposed for stochastically ordered alternatives in a randomized block design with t treatments and b blocks. The asymptotic relative efficiency of this test with respect to the Page test is computed as number of blocks increases to infinity. A sequential analog of the above test procedure is also considered. A repeated significance test procedure is developed and average sample number is computed asymptotically under the null hypothesis as well as under a sequence of contiguous alternatives.  相似文献   

12.
For location–scale families, we consider a random distance between the sample order statistics and the quasi sample order statistics derived from the null distribution as a measure of discrepancy. The conditional qth quantile and expectation of the random discrepancy on the given sample are chosen as test statistics. Simulation results of powers against various alternatives are illustrated under the normal and exponential hypotheses for moderate sample size. The proposed tests, especially the qth quantile tests with a small or large q, are shown to be more powerful than other prominent goodness-of-fit tests in most cases.  相似文献   

13.
In statistical literature, the term ‘signed‐rank test’ (or ‘Wilcoxon signed‐rank test’) has been used to refer to two distinct tests: a test for symmetry of distribution and a test for the median of a symmetric distribution, sharing a common test statistic. To avoid potential ambiguity, we propose to refer to those two tests by different names, as ‘test for symmetry based on signed‐rank statistic’ and ‘test for median based on signed‐rank statistic’, respectively. The utility of such terminological differentiation should become evident through our discussion of how those tests connect and contrast with sign test and one‐sample t‐test. Published 2014. This article is a U.S. Government work and is in the public domain in the USA.  相似文献   

14.
We evaluated the properties of six statistical methods for testing equality among populations with zero-inflated continuous distributions. These tests are based on likelihood ratio (LR), Wald, central limit theorem (CLT), modified CLT (MCLT), parametric jackknife (PJ), and nonparametric jackknife (NPJ) statistics. We investigated their statistical properties using simulated data from mixed distributions with an unknown portion of non zero observations that have an underlying gamma, exponential, or log-normal density function and the remaining portion that are excessive zeros. The 6 statistical tests are compared in terms of their empirical Type I errors and powers estimated through 10,000 repeated simulated samples for carefully selected configurations of parameters. The LR, Wald, and PJ tests are preferred tests since their empirical Type I errors were close to the preset nominal 0.05 level and each demonstrated good power for rejecting null hypotheses when the sample sizes are at least 125 in each group. The NPJ test had unacceptable empirical Type I errors because it rejected far too often while the CLT and MCLT tests had low testing powers in some cases. Therefore, these three tests are not recommended for general use but the LR, Wald, and PJ tests all performed well in large sample applications.  相似文献   

15.
For the Poisson a posterior distribution for the complete sample size, N, is derived from an incomplete sample when any specified subset of the classes are missing.Means as well as other posterior characteristics of N are obtained for two examples with various classes removed. For the special case of a truncated ‘missing zero class’ Poisson sample a simulation experiment is performed for the small ‘N=25’ sample situation applying both Bayesian and maximum likelihood methods of estimation.  相似文献   

16.
The two-way two-levels crossed factorial design is a commonly used design by practitioners at the exploratory phase of industrial experiments. The F-test in the usual linear model for analysis of variance (ANOVA) is a key instrument to assess the impact of each factor and of their interactions on the response variable. However, if assumptions such as normal distribution and homoscedasticity of errors are violated, the conventional wisdom is to resort to nonparametric tests. Nonparametric methods, rank-based as well as permutation, have been a subject of recent investigations to make them effective in testing the hypotheses of interest and to improve their performance in small sample situations. In this study, we assess the performances of some nonparametric methods and, more importantly, we compare their powers. Specifically, we examine three permutation methods (Constrained Synchronized Permutations, Unconstrained Synchronized Permutations and Wald-Type Permutation Test), a rank-based method (Aligned Rank Transform) and a parametric method (ANOVA-Type Test). In the simulations, we generate datasets with different configurations of distribution of errors, variance, factor's effect and number of replicates. The objective is to elicit practical advice and guides to practitioners regarding the sensitivity of the tests in the various configurations, the conditions under which some tests cannot be used, the tradeoff between power and type I error, and the bias of the power on one main factor analysis due to the presence of effect of the other factor. A dataset from an industrial engineering experiment for thermoformed packaging production is used to illustrate the application of the various methods of analysis, taking into account the power of the test suggested by the objective of the experiment.  相似文献   

17.
We develop an omnibus two-sample test for ranked-set sampling (RSS) data. The test statistic is the conditional probability of seeing the observed sequence of ranks in the combined sample, given the observed sequences within the separate samples. We compare the test to existing tests under perfect rankings, finding that it can outperform existing tests in terms of power, particularly when the set size is large. The test does not maintain its level under imperfect rankings. However, one can create a permutation version of the test that is comparable in power to the basic test under perfect rankings and also maintains its level under imperfect rankings. Both tests extend naturally to judgment post-stratification, unbalanced RSS, and even RSS with multiple set sizes. Interestingly, the tests have no simple random sampling analog.  相似文献   

18.
Monte Carlo simulations are performed for a broad range of conditions. These simulations indicate that the powers of alternative tests under the generalized MANOVA model for small samples differ significantly, if a large reduction of the number of polynomial parameters is applied. The results show that, if the response covariance matrix ∑ is known, the best alternative is to use ∑. If, however, ∑ is unknown, substitution of an identity matrix for ∑ is recommended. This alternative usually results in a test with more power than the test with the usual estimate of ∑ employing covariates or the test with an estimate of E obtained from another sample.  相似文献   

19.
Two analysis of means type randomization tests for testing the equality of I variances for unbalanced designs are presented. Randomization techniques for testing statistical hypotheses can be used when parametric tests are inappropriate. Suppose that I independent samples have been collected. Randomization tests are based on shuffles or rearrangements of the (combined) sample. Putting each of the I samples ‘in a bowl’ forms the combined sample. Drawing samples ‘from the bowl’ forms a shuffle. Shuffles can be made with replacement (bootstrap shuffling) or without replacement (permutation shuffling). The tests that are presented offer two advantages. They are robust to non-normality and they allow the user to graphically present the results via a decision chart similar to a Shewhart control chart. A Monte Carlo study is used to verify that the permutation version of the tests exhibit excellent power when compared to other robust tests. The Monte Carlo study also identifies circumstances under which the popular Levene's test fails.  相似文献   

20.
This paper proposes a weighted sum of powers of variances test for detecting changes in variance of a data sequence. Asymptotic critical value formulas are derived for this test. The modified weighted sum of powers of variances test is also introduced so that the accuracy of change-point detection is highly improved for a sample of small size. Simulation studies and real data analysis are presented to assess the proposed tests.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号