首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
We consider the test based on theL 1-version of the Cramér-von Mises statistic for the nonparametric two-sample problem. Some quantiles of the exact distribution under H0 of the test statistic are computed for small sample sizes. We compare the test in terms of power against general alternatives to other two-sample tests, namely the Wilcoxon rank sum test, the Smirnov test and the Cramér-von Mises test in the case of unbalanced small sample sizes. The computation of the power is rather complicated when the sample sizes are unequal. Using Monte Carlo power estimates it turns out that the Smirnov test is more sensitive to non stochastically ordered alternatives than the new test. And under location-contamination alternatives the power estimates of the new test and of the competing tests are equal.  相似文献   

2.
This paper presents an investigation of the behavior of the levels of significance of the two-sample t and its related tests and the Mann-Whitney test when the samples are randomly drawn from mixtures of two normal populations (compound normals) and when the sample sizes are small (combined sample sizes ? 15). The use of the compound normal allows for investigation when the underlying populations are unequal, nonnormal, heterogeneous in variances, unimodal or bimodal, possessing smaller than normal kurtosis or containing contamination. The exact distribution of the t and its related tests are given. However, they are not readily amenable to calculations. Most of the numerical results presented were obtained by simulations  相似文献   

3.
C.L. Kaul  Kanwar Sen 《Statistics》2013,47(4):527-553
Former results on BAHADUR efficiency of signed rank tests are carried over to the class of two-sample rank tests. It is shown that the two-sample rank tests are asymptotically optimal at alternatives far away from the hypothesis under fairly general conditions. Surprisingly, the median test appears to be optimal only in case of equal sample sizes.  相似文献   

4.
In this article, we describe a new approach to compare the power of different tests for normality. This approach provides the researcher with a practical tool for evaluating which test at their disposal is the most appropriate for their sampling problem. Using the Johnson systems of distribution, we estimate the power of a test for normality for any mean, variance, skewness, and kurtosis. Using this characterization and an innovative graphical representation, we validate our method by comparing three well-known tests for normality: the Pearson χ2 test, the Kolmogorov–Smirnov test, and the D'Agostino–Pearson K 2 test. We obtain such comparison for a broad range of skewness, kurtosis, and sample sizes. We demonstrate that the D'Agostino–Pearson test gives greater power than the others against most of the alternative distributions and at most sample sizes. We also find that the Pearson χ2 test gives greater power than Kolmogorov–Smirnov against most of the alternative distributions for sample sizes between 18 and 330.  相似文献   

5.
Conditional Studentized Survival Tests for Randomly Censored Models   总被引:1,自引:0,他引:1  
It is shown that in the case of heterogenous censoring distributions Studentized survival tests can be carried out as conditional permutation tests given the order statistics and their censoring status. The result is based on a conditional central limit theorem for permutation statistics. It holds for linear test statistics as well as for sup-statistics. The procedure works under one of the following general circumstances for the two-sample problem: the unbalanced sample size case, highly censored data, certain non-convergent weight functions or under alternatives. For instance, the two-sample log rank test can be carried out asymptotically as a conditional test if the relative amount of uncensored observations vanishes asymptotically as long as the number of uncensored observations becomes infinite. Similar results hold whenever the sample sizes and are unbalanced in the sense that and hold.  相似文献   

6.
In ranked-set sampling (RSS), a stratification by ranks is used to obtain a sample that tends to be more informative than a simple random sample of the same size. Previous work has shown that if the rankings are perfect, then one can use RSS to obtain Kolmogorov–Smirnov type confidence bands for the CDF that are narrower than those obtained under simple random sampling. Here we develop Kolmogorov–Smirnov type confidence bands that work well whether the rankings are perfect or not. These confidence bands are obtained by using a smoothed bootstrap procedure that takes advantage of special features of RSS. We show through a simulation study that the coverage probabilities are close to nominal even for samples with just two or three observations. A new algorithm allows us to avoid the bootstrap simulation step when sample sizes are relatively small.  相似文献   

7.
This paper presents methods for checking the goodness-of-fit of the additive risk model with p(> 2)-dimensional time-invariant covariates. The procedures are an extension of Kim and Lee (1996) who developed a test to assess the additive risk assumption for two-sample censored data. We apply the proposed tests to survival data from South Wales nikel refinery workers. Simulation studies are carried out to investigate the performance of the proposed tests for practical sample sizes.  相似文献   

8.
The maximum vertical distance between a receiver operating characteristic (ROC) curve and its chance diagonal is a common measure of effectiveness of the classifier that gives rise to this curve. This measure is known to be equivalent to a two-sample Kolmogorov–Smirnov statistic; so the absolute difference D between two such statistics is often used informally as a measure of difference between the corresponding classifiers. A significance test of D is of great practical interest, but the available Kolmogorov–Smirnov distribution theory precludes easy analytical construction of such a significance test. We, therefore, propose a Monte Carlo procedure for conducting the test, using the binormal model for the underlying ROC curves. We provide Splus/R routines for the computation, tabulate the results for a number of illustrative cases, apply the methods to some practical examples and discuss some implications.  相似文献   

9.
We examine the sizes and powers of three tests of convergence of Markov Chain Monte Carlo draws: the Kolmogorov–Smirnov test, fluctuation test, and Geweke's test. We show that the sizes and powers are sensitive to the existence of autocorrelation in the draws. We propose a filtered test that is corrected for autocorrelation. We present a numerical illustration using the Federal funds rate.  相似文献   

10.
The two-sample, distribution-free statistics of Smirnov (1939) are used to define a new statistic. While the Smirnov statistics are used as a general goodness-of-fit test, a distribution-free scale test based on this new statistic is developed. It is shown that this new test has higher power than the two-sided Smirnov statistic in detecting differences in scale for some symmetric distributions with equal means/medians. The critical values of the proposed test statistic and its limiting distribution are given  相似文献   

11.
For the non-parametric two-sample location problem, adaptive tests based on a selector statistic are compared with a maximum and a sum test, respectively. When the class of all continuous distributions is not restricted, the sum test is not a robust test, i.e. it does not have a relatively high power across the different possible distributions. However, according to our simulation results, the adaptive tests as well as the maximum test are robust. For a small sample size, the maximum test is preferable, whereas for a large sample size the comparison between the adaptive tests and the maximum test does not show a clear winner. Consequently, one may argue in favour of the maximum test since it is a useful test for all sample sizes. Furthermore, it does not need a selector and the specification of which test is to be performed for which values of the selector. When the family of possible distributions is restricted, the maximin efficiency robust test may be a further robust alternative. However, for the family of t distributions this test is not as powerful as the corresponding maximum test.  相似文献   

12.

We address the testing problem of proportional hazards in the two-sample survival setting allowing right censoring, i.e., we check whether the famous Cox model is underlying. Although there are many test proposals for this problem, only a few papers suggest how to improve the performance for small sample sizes. In this paper, we do exactly this by carrying out our test as a permutation as well as a wild bootstrap test. The asymptotic properties of our test, namely asymptotic exactness under the null and consistency, can be transferred to both resampling versions. Various simulations for small sample sizes reveal an actual improvement of the empirical size and a reasonable power performance when using the resampling versions. Moreover, the resampling tests perform better than the existing tests of Gill and Schumacher and Grambsch and Therneau . The tests’ practical applicability is illustrated by discussing real data examples.

  相似文献   

13.
Random samples are assumed for the univariate two-sample problem. Sometimes this assumption may be violated in that an observation in one “sample”, of size m, is from a population different from that yielding the remaining m—1 observations (which are a random sample). Then, the interest is in whether this random sample of size m—1 is from the same population as the other random sample. If such a violation occurs and can be recognized, and also the non-conforming observation can be identified (without imposing conditional effects), then that observation could be removed and a two-sample test applied to the remaining samples. Unfortunately, satisfactory procedures for such a removal do not seem to exist. An alternative approach is to use two-sample tests whose significance levels remain the same when a non-conforming observation occurs, and is removed, as for the case where the samples were both truly random. The equal-tail median test is shown to have this property when the two “samples” are of the same size (and ties do not occur).  相似文献   

14.
The size of the two-sample t test is generally thought to be robust against nonnormal distributions if the sample sizes are large. This belief is based on central limit theory, and asymptotic expansions of the moments of the t statistic suggest that robustness may be improved for moderate sample sizes if the variance, skewness, and kurtosis of the distributions are matched, particularly if the sample sizes are also equal.

It is shown that asymptotic arguments such as these can be misleading and that, in fact, the size of the t test can be as large as unity if the distributions are allowed to be completely arbitrary. Restricting the distributions to be identical or symmetric (but otherwise arbitrary) does not guarantee that the size can be controlled either, but controlling the tail-heaviness of the distributions does. The last result is proved more generally for the k-sample F test.  相似文献   

15.
We develop both nonparametric and parametric methods for obtaining prediction bands for the empirical distribution function (EDF) of a future sample. These methods yield simultaneous prediction intervals for all order statistics of the future sample, and they also correspond to tests for the two-sample problem. The nonparametric prediction bands correspond to the two-sample Kolmogorov-Smirnov test and related nonparametric tests, but the parametric prediction bands correspond to entirely new parametric two-sample tests. The parametric prediction bands tend to outperform the nonparametric bands when the parametric assumptions hold, but they may have true coverage probabilities well below their nominal levels when the parametric assumptions fail. A new computational algorithm is used to obtain critical values in the nonparametric case.  相似文献   

16.
When testing hypotheses in two-sample problem, the Lepage test statistic is often used to jointly test the location and scale parameters, and this test statistic has been discussed by many authors over the years. Since two-sample nonparametric testing plays an important role in biometry, the Cucconi test statistic is generalized to the location, scale, and location–scale parameters in two-sample problem. The limiting distribution of the suggested test statistic is derived under the hypotheses. Deriving the exact critical value of the test statistic is difficult when the sample sizes are increased. A gamma approximation is used to evaluate the upper tail probability for the proposed test statistic given finite sample sizes. The asymptotic efficiencies of the proposed test statistic are determined for various distributions. The consistency of the original Cucconi test statistic is shown on the specific cases. Finally, the original Cucconi statistic is discussed in the theory of ties.  相似文献   

17.
We develop an omnibus two-sample test for ranked-set sampling (RSS) data. The test statistic is the conditional probability of seeing the observed sequence of ranks in the combined sample, given the observed sequences within the separate samples. We compare the test to existing tests under perfect rankings, finding that it can outperform existing tests in terms of power, particularly when the set size is large. The test does not maintain its level under imperfect rankings. However, one can create a permutation version of the test that is comparable in power to the basic test under perfect rankings and also maintains its level under imperfect rankings. Both tests extend naturally to judgment post-stratification, unbalanced RSS, and even RSS with multiple set sizes. Interestingly, the tests have no simple random sampling analog.  相似文献   

18.
When observational data are used to compare treatment-specific survivals, regular two-sample tests, such as the log-rank test, need to be adjusted for the imbalance between treatments with respect to baseline covariate distributions. Besides, the standard assumption that survival time and censoring time are conditionally independent given the treatment, required for the regular two-sample tests, may not be realistic in observational studies. Moreover, treatment-specific hazards are often non-proportional, resulting in small power for the log-rank test. In this paper, we propose a set of adjusted weighted log-rank tests and their supremum versions by inverse probability of treatment and censoring weighting to compare treatment-specific survivals based on data from observational studies. These tests are proven to be asymptotically correct. Simulation studies show that with realistic sample sizes and censoring rates, the proposed tests have the desired Type I error probabilities and are more powerful than the adjusted log-rank test when the treatment-specific hazards differ in non-proportional ways. A real data example illustrates the practical utility of the new methods.  相似文献   

19.
Recently, a new non-randomized parallel design is proposed by Tian (2013) for surveys with sensitive topics. However, the sample size formulae associated with testing hypotheses for the parallel model are not yet available. As a crucial component in surveys, the sample size formulae with the parallel design are developed in this paper by using the power analysis method for both the one- and two-sample problems. We consider both the one- and two-sample problems. The asymptotic power functions and the corresponding sample size formulae for both the one- and two-sided tests based on the large-sample normal approximation are derived. The performance is assessed through comparing the asymptotic power with the exact power and reporting the ratio of the sample sizes with the parallel model and the design of direct questioning. We numerically compare the sample sizes needed for the parallel design with those required for the crosswise and triangular models. Two theoretical justifications are also provided. An example from a survey on ‘sexual practices’ in San Francisco, Las Vegas and Portland is used to illustrate the proposed methods.  相似文献   

20.
The present study investigates the performance of Johnson's transformation trimmed t statistic, Welch's t test, Yuen's trimmed t , Johnson's transformation untrimmed t test, and the corresponding bootstrap methods for the two-sample case with small/unequal sample sizes when the distribution is non-normal and variances are heterogeneous. The Monte Carlo simulation is conducted in two-sided as well as one-sided tests. When the variance is proportional to the sample size, Yuen's trimmed t is as good as Johnson's transformation trimmed t . However, when the variance is disproportional to the sample size, the bootstrap Yuen's trimmed t and the bootstrap Johnson's transformation trimmed t are recommended in one-sided tests. For two-sided tests, Johnson's transformation trimmed t is not only valid but also powerful in comparison to the bootstrap methods.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号