首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
The present study investigates the performance of Johnson's transformation trimmed t statistic, Welch's t test, Yuen's trimmed t , Johnson's transformation untrimmed t test, and the corresponding bootstrap methods for the two-sample case with small/unequal sample sizes when the distribution is non-normal and variances are heterogeneous. The Monte Carlo simulation is conducted in two-sided as well as one-sided tests. When the variance is proportional to the sample size, Yuen's trimmed t is as good as Johnson's transformation trimmed t . However, when the variance is disproportional to the sample size, the bootstrap Yuen's trimmed t and the bootstrap Johnson's transformation trimmed t are recommended in one-sided tests. For two-sided tests, Johnson's transformation trimmed t is not only valid but also powerful in comparison to the bootstrap methods.  相似文献   

2.
In this paper, a new design-oriented two-stage two-sided simultaneous confidence intervals, for comparing several exponential populations with control population in terms of location parameters under heteroscedasticity, are proposed. If there is a prior information that the location parameter of k exponential populations are not less than the location parameter of control population, one-sided simultaneous confidence intervals provide more inferential sensitivity than two-sided simultaneous confidence intervals. But the two-sided simultaneous confidence intervals have advantages over the one-sided simultaneous confidence intervals as they provide both lower and upper bounds for the parameters of interest. The proposed design-oriented two-stage two-sided simultaneous confidence intervals provide the benefits of both the two-stage one-sided and two-sided simultaneous confidence intervals. When the additional sample at the second stage may not be available due to the experimental budget shortage or other factors in an experiment, one-stage two-sided confidence intervals are proposed, which combine the advantages of one-stage one-sided and two-sided simultaneous confidence intervals. The critical constants are obtained using the techniques given in Lam [9,10]. These critical constant are compared with the critical constants obtained by Bonferroni inequality techniques and found that critical constant obtained by Lam [9,10] are less conservative than critical constants computed from the Bonferroni inequality technique. Implementation of the proposed simultaneous confidence intervals is demonstrated by a numerical example.  相似文献   

3.
Editor's Report     
There are two common methods for statistical inference on 2 × 2 contingency tables. One is the widely taught Pearson chi-square test, which uses the well-known χ2statistic. The chi-square test is appropriate for large sample inference, and it is equivalent to the Z-test that uses the difference between the two sample proportions for the 2 × 2 case. Another method is Fisher’s exact test, which evaluates the likelihood of each table with the same marginal totals. This article mathematically justifies that these two methods for determining extreme do not completely agree with each other. Our analysis obtains one-sided and two-sided conditions under which a disagreement in determining extreme between the two tests could occur. We also address the question whether or not their discrepancy in determining extreme would make them draw different conclusions when testing homogeneity or independence. Our examination of the two tests casts light on which test should be trusted when the two tests draw different conclusions.  相似文献   

4.
This paper investigates methodologies for evaluating the probabilistic value (P-value) of the Kolmogorov–Smirnov (K–S) goodness-of-fit test using algorithmic program development implemented in Microsoft® Visual Basic® (VB). Six methods were examined for the one-sided one-sample and two methods for the two-sided one-sample cumulative sampling distributions in the investigative software implementation that was based on machine-precision arithmetic. For sample sizes n≤2000 considered, results from the Smirnov iterative method found optimal accuracy for K–S P-values≥0.02, while those from the SmirnovD were more accurate for lower P-values for the one-sided one-sample distribution statistics. Also, the Durbin matrix method sustained better P-value results than the Durbin recursion method for the two-sided one-sample tests up to n≤700 sample sizes. Based on these results, an algorithm for Microsoft Excel® function was proposed from which a model function was developed and its implementation was used to test the performance of engineering students in a general engineering course across seven departments.  相似文献   

5.
Summary.  We propose 'Dunnett-type' test procedures to test for simple tree order restrictions on the means of p independent normal populations. The new tests are based on the estimation procedures that were introduced by Hwang and Peddada and later by Dunbar, Conaway and Peddada. The procedures proposed are also extended to test for 'two-sided' simple tree order restrictions. For non-normal data, nonparametric versions based on ranked data are also suggested. Using computer simulations, we compare the proposed test procedures with some existing test procedures in terms of size and power. Our simulation study suggests that the procedures compete well with the existing procedures for both one-sided and two-sided simple tree alternatives. In some instances, especially in the case of two-sided alternatives or for non-normally distributed data, the gains in power due to the procedures proposed can be substantial.  相似文献   

6.
Some simple test procedures are considered for comparing several group means with a standard value when the data are in a one-way layout. The underlying distributions are assumed to be normal with possibly unequal variances. The tests are based on a union-intersection formulation and can be applied in a form similar to a Shewhart control chart. Both two-sided and one-sided alternatives are considered. The power of the tests can be obtained from tables of a non-central t distribution. Implementation of the tests is illustrated with a numerical example. The tests help identify any group means different from the standard and might lead to a decision about rejecting the null hypothesis before all the group means are observed. The resulting savings in time and resources might be valuable in applications where the number of groups is large and the cost of acquiring data is high. For situations where the normality assumption is untenable, a non-parametric procedure, based on one-sample sign tests is considered.  相似文献   

7.
Bayesian alternatives to classical tests for several testing problems are considered. One-sided and two-sided sets of hypotheses are tested concerning an exponential parameter, a Binomial proportion, and a normal mean. Hierarchical Bayes and noninformative Bayes procedures are compared with the appropriate classical procedure, either the uniformly most powerful test or the likelihood ratio test, in the different situations. The hierarchical prior employed is the conjugate prior at the first stage with the mean being the test parameter and a noninformative prior at the second stage for the hyper parameter(s) of the first stage prior. Fair comparisons are attempted in which fair means the likelihood of making a type I error is approximately the same for the different testing procedures; once this condition is satisfied, the power of the different tests are compared, the larger the power, the better the test. This comparison is difficult in the two-sided case due to the unsurprising discrepancy between Bayesian and classical measures of evidence that have been discussed for years. The hierarchical Bayes tests appear to compete well with the typical classical test in the one-sided cases.  相似文献   

8.
ABSTRACT

A statistical test can be seen as a procedure to produce a decision based on observed data, where some decisions consist of rejecting a hypothesis (yielding a significant result) and some do not, and where one controls the probability to make a wrong rejection at some prespecified significance level. Whereas traditional hypothesis testing involves only two possible decisions (to reject or not a null hypothesis), Kaiser’s directional two-sided test as well as the more recently introduced testing procedure of Jones and Tukey, each equivalent to running two one-sided tests, involve three possible decisions to infer the value of a unidimensional parameter. The latter procedure assumes that a point null hypothesis is impossible (e.g., that two treatments cannot have exactly the same effect), allowing a gain of statistical power. There are, however, situations where a point hypothesis is indeed plausible, for example, when considering hypotheses derived from Einstein’s theories. In this article, we introduce a five-decision rule testing procedure, equivalent to running a traditional two-sided test in addition to two one-sided tests, which combines the advantages of the testing procedures of Kaiser (no assumption on a point hypothesis being impossible) and Jones and Tukey (higher power), allowing for a nonnegligible (typically 20%) reduction of the sample size needed to reach a given statistical power to get a significant result, compared to the traditional approach.  相似文献   

9.
In practice the [ILM0002]-chart has been augmented with one or more zone tests to improve the sensitivity of detecting small shifts in the process mean. The average run length (ARL) is one of the indices used to evaluate the performance of control chart procedures. Two unified patterns of transition probability matrices and four closed form expressions of the ARL are found based on the Markov Chain approach. These closed form expressions can be used for computing the ARL for both one-sided and two-sided [ILM0003]-charts with zone tests.  相似文献   

10.
The present paper relates to Holm's normal probability paper test for simple hypotheses. It is shown how to obtain the statistics for the one-sided and two-sided tests using normal probability paper. Tables of critical values of these test statistics are given for practical use of the test; the equations for calculating the distribution functions of the test statistics are listed. Two different methods lead to these equations; but details are beyond the scope of the present paper and the interested reader is referred to another paper.  相似文献   

11.
Several authors have taken the worst case breakdown measures in analyzing the robustness of a test. In general, these kinds of measures give only a rough picture of breakdown robustness of a test. To overcome this limitation, a new kind of breakdown measure of a test is defined as the smallest proportion of arbitrary outliers in the sample that can distort the test decision. It is called as the sample breakdown point of a test in this paper. A distinct advantage of this new measure is that it is directly concerned with the test decision based on the present sample and with the critical region of the test. The sample breakdown points of several commonly used tests of one-sided or two-sided hypotheses are calculated and their asymptotic properties are also established. By Monte Carlo simulations and asymptotic analysis, we show that the acceptance breakdown of the t-test and the Hotelling T2-test is slightly better than that of the sample mean test. Finally, we prove that, for a one-sided hypothesis testing of location, the sign test has the maximum sample breakdown points asymptotically within a class of M-tests and score-tests.  相似文献   

12.
We present a method to generalise the scope of application of group sequential tests designed for equally sized groups of normal observations with known variance. Preserving the significance levels against which standardised statistics are compared leads to tests for unequally grouped data which maintain Type I error probabilities to a high degree of accuracy. The same approach can be followed when observations have unknown variance by setting critical values for Studentised statistics at percentiles of the appropriate t-distributions. This significance level approach is equally applicable to group sequential one-sided tests and two-sided tests, possibly with early stopping permitted to accept the null hypothesis. In applications to equivalence testing, tests are required to maintain a specified power, rather than Type I error rate: such tests can be constructed by defining the standardised test statistics used in the significance level approach with respect to appropriately chosen hypotheses.  相似文献   

13.
Fisher's exact test, difference in proportions, log odds ratio, Pearson's chi-squared, and likelihood ratio are compared as test statistics for testing independence of two dichotomous factors when the associated p values are computed by using the conditional distribution given the marginals. The statistics listed above that can be used for a one-sided alternative give identical p values. For a two-sided alternative, many of the above statistics lead to different p values. The p values are shown to differ only by which tables in the opposite tail from the observed table are considered more extreme than the observed table.  相似文献   

14.
For testing a one-sided hypothesis in a one-parameter family of distributions, it is shown that the generalized likelihood ratio (GLR) test coincides with the uniformly most powerful (UMP) test, assuming certain monotonicity properties for the likelihood function. In particular, the equivalence of GLR tests and UMP tests holds for one-parameter exponential families. In addition, the relationship between GLR and UMPU (UMP unbiased) tests is considered when testing two-sided hypotheses.  相似文献   

15.
ABSTRACT

This article examines the evidence contained in t statistics that are marginally significant in 5% tests. The bases for evaluating evidence are likelihood ratios and integrated likelihood ratios, computed under a variety of assumptions regarding the alternative hypotheses in null hypothesis significance tests. Likelihood ratios and integrated likelihood ratios provide a useful measure of the evidence in favor of competing hypotheses because they can be interpreted as representing the ratio of the probabilities that each hypothesis assigns to observed data. When they are either very large or very small, they suggest that one hypothesis is much better than the other in predicting observed data. If they are close to 1.0, then both hypotheses provide approximately equally valid explanations for observed data. I find that p-values that are close to 0.05 (i.e., that are “marginally significant”) correspond to integrated likelihood ratios that are bounded by approximately 7 in two-sided tests, and by approximately 4 in one-sided tests.

The modest magnitude of integrated likelihood ratios corresponding to p-values close to 0.05 clearly suggests that higher standards of evidence are needed to support claims of novel discoveries and new effects.  相似文献   

16.
The effect of rejecting a two-sided preliminary test of significance for the mean of a normal distribution upon subsequent interval estimation of the mean is examined. For the case where the variance is known, conditional confidence intervals may be shorter than unconditional intervals, in contrast to the one-sided preliminary test case examined by Meeks and D’Agostino (1983, The American Statistician, 7, 134-136) . For the case where the variance is unknown and must be estimated by the sample variance, it is shown that customary intervals do not offer uniformly greater or lesser coverage than the nominal level.  相似文献   

17.
C. Ittrich 《Statistics》2013,47(1):13-42
Nonlinear regression models with spherically symmetric error vectors and a single nonlinear parameter are considered. On the basis of a new geometric approach, exact one- and two-sided tests and confidence regions for the nonlinear parameter are derived in the cases of known and unknown error variances. A geometric measure representation formula is used to determine the power functions of the tests if the error variance is known and to derive different lower bounds for the power function of a one-sided test in the case of an unknown error variance. The latter can be done quite effectively by constructing and measuring several balls inside the critical region. A numerical study compares the results for different density generating functions of the error distribution.  相似文献   

18.
The classic conditional test for checking that the difference between two independent proportions is not null may not be appropriate in many circumstances. Dunnett & Gent (1977) showed that in clinical trials, in studies of drugs, etc, the aim is to prove the practical equality (equivalence) of both proportions. On other occasions the aim may be the opposite: i.e. to prove that the two proportions are substantially different (biologically significant). Both cases are usually solved by two one-sided tests (TOST test). In this article, this procedure is shown to be conservative and two true two-sided tests for each case are proposed.  相似文献   

19.
There are a number of situations in which the experimental data observed are record statistics. In this paper, optimal confidence intervals as well as uniformly most powerful (MP) tests for one-sided alternatives are developed. Since a uniformly MP test for a two-sided alternative does not exist, generalized likelihood ratio and uniformly unbiased and invariant tests are derived for the two parameters of the exponential distribution based on record data. For illustrative purposes, a data set on the times between consecutive telephone calls to a company's switchboard is analysed using the proposed procedures. Finally, some open problems in this direction are pointed out.  相似文献   

20.
To increase the sensitivity of Shewhart control charts in detecting small process shifts sensitizing rules based on runs and scans are often used in practice. Shewhart control charts supplemented with runs rules for detecting shifts in process variance have not received as much attention as their counterparts for detecting shifts in process mean. In this article, we examine the performance of simple runs rules schemes for monitoring increases and/or decreases in process variance based on the sample standard deviation. We introduce one-sided S charts that overcome the weakness of high false-alarm rates when runs rules are added to a Shewhart control chart. The average run length performance and design aspects of the charts are studied thoroughly. The performance of associated two-sided control schemes is investigated as well.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号