期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Approximate and exact distributions of rank tests for balanced incomplete block designs

Mayer Alvo Paul Cabilio 《统计学通讯:理论与方法》2013,42(12):3073-3121

Judges rank k out of t objects according to m replic ations of abasic balanced incomplete block design with bblocks. In Alvo and Cabilio(1991),it is shown that the Durbin test, which is the usual test in this situation, can be written in terms of Spearman correlations between the blocks, and using a Kendall correlation, they generated a new statistic for this situation.This Kendall tau based statistic has a richer support than the Durbin statistic, and is at least as efficient.In the present paper,exact and simulation based tables are generated for both statistics, and various approximations to these null distributions are considered and compared. 相似文献

2.

Testing homogeneity in a heteroscedastic contaminated normal mixture

Xiaoqing Niu Pengfei Li 《Journal of applied statistics》2019,46(8):1478-1491

Large-scale simultaneous hypothesis testing appears in many areas. A well-known inference method is to control the false discovery rate. One popular approach is to model the z-scores derived from the individual t-tests and then use this model to control the false discovery rate. We propose a heteroscedastic contaminated normal mixture to describe the distribution of z-scores and design an EM-test for testing homogeneity in this class of mixture models. The proposed EM-test can be used to investigate whether a collection of z-scores has arisen from a single normal distribution or whether a heteroscedastic contaminated normal mixture is more appropriate. We show that the EM-test statistic has a shifted mixture of chi-squared limiting distribution. Simulation results show that the proposed testing procedure has accurate type-I error and significantly larger power than its competitors under a variety of model specifications. A real-data example is analysed to exemplify the application of the proposed method. 相似文献

3.

A nonparametric procedure for the analysis of balanced crossover designs

Serge Tardif Franois Bellavance Constance Van Eeden 《Revue canadienne de statistique》2005,33(4):471-488

The authors propose nonparametric tests for the hypothesis of no direct treatment effects, as well as for the hypothesis of no carryover effects, for balanced crossover designs in which the number of treatments equals the number of periods p, where p ≥ 3. They suppose that the design consists of n replications of balanced crossover designs, each formed by m Latin squares of order p. Their tests are permutation tests which are based on the n vectors of least squares estimators of the parameters of interest obtained from the n replications of the experiment. They obtain both the exact and limiting distribution of the test statistics, and they show that the tests have, asymptotically, the same power as the F‐ratio test. 相似文献

4.

Designing cancer immunotherapy trials with delayed treatment effect using maximin efficiency robust statistics

Xue Ding Jianrong Wu 《Pharmaceutical statistics》2020,19(4):424-435

The indirect mechanism of action of immunotherapy causes a delayed treatment effect, producing delayed separation of survival curves between the treatment groups, and violates the proportional hazards assumption. Therefore using the log‐rank test in immunotherapy trial design could result in a severe loss efficiency. Although few statistical methods are available for immunotherapy trial design that incorporates a delayed treatment effect, recently, Ye and Yu proposed the use of a maximin efficiency robust test (MERT) for the trial design. The MERT is a weighted log‐rank test that puts less weight on early events and full weight after the delayed period. However, the weight function of the MERT involves an unknown function that has to be estimated from historical data. Here, for simplicity, we propose the use of an approximated maximin test, the V₀ test, which is the sum of the log‐rank test for the full data set and the log‐rank test for the data beyond the lag time point. The V₀ test fully uses the trial data and is more efficient than the log‐rank test when lag exits with relatively little efficiency loss when no lag exists. The sample size formula for the V₀ test is derived. Simulations are conducted to compare the performance of the V₀ test to the existing tests. A real trial is used to illustrate cancer immunotherapy trial design with delayed treatment effect. 相似文献

5.

Comparative Analyses of Pretest-Posttest Research Designs

Donna R. Brogan Michael H. Kutner 《The American statistician》2013,67(4):229-232

Two common methods of analyzing data from a two-group pretest-posttest research design are (a) two-sample t test on the difference score between pretest and posttest and (b) repeated-measures/split-plot analysis of variance. The repeated-measures/split-plot analysis subsumes the t test analysis, although the former requires more assumptions to be satisfied. A numerical example is given to illustrate some of the equivalences of the two methods of analysis. The investigator should choose the method of analysis based on the research objective(s). 相似文献

6.

Optimum Design for Type-I Step-stress Accelerated Life Tests of Two-parameter Weibull Distributions

《统计学通讯:理论与方法》2012,41(21):3863-3877

In this article, we focus on the general k-step step-stress accelerated life tests with Type-I censoring for two-parameter Weibull distributions based on the tampered failure rate (TFR) model. We get the optimum design for the tests under the criterion of the minimization of the asymptotic variance of the maximum likelihood estimate of the pth percentile of the lifetime under the normal operating conditions. Optimum test plans for the simple step-stress accelerated life tests under Type-I censoring are developed for the Weibull distribution and the exponential distribution in particular. Finally, an example is provided to illustrate the proposed design and a sensitivity analysis is conducted to investigate the robustness of the design. 相似文献

7.

Locally minimax tests for multiple correlations

N. Giri 《Revue canadienne de statistique》1979,7(1):53-60

Let X be a normally distributed p-dimensional column vector with mean μ and positive definite covariance matrix σ. and let X α, α = 1,…, N, be a random sample of size N from this distribution. Partition X as ( X ₁, X ₍₂₎', X '₍₃₎)', where X₁ is one-dimension, X₍₂₎ is p₂- dimensional, and so 1 + p₁ + p₂ = p. Let ρ₁ and ρ be the multiple correlation coefficients of X₁ with X₍₂₎ and with ( X '₍₂₎, X '₍₃₎)', respectively. Write ρ2/2 = ρ² - ρ2/1. We shall cosider the following two problems 相似文献

8.

A comparison of several adaptive tests for paired data

《Journal of Statistical Computation and Simulation》2012,82(9):1083-1093

In the last few years, two adaptive tests for paired data have been proposed. One test proposed by Freidlin et al. [On the use of the Shapiro–Wilk test in two-stage adaptive inference for paired data from moderate to very heavy tailed distributions, Biom. J. 45 (2003), pp. 887–900] is a two-stage procedure that uses a selection statistic to determine which of three rank scores to use in the computation of the test statistic. Another statistic, proposed by O'Gorman [Applied Adaptive Statistical Methods: Tests of Significance and Confidence Intervals, Society for Industrial and Applied Mathematics, Philadelphia, 2004], uses a weighted t-test with the weights determined by the data. These two methods, and an earlier rank-based adaptive test proposed by Randles and Hogg [Adaptive Distribution-free Tests, Commun. Stat. 2 (1973), pp. 337–356], are compared with the t-test and to Wilcoxon's signed-rank test. For sample sizes between 15 and 50, the results show that the adaptive test proposed by Freidlin et al. and the adaptive test proposed by O'Gorman have higher power than the other tests over a range of moderate to long-tailed symmetric distributions. The results also show that the test proposed by O'Gorman has greater power than the other tests for short-tailed distributions. For sample sizes greater than 50 and for small sample sizes the adaptive test proposed by O'Gorman has the highest power for most distributions. 相似文献

9.

Testing for normality in linear regression models

《Journal of Statistical Computation and Simulation》2012,82(10):1101-1113

The importance of the normal distribution for fitting continuous data is well known. However, in many practical situations data distribution departs from normality. For example, the sample skewness and the sample kurtosis are far away from 0 and 3, respectively, which are nice properties of normal distributions. So, it is important to have formal tests of normality against any alternative. D'Agostino et al. [A suggestion for using powerful and informative tests of normality, Am. Statist. 44 (1990), pp. 316–321] review four procedures Z ²(g ₁), Z ²(g ₂), D and K ² for testing departure from normality. The first two of these procedures are tests of normality against departure due to skewness and kurtosis, respectively. The other two tests are omnibus tests. An alternative to the normal distribution is a class of skew-normal distributions (see [A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Statist. 12 (1985), pp. 171–178]). In this paper, we obtain a score test (W) and a likelihood ratio test (LR) of goodness of fit of the normal regression model against the skew-normal family of regression models. It turns out that the score test is based on the sample skewness and is of very simple form. The performance of these six procedures, in terms of size and power, are compared using simulations. The level properties of the three statistics LR, W and Z ²(g ₁) are similar and close to the nominal level for moderate to large sample sizes. Also, their power properties are similar for small departure from normality due to skewness (γ₁≤0.4). Of these, the score test statistic has a very simple form and computationally much simpler than the other two statistics. The LR statistic, in general, has highest power, although it is computationally much complex as it requires estimates of the parameters under the normal model as well as those under the skew-normal model. So, the score test may be used to test for normality against small departure from normality due to skewness. Otherwise, the likelihood ratio statistic LR should be used as it detects general departure from normality (due to both skewness and kurtosis) with, in general, largest power. 相似文献

10.

A test of fit for lattice distributions

T.W. Epps 《统计学通讯:理论与方法》2013,42(6):1455-1479

相似文献

11.

On the Exact Size of Tests of Treatment Effects in Multi‐Arm Clinical Trials

Chris J. Lloyd 《Australian & New Zealand Journal of Statistics》2014,56(4):359-369

When testing treatment effects in multi‐arm clinical trials, the Bonferroni method or the method of Simes 1986) is used to adjust for the multiple comparisons. When control of the family‐wise error rate is required, these methods are combined with the close testing principle of Marcus et al. (1976). Under weak assumptions, the resulting p‐values all give rise to valid tests provided that the basic test used for each treatment is valid. However, standard tests can be far from valid, especially when the endpoint is binary and when sample sizes are unbalanced, as is common in multi‐arm clinical trials. This paper looks at the relationship between size deviations of the component test and size deviations of the multiple comparison test. The conclusion is that multiple comparison tests are as imperfect as the basic tests at nominal size α/m where m is the number of treatments. This, admittedly not unexpected, conclusion implies that these methods should only be used when the component test is very accurate at small nominal sizes. For binary end‐points, this suggests use of the parametric bootstrap test. All these conclusions are supported by a detailed numerical study. 相似文献

12.

An algorithm for maximum likelihood estimation in incomplete block variance component models

《Journal of Statistical Computation and Simulation》2012,82(3):237-251

In the recovery of interblock information to improve the treatment differences estimates in incomplete block designs, the parameter p is usually unknown. Many authors have worked on the problem of estimating it and of studying its properties together with the properties of the treatment differences estimates. In this paper a numerically efficient algorithm is developed which yields the maximum likelihood estimates (MLE) of all the parameters in the mixed incomplete block design model (treatment effects, ρ and variance) 相似文献

13.

A power study of k-linear-r-ahead recursive residuals test for change-point in finite sequences

《Journal of Statistical Computation and Simulation》2012,82(12):1201-1213

A change-point problem in finite sequences is considered along with, so-called, k-linear-r-ahead recursive residuals and a test procedure proposed by ?o?a¸d? et al. [?o?a¸d?, J.A., Szkutnik, Z., Majerczak, J. and Duda, K. 1998, Detection of change point in oxygen uptake during an incremental exercise test using recursive residuals: relationship to the plasma lactate accumulation and blood acid base balance. European Journal of Applied Physiology, 78, 369–377.]. Theoretical significance levels of that (conservative) test are compared with its simulated sizes. Numerical approximations to the powers against various alternatives are given. Properties of the k-linear-r-ahead recursive residuals are described and the consistency of the test is proved, when the noise level goes to zero. 相似文献

14.

Modified p-Value of Two-Sided Test for Normal Distribution with Restricted Parameter Space

Hsiuying Wang 《统计学通讯:理论与方法》2013,42(8):1361-1374

This article proposes a modified p-value for the two-sided test of the location of the normal distribution when the parameter space is restricted. A commonly used test for the two-sided test of the normal distribution is the uniformly most powerful unbiased (UMPU) test, which is also the likelihood ratio test. The p-value of the test is used as evidence against the null hypothesis. Note that the usual p-value does not depend on the parameter space but only on the observation and the assumption of the null hypothesis. When the parameter space is known to be restricted, the usual p-value cannot sufficiently utilize this information to make a more accurate decision. In this paper, a modified p-value (also called the rp-value) dependent on the parameter space is proposed, and the test derived from the modified p-value is also shown to be the UMPU test. 相似文献

15.

Variations of Q–Q Plots: The Power of Our Eyes!

Adam Loy Lendie Follett Heike Hofmann 《The American statistician》2013,67(2):202-214

In statistical modeling, we strive to specify models that resemble data collected in studies or observed from processes. Consequently, distributional specification and parameter estimation are central to parametric models. Graphical procedures, such as the quantile–quantile (Q–Q) plot, are arguably the most widely used method of distributional assessment, though critics find their interpretation to be overly subjective. Formal goodness of fit tests are available and are quite powerful, but only indicate whether there is a lack of fit, not why there is lack of fit. In this article, we explore the use of the lineup protocol to inject rigor into graphical distributional assessment and compare its power to that of formal distributional tests. We find that lineup tests are considerably more powerful than traditional tests of normality. A further investigation into the design of Q–Q plots shows that de-trended Q–Q plots are more powerful than the standard approach as long as the plot preserves distances in x and y to be the same. While we focus on diagnosing nonnormality, our approach is general and can be directly extended to the assessment of other distributions. 相似文献

16.

On limiting distribution laws of statistics analogous to pearson's chi-square

E. Csáki I. Vincze 《Statistics》2013,47(4):531-548

Two test-statistics analogous to Pearson's chi-square test function - given in (1.6) and (1.7) - are investigated. These statistics utilize, apart from the number of sample elements lying in the respective intervals of the partition, their positions within the intervals too. It is shown that the test-statistics are asymptotically distributed - as the sample size N tends to infinity - according to the x ²distribution with parameter r, i.e. the number of intervals chosen. The limiting distribution of the test statistics under the null-hypothesis when N tends to the infinity and r =O(N ^α) (0<α<1), further the consistency of the tests based on these statistics is considered. Some remarks are made concerning the efficiency of the corresponding goodness of fit tests also; the authors intend to return to a more detailed treatment of the efficiency later. 相似文献

17.

A distribution-free approach for selecting better treatment through an ethical allocation

Radhakanta Das 《Journal of nonparametric statistics》2019,31(2):482-505

相似文献

18.

More powerful tests for the sign testing problem about gamma scale parameters

Chia-Hao Chan Mei-Mei Zen 《Statistics》2015,49(3):564-577

相似文献

19.

BAHADUR SLOPES FOR COMPARISONS OF TESTS BASED ON RESAMPLING

Bing-Yi Jing 《Australian & New Zealand Journal of Statistics》1994,36(3):355-362

Let X₁,…, X_n be random variables symmetric about θ from a common unknown distribution F_θ(x) =F(x–θ). To test the null hypothesis H₀:θ= 0 against the alternative H₁:θ > 0, permutation tests can be used at the cost of computational difficulties. This paper investigates alternative tests that are computationally simpler, notably some bootstrap tests which are compared with permutation tests. Of these the symmetrical bootstrap-f test competes very favourably with the permutation test in terms of Bahadur asymptotic efficiency, so it is a very attractive alternative. 相似文献

20.

A modified one-sample test for goodness-of-fit

《Journal of Statistical Computation and Simulation》2012,82(2):422-429

This paper introduces a modified one-sample test of goodness-of-fit based on the cumulative distribution function. Damico [A new one-sample test for goodness-of-fit. Commun Stat – Theory Methods. 2004;33:181–193] proposed a test for testing goodness-of-fit of univariate distribution that uses the concept of partitioning the probability range into n intervals of equal probability mass 1/n and verifies that the hypothesized distribution evaluated at the observed data would place one case into each interval. The present paper extends this notion by allowing for m intervals of probability mass r/n, where r≥1 and n=m×r. A simulation study for small and moderate sample sizes demonstrates that the proposed test for two observations per interval under various alternatives is more powerful than the test proposed by Damico (2004). 相似文献