首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Classical continuous goodness-of-fit (GOF) testing is employed for examining whether the data come from an assumed parametric model. In many cases, GOF tests assume a uniform null distribution and examine extreme values of the order statistics of the samples. Many of these statistics can be expressed by a function of the order statistics and the p-values amount to a joint probability statement based on the uniform order statistics. In this paper, we utilize Steck''s recursion method and propose two high precision computing algorithms to compute the p-values for these GOF statistics. The numerical difficulties in implementing Steck''s method are discussed and compared with solutions provided in high precision libraries.  相似文献   

2.
Nonparametric tests are proposed for the equality of two unknown p-variate distributions. Empirical probability measures are defined from samples from the two distributions and used to construct test statistics as the supremum of the absolute differences between empirical probabilities, the supremum being taken over all possible events. The test statistics are truly multivariate in not requiring the artificial ranking of multivariate observations, and they are distribution-free in the general p-variate case. Asymptotic null distributions are obtained. Powers of the proposed tests and a competitor are examined by Monte Carlo techniques.  相似文献   

3.
ABSTRACT

Conditional tests are constructed by conditioning a fit measure to a minimal sufficient statistic. To calculate the p-value of these tests, Monte Carlo methods with co-sufficient samples can be used. In this paper we show how to simulate co-sufficient samples when the data distribution belongs to the exponential family with doubly transitive sufficient statistics. The proposed method is illustrated using the beta distribution.  相似文献   

4.
Birnbaum–Saunders (BS) distribution is widely used in reliability applications to model failure times. For several samples from possible different BS distributions, to prevent wrong conclusions in any further analysis, it is of importance to accompany a formal comparison for characteristic quantities of the distributions, including mean, quantile and reliability function difference. To this end, two test statistics, which are respectively based on the exact generalized p-value approach and the Delta method, are proposed and their behaviours are investigated. Simulation studies are carried out to examine the size and power performance of the newly proposed statistics. An interesting phenomenon is that in the finite sample simulations we conduct, the Delta method-based test almost uniformly outperforms the generalized p-value-based test although its sampling null distribution is simulated by Monte Carlo method. This might suggest that the sampling null distribution of the Delta method-based test statistic would have a fast convergence to its limit. The tests are also applied to analyse a real example on the fatigue life of 6061-T6 aluminium coupons for illustration.  相似文献   

5.
In this article, we introduce two goodness-of-fit tests for testing normality through the concept of the posterior predictive p-value. The discrepancy variables selected are the Kolmogorov-Smirnov (KS) and Berk-Jones (BJ) statistics and the prior chosen is Jeffreys’ prior. The constructed posterior predictive p-values are shown to be distributed independently of the unknown parameters under the null hypothesis, thus they can be taken as the test statistics. It emerges from the simulation that the new tests are more powerful than the corresponding classical tests against most of the alternatives concerned.  相似文献   

6.
Pearson’s chi-square (Pe), likelihood ratio (LR), and Fisher (Fi)–Freeman–Halton test statistics are commonly used to test the association of an unordered r×c contingency table. Asymptotically, these test statistics follow a chi-square distribution. For small sample cases, the asymptotic chi-square approximations are unreliable. Therefore, the exact p-value is frequently computed conditional on the row- and column-sums. One drawback of the exact p-value is that it is conservative. Different adjustments have been suggested, such as Lancaster’s mid-p version and randomized tests. In this paper, we have considered 3×2, 2×3, and 3×3 tables and compared the exact power and significance level of these test’s standard, mid-p, and randomized versions. The mid-p and randomized test versions have approximately the same power and higher power than that of the standard test versions. The mid-p type-I error probability seldom exceeds the nominal level. For a given set of parameters, the power of Pe, LR, and Fi differs approximately the same way for standard, mid-p, and randomized test versions. Although there is no general ranking of these tests, in some situations, especially when averaged over the parameter space, Pe and Fi have the same power and slightly higher power than LR. When the sample sizes (i.e., the row sums) are equal, the differences are small, otherwise the observed differences can be 10% or more. In some cases, perhaps characterized by poorly balanced designs, LR has the highest power.  相似文献   

7.
This article proposes a modified p-value for the two-sided test of the location of the normal distribution when the parameter space is restricted. A commonly used test for the two-sided test of the normal distribution is the uniformly most powerful unbiased (UMPU) test, which is also the likelihood ratio test. The p-value of the test is used as evidence against the null hypothesis. Note that the usual p-value does not depend on the parameter space but only on the observation and the assumption of the null hypothesis. When the parameter space is known to be restricted, the usual p-value cannot sufficiently utilize this information to make a more accurate decision. In this paper, a modified p-value (also called the rp-value) dependent on the parameter space is proposed, and the test derived from the modified p-value is also shown to be the UMPU test.  相似文献   

8.
Abstract

In statistical hypothesis testing, a p-value is expected to be distributed as the uniform distribution on the interval (0, 1) under the null hypothesis. However, some p-values, such as the generalized p-value and the posterior predictive p-value, cannot be assured of this property. In this paper, we propose an adaptive p-value calibration approach, and show that the calibrated p-value is asymptotically distributed as the uniform distribution. For Behrens–Fisher problem and goodness-of-fit test under a normal model, the calibrated p-values are constructed and their behavior is evaluated numerically. Simulations show that the calibrated p-values are superior than original ones.  相似文献   

9.
ABSTRACT

Area statistics are sample versions of areas occurring in a probability plot of two distribution functions F and G. This paper presents a unified basis for five statistics of this type. They can be used for various testing problems in the framework of the two sample problem for independent observations, such as testing equality of distributions against inequality or testing stochastic dominance of distributions in one or either direction against nondominance. Though three of the statistics considered have already been suggested in literature, two of them are new and deserve our interest. The finite sample distributions of the statistics (under F=G) can be calculated via recursion formulae. Two tables with critical values of the new statistics are included. The asymptotic distribution of the properly normalized versions of the area statistics are functionals of the Brownian bridge. The distribution functions and quantiles thereof are obtained by Monte Carlo simulation. Finally, the power functions of the two new tests based on area statistics are compared to the power functions of the tests based on the corresponding supremum statistics, i.e., statistics of the Kolmogorov–Smirnov type.  相似文献   

10.
Combining p-values from statistical tests across different studies is the most commonly used approach in meta-analysis for evolutionary biology. The most commonly used p-value combination methods mainly incorporate the z-transform tests (e.g., the un-weighted z-test and the weighted z-test) and the gamma-transform tests (e.g., the CZ method [Z. Chen, W. Yang, Q. Liu, J.Y. Yang, J. Li, and M.Q. Yang, A new statistical approach to combining p-values using gamma distribution and its application to genomewide association study, Bioinformatics 15 (2014), p. S3]). However, among these existing p-value combination methods, no method is uniformly most powerful in all situations [Chen et al. 2014]. In this paper, we propose a meta-analysis method based on the gamma distribution, MAGD, by pooling the p-values from independent studies. The newly proposed test, MAGD, allows for flexible accommodating of the different levels of heterogeneity of effect sizes across individual studies. The MAGD simultaneously retains all the characters of the z-transform tests and the gamma-transform tests. We also propose an easy-to-implement resampling approach for estimating the empirical p-values of MAGD for the finite sample size. Simulation studies and two data applications show that the proposed method MAGD is essentially as powerful as the z-transform tests (the gamma-transform tests) under the circumstance with the homogeneous (heterogeneous) effect sizes across studies.  相似文献   

11.
Abstract

The hypothesis tests of performance measures for an M/Ek/1 queueing system are considered. With pivotal models deduced from sufficient statistics for the unknown parameters, a generalized p-value approach to derive tests about parametric functions are proposed. The focus is on derivation of the p-values of hypothesis testing for five popular performance measures of the system in the steady state. Given a sample T, let p(T) be the p values we developed. We derive a closed form expression to show that, for small samples, the probability P(p(T) ? γ) is approximately equal to γ, for 0 ? γ ? 1.  相似文献   

12.
While it is often argued that a p-value is a probability; see Wasserstein and Lazar, we argue that a p-value is not defined as a probability. A p-value is a bijection of the sufficient statistic for a given test which maps to the same scale as the Type I error probability. As such, the use of p-values in a test should be no more a source of controversy than the use of a sufficient statistic. It is demonstrated that there is, in fact, no ambiguity about what a p-value is, contrary to what has been claimed in recent public debates in the applied statistics community. We give a simple example to illustrate that rejecting the use of p-values in testing for a normal mean parameter is conceptually no different from rejecting the use of a sample mean. The p-value is innocent; the problem arises from its misuse and misinterpretation. The way that p-values have been informally defined and interpreted appears to have led to tremendous confusion and controversy regarding their place in statistical analysis.  相似文献   

13.
ABSTRACT

This article has two objectives. The first and narrower is to formalize the p-value function, which records all possible p-values, each corresponding to a value for whatever the scalar parameter of interest is for the problem at hand, and to show how this p-value function directly provides full inference information for any corresponding user or scientist. The p-value function provides familiar inference objects: significance levels, confidence intervals, critical values for fixed-level tests, and the power function at all values of the parameter of interest. It thus gives an immediate accurate and visual summary of inference information for the parameter of interest. We show that the p-value function of the key scalar interest parameter records the statistical position of the observed data relative to that parameter, and we then describe an accurate approximation to that p-value function which is readily constructed.  相似文献   

14.
In this paper we propose a series of goodness-of-fit tests for the family of skew-normal models when all parameters are unknown. As the null distributions of the considered test statistics depend only on asymmetry parameter, we used a default and proper prior on skewness parameter leading to the prior predictive p-value advocated by G. Box. Goodness-of-fit tests, here proposed, depend only on sample size and exhibit full agreement between nominal and actual size. They also have good power against local alternative models which also account for asymmetry in the data.  相似文献   

15.
Many procedures exist for testing equality of means or medians to compare several independent distributions. However, the mean or median do not determine the entire distribution. In this article, we propose a new small-sample modification of the likelihood ratio test for testing the equality of the quantiles of several normal distributions. The merits of the proposed test are numerically compared with the existing tests—a generalized p-value method and likelihood ratio test—with respect to their sizes and powers. The simulation results demonstrate that proposed method is satisfactory; its actual size is very close to the nominal level. We illustrate these approaches using two real examples.  相似文献   

16.
In this article, we focus on the one-sided hypothesis testing for the univariate linear calibration, where a normally distributed response variable and an explanatory variable are involved. The observations of the response variable corresponding to known values of the explanatory variable are used to make inferences on a single unknown value of the explanatory variable. We apply the generalized inference to the calibration problem, and take the generalized p-value as the test statistic to develop a new p-value for one-sided hypothesis testing, which we refer to as the one-sided posterior predictive p-value. The behavior of the one-sided posterior predictive p-value is numerically compared with that of the generalized p-value, and simulations show that the proposed p-value is quite satisfactory in the frequentist performance.  相似文献   

17.
18.
This paper applies recent theories of testing for parameter constancy to the conditional variance in a GARCH model. The supremum Lagrange multiplier test for conditional Gaussian GARCH models and its robustified variants are discussed. The asymptotic null distribution of the test statistics are derived from the weak convergence of the scores, and the critical values from the hitting probability of squared Bessel process.

Monte Carlo studies on the finite sample size and power performance of the supremum LM tests are conducted. Applications of these tests to S&P 500 indicate that the hypothesis of stable conditional variance parameters can be rejected.  相似文献   

19.
This paper investigates methodologies for evaluating the probabilistic value (P-value) of the Kolmogorov–Smirnov (K–S) goodness-of-fit test using algorithmic program development implemented in Microsoft® Visual Basic® (VB). Six methods were examined for the one-sided one-sample and two methods for the two-sided one-sample cumulative sampling distributions in the investigative software implementation that was based on machine-precision arithmetic. For sample sizes n≤2000 considered, results from the Smirnov iterative method found optimal accuracy for K–S P-values≥0.02, while those from the SmirnovD were more accurate for lower P-values for the one-sided one-sample distribution statistics. Also, the Durbin matrix method sustained better P-value results than the Durbin recursion method for the two-sided one-sample tests up to n≤700 sample sizes. Based on these results, an algorithm for Microsoft Excel® function was proposed from which a model function was developed and its implementation was used to test the performance of engineering students in a general engineering course across seven departments.  相似文献   

20.
This article considers the problem of testing marginal homogeneity in a 2 × 2 contingency table. We first review some well-known conditional and unconditional p-values appeared in the statistical literature. Then we treat the p-value as the test statistic and use the unconditional approach to obtain the modified p-value, which is shown to be valid. For a given nominal level, the rejection region of the modified p-value test contains that of the original p-value test. Some nice properties of the modified p-value are given. Especially, under mild conditions the rejection region of the modified p-value test is shown to be the Barnard convex set as described by Barnard (1947 Barnard , G. A. ( 1947 ). Significance tests for 2 × 2 tables . Biometrika 34 : 123138 .[Crossref], [PubMed], [Web of Science ®] [Google Scholar]). If the one-sided null hypothesis has two nuisance parameters, we show that this result can reduce the dimension of the nuisance parameter space from two to one for computing modified p-values and sizes of tests. Numerical studies including an illustrative example are given. Numerical comparisons show that the sizes of the modified p-value tests are closer to a nominal level than those of the original p-value tests for many cases, especially in the case of small to moderate sample sizes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号