首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
When an I×J contingency table has many cells having very small frequencies, the usual chi-square approximation to the upper tail of the likelihood ratio goodness-of-fit statistic, G2 and Pearson chi-square statistic, X2, for testing independence, are not satisfactory. In this paper we consider the problem of adjusting G2 and X2. Suitable adjustments are suggested on the basis of analytical investigation of asymptotic bias terms for G2 and X2. A Monte Carlo simulation is performed for several tables to assess the adjustments of G2 and X2 in order to obtain a closer approximation to the nominal level of significance.  相似文献   

2.
The Bartlett's test (1937) for equality of variances is based on the χ2 distribution approximation. This approximation deteriorates either when the sample size is small (particularly < 4) or when the population number is large. According to a simulation investigation, we find a similar varying trend for the mean differences between empirical distributions of Bartlett's statistics and their χ2 approximations. By using the mean differences to represent the distribution departures, a simple adjustment approach on the Bartlett's statistic is proposed on the basis of equal mean principle. The performance before and after adjustment is extensively investigated under equal and unequal sample sizes, with number of populations varying from 3 to 100. Compared with the traditional Bartlett's statistic, the adjusted statistic is distributed more closely to χ2 distribution, for homogeneity samples from normal populations. The type I error is well controlled and the power is a little higher after adjustment. In conclusion, the adjustment has good control on the type I error and higher power, and thus is recommended for small samples and large population number when underlying distribution is normal.  相似文献   

3.
The exact and asymptotic upper tail probabilities (α = .10, .05, .01, .001) of the three chi-squared goodness-of-fit statistics Pearson's X 2, likelihood ratioG 2, and powerdivergence statisticD 2(λ), with λ= 2/3 are compared by complete enumeration for the binomial and the mixture binomial. For the two-component mixture binomial, three cases have been distinguished. 1. Both success probabilities and the mixing weights are unknwon. 2. One of the two success probabilities is known. And 3., the mixing weights are known. The binomial was investigated for the number of cellsk, being between 3 and 6 with sample sizes between 5 and 100, for k = 7 with sample sizes between 5 and 45, and for k = 10 with sample sizes ranging from 5 to 20. For the mixture binomial, solely k = 5 cells were considered with sample sizes from 5 to 100 and k = 8 cells with sample sizes between 4 and 20. Rating the relative accuracy of the chi-squared approximation in terms of ±10% and ±20% intervals around α led to the following conclusions for the binomial: 1. Using G2 is not recommendable. 2. At the significance levels α=.10 and α=.05X 2 should be preferred over D 2; D 2 is the best choice at α = .01. 3. Cochran's (1954; Biometrics, 10, 417-451) rule for the minimum expectation when using X 2 seems to generalize to the binomial for G 2 and D 2 ; as a compromise, it gives a rather strong lower limit for the expected cell frequencies in some circumstances, but a rather liberal in others. To draw similar conclusions concerning the mixture binomial was not possible, because in that case, the accuracy of the chi-squared approximation is not only a function of the chosen test statistic and of the significance level, but also heavily depends on the numerical value of theinvolved unknown parameters and on the hypothesis to be tested. Thereto, the present study may give rise only to warnings against the application of mixture models to small samples.  相似文献   

4.
In this paper, progressive-stress accelerated life tests are applied when the lifetime of a product under design stress follows the exponentiated distribution [G(x)]α. The baseline distribution, G(x), follows a general class of distributions which includes, among others, Weibull, compound Weibull, power function, Pareto, Gompertz, compound Gompertz, normal and logistic distributions. The scale parameter of G(x) satisfies the inverse power law and the cumulative exposure model holds for the effect of changing stress. A special case for an exponentiated exponential distribution has been discussed. Using type-II progressive hybrid censoring and MCMC algorithm, Bayes estimates of the unknown parameters based on symmetric and asymmetric loss functions are obtained and compared with the maximum likelihood estimates. Normal approximation and bootstrap confidence intervals for the unknown parameters are obtained and compared via a simulation study.  相似文献   

5.
The small-sample accuracy of seven members of the family of power-divergence statistics for testing independence or homogeneity in contingency tables was studied via simulation. The likelihood ratio statistic G 2 and Pearson's X 2 statistic are among these seven members, whose behavior was studied at nominal test sizes of.01 and.05 with marginal distributions that could be uniform or skewed and with a set of sample sizes that included sparseness conditions as measured through table density (i.e., the ratio of sample size to number of cells). The likelihood ratio statistic G 2 rejected the null hypothesis too often even with large table density, whereas Pearson's X 2 was sufficiently accurate and only presented a minor misbehavior when table density was less than two observations/cell. None of the other five statistics outperformed Pearson's X 2. A nonasymptotic variant of X 2 solved the minor inaccuracies of Pearson's X 2 and turned out to be the most accurate statistic for testing independence or homogeneity, even with table densities of one observation/cell. These results clearly advise against the use of the likelihood ratio statistic G 2.  相似文献   

6.
This paper provides a general method of modifying a statistic of interest in such a way that the distribution of the modified statistic can be approximated by an arbitrary reference distribution to an order of accuracy of O(n -1/2) or even O(n -1). The reference distribution is usually the asymptotic distribution of the original statistic. We prove that the multiplication of the statistic by a suitable stochastic correction improves the asymptotic approximation to its distribution. This paper extends the results of the closely related paper by Cordeiro and Ferrari (1991) to cope with several other statistical tests. The resulting expression for the adjustment factor requires knowledge of the Edgeworth-type expansion to order O(n-1) for the distribution of the unmodified statistic. In practice its functional form involves some derivatives of the reference distribution. Certain difference between the cumulants of appropriate order in n of the unmodified statistic and those of its first-order approximation, and the unmodified statistic itself. Some applications are discussed.  相似文献   

7.
Cordeiro (1983) has derived the expected value of the deviance for generalized linear models correct to terms of order n -1 being the sample size. Then a Bartlett-type factor is available for correcting the first moment of the deviance and for fitting its distribution. If the model is correct, the deviance is not, in general, distributed as chi-squared even asymptotically and very little is known about the adequacy of the X 2 approximation. This paper through simulation studies examines the behaviour of the deviance and a Bartlett adjusted deviance for testing the goodness-of-fit of a generalized linear model. The practical use of such adjustment is illustrated for some gamma and Poisson models. It is suggested that the null distribution of the adjusted deviance is better approximated by chi-square than the distribution of the deviance.  相似文献   

8.
《随机性模型》2013,29(2):173-191
Abstract

We propose a new approximation formula for the waiting time tail probability of the M/G/1 queue with FIFO discipline and unlimited waiting space. The aim is to address the difficulty of obtaining good estimates when the tail probability has non-exponential asymptotics. We show that the waiting time tail probability can be expressed in terms of the waiting time tail probability of a notional M/G/1 queue with truncated service time distribution plus the tail probability of an extreme order statistic. The Cramér–Lundberg approximation is applied to approximate the tail probability of the notional queue. In essence, our technique extends the applicability of the Cramér–Lundberg approximation to cases where the standard Lundberg condition does not hold. We propose a simple moment-based technique for estimating the parameters of the approximation; numerical results demonstrate that our approximation can yield very good estimates over the whole range of the argument.  相似文献   

9.
The exact and asymptotic upper tail probabilities ( α= .l0, .05, .01, .001) of the three chi-squared goodness-of-fit statistics Pearson's X 2, likelihood ratioG 2, and power-divergence statisticD 2 (λ ) , with λ = 2/3, are compared numerically for simple null hypotheses not involving parameter estimation. Three types of such hypotheses were investigated (equal cell probabilities, proportional cell probabilities, some fixed small expectations together with some increasing large expectations) for the number of cells being between 3 and 15, and for sample sizes from 10 to 40, increasing by steps of one. Rating the relative accuracy of the chi-squared approximation in terms of ±10% and ±20% intervals around α led to the following conclusions: 1. Using G 2 is not recommended. 2 . At the more relevant significance levels α = .10 and α = .05X 2 should be preferred over D 2. Solely in case of unequal cell probabilitiesD 2 is the better choice at α = .O1 and α = .001. 3 . Yarnold's (1970; Journal of the Amerin Statistical Association, 65, 864-886) rule for the minimum expectation when using X 2 ("If the number of cells k is 3 or more, and if r denotes the number of expectations less than 5, then the minimum expectation may be as small as 5r/k.") generalizes to D 2; it gives a good lower limit for the expected cell frequencies, however, when the number of cells is greater than 3. For k = 3 , even sample sizes over 15 may be insufficient.  相似文献   

10.
Duadic codes are defined in terms of idempotents of a group algebra GF(q)G, where G is a finite group and gcd(q,|G|)=1. Under the conditions of (1) q=2m, and (2) the idempotents are taken to be central and (3) the splitting is μ−1, we show that such duadic codes exist if and only if q has odd-order modulo |G|.  相似文献   

11.
Abstract

A new non linear estimator, W, for the number of valid, unique signatures on a petition has been shown better, for the cases enumerated and with certain restrictions, than a popular Goodman-type statistic, G. This article extends those results with relaxed conditions by developing the exact probability mass function and mean of W and a close approximation of the variance (Var(W)). If the proportion of valid signatures among unique and duplicated signatures is the same, then Var(W) is approximately a function of the means and variances of the two sample statistics. Using the delta method, we estimate Var(W), with the resulting approximation shown to be good, even when the condition of equal proportions does not hold. We compare W to G and establish which estimator is preferred for different intervals of the design parameters. Data from a Washington State petition illustrate the findings.  相似文献   

12.
13.
When a sample discriminant function is computed, it is desired to estimate the error rate using this function. This is often done by computing G(-D/2), where G is the cumulative normal distribution and D2 is the estimated Mahalanobis' distance. In this paper an asymptotic expansion of the expectation of G(-D/2) is derived and is compared with existing Monte Carlo estimates. The asymptotic bias of G(-D/2) is derived also and the well-known practical result that G(-D/2) gives too favourable an estimate of the true error rate  相似文献   

14.
Statistics R a based on power divergence can be used for testing the homogeneity of a product multinomial model. All R a have the same chi-square limiting distribution under the null hypothesis of homogeneity. R 0 is the log likelihood ratio statistic and R 1 is Pearson's X 2 statistic. In this article, we consider improvement of approximation of the distribution of R a under the homogeneity hypothesis. The expression of the asymptotic expansion of distribution of R a under the homogeneity hypothesis is investigated. The expression consists of continuous and discontinuous terms. Using the continuous term of the expression, a new approximation of the distribution of R a is proposed. A moment-corrected type of chi-square approximation is also derived. By numerical comparison, we show that both of the approximations perform much better than that of usual chi-square approximation for the statistics R a when a ≤ 0, which include the log likelihood ratio statistic.  相似文献   

15.
Fisher's A statistic, often called the adjusted R2 statistic, is shown to be a close approximation to the maximum likelihood estimate of the multiple correlation coefficient, p2, based on the marginal distribution of R2. Expansions for the estimate are obtained. The same methods lead to maximum marginal likelihood estimators for the noncentrality parameters for noncentral X2 and F.  相似文献   

16.
Generalized Bhaskar Rao designs with non-zero elements from an abelian group G are constructed. In particular this paper shows that the necessary conditions are sufficient for the existence of generalized Bhaskar Rao designs with k=3 for the following groups: ?G? is odd, G=Zr2, and G=Zr2×H where 3? ?H? and r?1. It also constructs generalized Bhaskar Rao designs with υ=k, which is equivalent to υ rows of a generalized Hadamard matrix of order n where υ?n.  相似文献   

17.
As the sample size increases, the coefficient of skewness of the Fisher's transformation z= tanh-1r, of the correlation coefficient decreases much more rapidly than the excess of its kurtosis. Hence, the distribution of standardized z can be approximated more accurately in terms of the t distribution with matching kurtosis than by the unit normal distribution. This t distribution can, in turn be subjected to Wallace's approximation resulting in a new normal approximation for the Fisher's z transform. This approximation, which can be used to estimate the probabilities, as well as the percentiles, compares favorably in both accuracy and simplicity, with the two best earlier approximations, namely, those due to Ruben (1966) and Kraemer (1974). Fisher (1921) suggested approximating distribution of the variance stabilizing transform z=(1/2) log ((1 +r)/(1r)) of the correlation coefficient r by the normal distribution with mean = (1/2) log ((1 + p)/(lp)) and variance =l/(n3). This approximation is generally recognized as being remarkably accurate when ||Gr| is moderate but not so accurate when ||Gr| is large, even when n is not small (David (1938)). Among various alternatives to Fisher's approximation, the normalizing transformation due to Ruben (1966) and a t approximation due to Kraemer (1973), are interesting on the grounds of novelty, accuracy and/or aesthetics. If r?= r/√ (1r2) and r?|Gr = |Gr/√(1|Gr2), then Ruben (1966) showed that (1) gn (r,|Gr) ={(2n5)/2}1/2r?r{(2n3)/2}1/2r?|GR, {1 + (1/2)(r?r2+r?|Gr2)}1/2 is approximately unit normal. Kraemer (1973) suggests approximating (2) tn (r, |Gr) = (r|GR1) √ (n2), √(11r2) √(1|Gr2) by a Student's t variable with (n2) degrees of freedom, where after considering various valid choices for |Gr1 she recommends taking |Gr1= |Gr*, the median of r given n and |Gr.  相似文献   

18.
When an r×c contingency table has many cells having very small expectations, the usual χ2 approximation to the upper tail of the Pearson χ2 goodness-of-fit statistic becomes very conservative. The alternatives considered in this paper are to use either a lognormal approximation, or to scale the usual χ2 approximation. The study involves thousands of tables with various sample sizes, and with tables whose sizes range from 2×2 through 2×10×10. Subject to certain restrictions the new scaled χ2 approximations are recommended for use with tables having an average cell expectation as small as 0·5.  相似文献   

19.
The situation where k populations are partitioned into one inferior group and one superior group is considered. The statistical problem is to select a random size subset of superior populations while trying to avoid including any inferior populations. A selection procedure is assumed to satisfy the condition that the probability of selecting at least one superior population is bounded below by P1<1. The performance of a procedure is measured by the probability of including an inferior population.The asymptotic performance, as k→∞ of Gupta's traditional maximum type procedure ψG is considered in the location-model. For normally distributed populations, ψG turns out to be asymptotically optimal, provided the size of the inferior group does not become infinitely larger than the size of the superior group.  相似文献   

20.
We develop an improved approximation to the asymptotic null distribution of the goodness-of-fit tests for panel observed multi-state Markov models (Aguirre-Hernandez and Farewell, Stat Med 21:1899–1911, 2002) and hidden Markov models (Titman and Sharples, Stat Med 27:2177–2195, 2008). By considering the joint distribution of the grouped observed transition counts and the maximum likelihood estimate of the parameter vector it is shown that the distribution can be expressed as a weighted sum of independent c21{\chi^2_1} random variables, where the weights are dependent on the true parameters. The performance of this approximation for finite sample sizes and where the weights are calculated using the maximum likelihood estimates of the parameters is considered through simulation. In the scenarios considered, the approximation performs well and is a substantial improvement over the simple χ 2 approximation.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号