期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Modified classification and regression tree splitting criteria for data with interactions 总被引：1，自引：0，他引：1

Alexandra P. Bremner & Ross H. Taplin 《Australian & New Zealand Journal of Statistics》2002,44(2):169-176

This paper proposes modified splitting criteria for classification and regression trees by modifying the definition of the deviance. The modified deviance is based on local averaging instead of global averaging and is more successful at modelling data with interactions. The paper shows that the modified criteria result in much simpler trees for pure interaction data (no main effects) and can produce trees with fewer errors and lower residual mean deviances than those produced by Clark & Pregibon's (1992) method when applied to real datasets with strong interaction effects. 相似文献

2.

Fitting and comparing seed germination models with a focus on the inverse normal distribution

Michael E. O'Neill Peter C. Thomson Brent C. Jacobs Phil Brain Ruth C. Butler Heather Turner Bernadetha Mitakda 《Australian & New Zealand Journal of Statistics》2004,46(3):349-366

This paper reviews current methods for fitting a range of models to censored seed germination data and recommends adoption of a probability‐based model for the time to germination. It shows that, provided the probability of a seed eventually germinating is not on the boundary, maximum likelihood estimates, their standard errors and the resultant deviances are identical whether only those seeds which have germinated are used or all seeds (including seeds ungerminated at the end of the experiment). The paper recommends analysis of deviance when exploring whether replicate data are consistent with a hypothesis that the underlying distributions are identical, and when assessing whether data from different treatments have underlying distributions with common parameters. The inverse normal distribution, otherwise known as the inverse Gaussian distribution, is discussed, as a natural distribution for the time to germination (including a parameter to measure the lag time to germination). The paper explores some of the properties of this distribution, evaluates the standard errors of the maximum likelihood estimates of the parameters and suggests an accurate approximation to the cumulative distribution function and the median time to germination. Additional material is on the web, at http://www.agric.usyd.edu.au/staff/oneill/ . 相似文献

3.

Goodness of fit of generalized linear models to sparse data

S. R. Paul & D. Deng 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2000,62(2):323-333

We derive approximations to the first three moments of the conditional distribution of the deviance statistic, for testing the goodness of fit of generalized linear models with non-canonical links, by using an estimating equations approach, for data that are extensive but sparse. A supplementary estimating equation is proposed from which the modified deviance statistic is obtained. An application of a modified deviance statistic is shown to binomial and Poisson data. We also conduct a performance study of the modified Pearson statistic derived by Farrington and the modified deviance statistic derived in this paper, in terms of size and power, through a small scale simulation experiment. Both statistics are shown to perform well in terms of size. The deviance statistic, however, shows an advantage of power. Two examples are given. 相似文献

4.

Permutation tests of scale using deviances

Scott J. Richter Melinda H. McCann 《统计学通讯:模拟与计算》2017,46(7):5553-5565

Robust tests for comparing scale parameters, based on deviances—absolute deviations from the median—are examined. Higgins (2004) proposed a permutation test for comparing two treatments based on the ratio of deviances, but the performance of this procedure has not been investigated. A simulation study examines the performance of Higgins’ test relative to other tests of scale utilizing deviances that have been shown in the literature to have good properties. An extension of Higgins’ procedure to three or more treatments is proposed, and a second simulation study compares its performance to other omnibus tests for comparing scale. While no procedure emerged as a preferred choice in every scenario, Higgins’ tests are found to perform well overall with respect to Type I error rate and power. 相似文献

5.

A log-linear regression model for the odd Weibull distribution with censored data

Edwin M.M. Ortega Gauss M. Cordeiro Elizabeth M. Hashimoto Kahadawala Cooray 《Journal of applied statistics》2014,41(9):1859-1880

We introduce the log-odd Weibull regression model based on the odd Weibull distribution (Cooray, 2006). We derive some mathematical properties of the log-transformed distribution. The new regression model represents a parametric family of models that includes as sub-models some widely known regression models that can be applied to censored survival data. We employ a frequentist analysis and a parametric bootstrap for the parameters of the proposed model. We derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and present some ways to assess global influence. Further, for different parameter settings, sample sizes and censoring percentages, some simulations are performed. In addition, the empirical distribution of some modified residuals are given and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be extended to a modified deviance residual in the proposed regression model applied to censored data. We define martingale and deviance residuals to check the model assumptions. The extended regression model is very useful for the analysis of real data. 相似文献

6.

Influence diagnostics in the Gamma regression model with adjusted deviance residuals

Muhammad Amin Muhammad Amanullah Gauss M. Cordeiro 《统计学通讯:模拟与计算》2017,46(9):6959-6973

In this study, we develop the adjusted deviance residuals for the gamma regression model (GRM) by following Cordeiro's (2004) method. These adjusted deviance residuals under the GRM are used for influence diagnostics. A comparative analysis has been sorted out between our proposed method of the adjusted deviance residuals and an existing method for influence diagnostics. These results are illustrated by a simulation study and using a real data set. They are presented for different values of dispersion and sample sizes and indicate the significant role of the GRM inferences. 相似文献

7.

A note on Cochran test for homogeneity in one-way ANOVA and meta-analysis

Zhongxue Chen Hon Keung Tony Ng Saralees Nadarajah 《Statistical Papers》2014,55(2):301-310

In this paper we provide a formal yet simple and straightforward proof of the asymptotic χ² distribution for Cochran test statistic. Then, we show that the general form of this type of test statistics is invariant for the choice of weights. This fact is important since in practice many such test statistics are constructed with more complicated forms which usually require calculating generalized inverse matrices. Based on our results, we can simplify the construction of the test statistics. More importantly, properties such as anti-conservativeness of this type of test statistics can be drawn from Cochran test statistic. Furthermore, one can improve the performance of the tests by using some modified statistics with correction for small sample size situations. 相似文献

8.

On the use of corrections for overdispersion 总被引：3，自引：0，他引：3

J. K. Lindsey 《Journal of the Royal Statistical Society. Series C, Applied statistics》1999,48(4):553-561

In studying fluctuations in the size of a blackgrouse ( Tetrao tetrix ) population, an autoregressive model using climatic conditions appears to follow the change quite well. However, the deviance of the model is considerably larger than its number of degrees of freedom. A widely used statistical rule of thumb holds that overdispersion is present in such situations, but model selection based on a direct likelihood approach can produce opposing results. Two further examples, of binomial and of Poisson data, have models with deviances that are almost twice the degrees of freedom and yet various overdispersion models do not fit better than the standard model for independent data. This can arise because the rule of thumb only considers a point estimate of dispersion, without regard for any measure of its precision. A reasonable criterion for detecting overdispersion is that the deviance be at least twice the number of degrees of freedom, the familiar Akaike information criterion, but the actual presence of overdispersion should then be checked by some appropriate modelling procedure. 相似文献

9.

New Goodness of Fit Tests Based on Stochastic EDF

Jianxin Zhao Xingzhong Xu Xiaobo Ding 《统计学通讯:理论与方法》2013,42(6):1075-1094

A new approach of randomization is proposed to construct goodness of fit tests generally. Some new test statistics are derived, which are based on the stochastic empirical distribution function (EDF). Note that the stochastic EDF for a set of given sample observations is a randomized distribution function. By substituting the stochastic EDF for the classical EDF in the Kolmogorov–Smirnov, Cramér–von Mises, Anderson–Darling, Berk–Jones, and Einmahl–Mckeague statistics, randomized statistics are derived, of which the qth quantile and the expectation are chosen as test statistics. In comparison to existing tests, it is shown, by a simulation study, that the new test statistics are generally more powerful than the corresponding ones based on the classical EDF or modified EDF in most cases. 相似文献

10.

Testing for a change point in a sequence of exponential random variables with repeated values

《Journal of Statistical Computation and Simulation》2012,82(2):191-199

Two test statistics are proposed for the change-point problem with repeated values when the data follow an exponential distribution. The properties of these two statistics have been studied and their asymptotic distributions under the alternative have been derived. The powers of the two test statistics are compared. Real-data examples are presented to illustrate the application of these tests. 相似文献

11.

Robust 2k factorial design with Weibull error distributions

B rdal eno lu 《Journal of applied statistics》2005,32(10):1051-1066

It is well known that the least squares method is optimal only if the error distributions are normally distributed. However, in practice, non-normal distributions are more prevalent. If the error terms have a non-normal distribution, then the efficiency of least squares estimates and tests is very low. In this paper, we consider the 2^k factorial design when the distribution of error terms are Weibull W(p,σ). From the methodology of modified likelihood, we develop robust and efficient estimators for the parameters in 2^k factorial design. F statistics based on modified maximum likelihood estimators (MMLE) for testing the main effects and interaction are defined. They are shown to have high powers and better robustness properties as compared to the normal theory solutions. A real data set is analysed. 相似文献

12.

VARIATIONAL BAYESIAN ANALYSIS FOR HIDDEN MARKOV MODELS

C. A. McGrory D. M. Titterington 《Australian & New Zealand Journal of Statistics》2009,51(2):227-244

The variational approach to Bayesian inference enables simultaneous estimation of model parameters and model complexity. An interesting feature of this approach is that it also leads to an automatic choice of model complexity. Empirical results from the analysis of hidden Markov models with Gaussian observation densities illustrate this. If the variational algorithm is initialized with a large number of hidden states, redundant states are eliminated as the method converges to a solution, thereby leading to a selection of the number of hidden states. In addition, through the use of a variational approximation, the deviance information criterion for Bayesian model selection can be extended to the hidden Markov model framework. Calculation of the deviance information criterion provides a further tool for model selection, which can be used in conjunction with the variational approach. 相似文献

13.

Asymptotics of goodness-of-fit tests based on minimum p-value statistics

Veronika Gontscharuk Helmut Finner 《统计学通讯:理论与方法》2017,46(5):2332-2342

This paper provides some new results on the asymptotics of goodness-of-fit (GOF) tests based on minimum p-value statistics. In connection with detectability of sparse signals in high-dimensional data, various tests were proposed and investigated during the last decade, especially with respect to asymptotic properties. Minimum p-value GOF statistics were already investigated as minimum level attained statistics by Berk and Jones with respect to Bahadur efficiency. The distribution of minimum p-value GOF statistics is closely related to the distribution of higher criticism statistics, the distribution of the supremum of a normalized Brownian bridge, and the supremum of an Ornstein–Uhlenbeck process. 相似文献

14.

Multivariate portmanteau test for structural VARMA models with uncorrelated but non-independent error terms

Y. Boubacar Mainassara 《Journal of statistical planning and inference》2011,141(8):2961-2975

We consider portmanteau tests for testing the adequacy of structural vector autoregressive moving-average (VARMA) models under the assumption that the errors are uncorrelated but not necessarily independent. The structural forms are mainly used in econometrics to introduce instantaneous relationships between economic variables. We first study the joint distribution of the quasi-maximum likelihood estimator (QMLE) and the noise empirical autocovariances. We then derive the asymptotic distribution of residual empirical autocovariances and autocorrelations under weak assumptions on the noise. We deduce the asymptotic distribution of the Ljung-Box (or Box-Pierce) portmanteau statistics in this framework. It is shown that the asymptotic distribution of the portmanteau tests is that of a weighted sum of independent chi-squared random variables, which can be quite different from the usual chi-squared approximation used under independent and identically distributed (iid) assumptions on the noise. Hence we propose a method to adjust the critical values of the portmanteau tests. Monte Carlo experiments illustrate the finite sample performance of the modified portmanteau test. 相似文献

15.

Hausman-type tests for individual and time effects in the panel regression model with incomplete data

Jing Chen Rongxian Yue Jianhong Wu 《Journal of the Korean Statistical Society》2018,47(3):347-363

By comparing estimators of the variance of idiosyncratic error at different robust levels, two Hausman-type test statistics are respectively constructed for the existence of individual and time effects in the panel regression model with incomplete data. The resultant test statistics have several desired properties. Firstly, they are robust to the presence of one effect when the other is tested. Secondly, they are immune to the non-normal distribution of the disturbances since the distributional conditions are not needed in the construction of the statistics. Thirdly, they have more robust performances than the main competitors in the literature when the covariates are correlated with the effects. Additionally, they are very simple and have no heavy computational burden. Joint tests for both of the two effects are also discussed. Monte Carlo evidence shows that the proposed tests have desired finite sample properties, and a real data analysis gives further support. 相似文献

16.

Bayesian parametric accelerated failure time spatial model and its application to prostate cancer

Jiajia Zhang Andrew B. Lawson 《Journal of applied statistics》2011,38(3):591-603

Prostate cancer (PrCA) is the most common cancer diagnosed in American men and the second leading cause of death from malignancies. There are large geographical variation and racial disparities existing in the survival rate of PrCA. Much work on the spatial survival model is based on the proportional hazards (PH) model, but few focused on the accelerated failure time (AFT) model. In this paper, we investigate the PrCA data of Louisiana from the Surveillance, Epidemiology, and End Results program and the violation of the PH assumption suggests that the spatial survival model based on the AFT model is more appropriate for this data set. To account for the possible extra-variation, we consider spatially referenced independent or dependent spatial structures. The deviance information criterion is used to select a best-fitting model within the Bayesian frame work. The results from our study indicate that age, race, stage, and geographical distribution are significant in evaluating PrCA survival. 相似文献

17.

Heteroscedasticity and/or autocorrelation diagnostics in nonlinear models with AR(1) and symmetrical errors

Chun-Zheng Cao Jin-Guan Lin Li-Xing Zhu 《Statistical Papers》2010,51(4):813-836

In this paper, we discuss tests of heteroscedasticity and/or autocorrelation in nonlinear models with AR(1) and symmetrical errors. The symmetrical errors distribution class includes all symmetrical continuous distributions, such as normal, Student-t, power exponential, logistic I and II, contaminated normal, so on. First, score test statistics and their adjustment forms of heteroscedasticity are derived. Then, the asymptotic properties, including asymptotic chi-square and approximate powers under local alternatives of the score tests, are studied. The properties of test statistics are investigated through Monte Carlo simulations. Finally, a real data set is used to illustrate our test methods. 相似文献

18.

Goodness—of—fit for the two—parameter weibull distribution with estimated parameters

《Journal of Statistical Computation and Simulation》2012,82(2-3):133-143

Goodness—of—fit statistics based on the empirical distribution function (EDF) are not distribution—free when parameters for the hypothesized distribution are estimated. Tables are percentile values of several EDF statistics are available for the two—parameter Weibull distribution when parameters are estimated by maximum likelihood. To determine how these tabled values change when simpler estimators are employed, percentile scores for EDF goodness—of—fit tests were obtained by Monte—Carlo simulation for maximum likelihood estimators (MLEs), good linear unbiased estimators (GLUEs), and modified Cramer—von Mises, Anderson—Darling, and Watson statistics are presented for GLUEs for both complete and censored samples. Critical values for Kolmogorov—Smirnov statistics were less affected by the method of estimation than were closer for MLEs and MGLUEs than for MGLUEs and GLUEs. On the other hand, MGLUE and GLUE results were much more similar to each other than to the MLE results when censoring was light and sample sizes were large. 相似文献

19.

Modified Baumgartner statistics for the two-sample and multisample problems: a numerical comparison

《Journal of Statistical Computation and Simulation》2012,82(5):711-728

Various non-parametric rank tests based on the Baumgartner statistic have been proposed for testing the location, scale and location–scale parameters. The modified Baumgartner statistics are not suitable for the scale shifts for a two-sample problem. Two modified Baumgartner statistics are proposed by changing the weight function. The suggested statistics are extended to the multisample problem. Some exact critical values of the suggested test statistics are evaluated. Simulations are used to investigate the power of the modified Baumgartner statistics. 相似文献

20.

Divergence statistics: sampling properties and multinomial goodness of fit and divergence tests

K. Zografos K. Ferentinos T. Papaioannou 《统计学通讯:理论与方法》2013,42(5):1785-1802

φ-divergence .statistics are obtained by either replacing both distributions involved in the argument of the φ -divergence measure by their sample estimates or replacing one distribution and considering the other as given. The sampling properties of estimated divergence-type measures are investigated. Approximate means and variances are derived and asymptotic distributions are obtained. Tests of goodness of fit of observed frequencies to expected ones and tests of equality of divergences based on two or more multinomial samples are constructed. 相似文献