期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Quasi-binomial zero-inflated regression model suitable for variables with bounded support

E. Gmez&#x;Dniz D. I. Gallardo H. W. Gmez 《Journal of applied statistics》2020,47(12):2208

In recent years, a variety of regression models, including zero-inflated and hurdle versions, have been proposed to explain the case of a dependent variable with respect to exogenous covariates. Apart from the classical Poisson, negative binomial and generalised Poisson distributions, many proposals have appeared in the statistical literature, perhaps in response to the new possibilities offered by advanced software that now enables researchers to implement numerous special functions in a relatively simple way. However, we believe that a significant research gap remains, since very little attention has been paid to the quasi-binomial distribution, which was first proposed over fifty years ago. We believe this distribution might constitute a valid alternative to existing regression models, in situations in which the variable has bounded support. Therefore, in this paper we present a zero-inflated regression model based on the quasi-binomial distribution, taking into account the moments and maximum likelihood estimators, and perform a score test to compare the zero-inflated quasi-binomial distribution with the zero-inflated binomial distribution, and the zero-inflated model with the homogeneous model (the model in which covariates are not considered). This analysis is illustrated with two data sets that are well known in the statistical literature and which contain a large number of zeros. 相似文献

2.

Testing for departures from nominal dispersion in generalized nonlinear models with varying dispersion and/or additive random effects

《Journal of Statistical Computation and Simulation》2012,82(10):925-939

This paper discusses the tests for departures from nominal dispersion in the framework of generalized nonlinear models with varying dispersion and/or additive random effects. We consider two classes of exponential family distributions. The first is discrete exponential family distributions, such as Poisson, binomial, and negative binomial distributions. The second is continuous exponential family distributions, such as normal, gamma, and inverse Gaussian distributions. Correspondingly, we develop a unifying approach and propose several tests for testing for departures from nominal dispersion in two classes of generalized nonlinear models. The score test statistics are constructed and expressed in simple, easy to use, matrix formulas, so that the tests can easily be implemented using existing statistical software. The properties of test statistics are investigated through Monte Carlo simulations. 相似文献

3.

Score tests for heterogeneity and overdispersion in zero‐inflated Poisson and binomial regression models

Daniel B. Hall Kenneth S. Berenhaut 《Revue canadienne de statistique》2002,30(3):415-430

Hall (2000) has described zero‐inflated Poisson and binomial regression models that include random effects to account for excess zeros and additional sources of heterogeneity in the data. The authors of the present paper propose a general score test for the null hypothesis that variance components associated with these random effects are zero. For a zero‐inflated Poisson model with random intercept, the new test reduces to an alternative to the overdispersion test of Ridout, Demério & Hinde (2001). The authors also examine their general test in the special case of the zero‐inflated binomial model with random intercept and propose an overdispersion test in that context which is based on a beta‐binomial alternative. 相似文献

4.

Analyzing clustered count data with a cluster-specific random effect zero-inflated Conway–Maxwell–Poisson distribution

Hyoyoung Choo-Wosoba Somnath Datta 《Journal of applied statistics》2018,45(5):799-814

Count data analysis techniques have been developed in biological and medical research areas. In particular, zero-inflated versions of parametric count distributions have been used to model excessive zeros that are often present in these assays. The most common count distributions for analyzing such data are Poisson and negative binomial. However, a Poisson distribution can only handle equidispersed data and a negative binomial distribution can only cope with overdispersion. However, a Conway–Maxwell–Poisson (CMP) distribution [4] can handle a wide range of dispersion. We show, with an illustrative data set on next-generation sequencing of maize hybrids, that both underdispersion and overdispersion can be present in genomic data. Furthermore, the maize data set consists of clustered observations and, therefore, we develop inference procedures for a zero-inflated CMP regression that incorporates a cluster-specific random effect term. Unlike the Gaussian models, the underlying likelihood is computationally challenging. We use a numerical approximation via a Gaussian quadrature to circumvent this issue. A test for checking zero-inflation has also been developed in our setting. Finite sample properties of our estimators and test have been investigated by extensive simulations. Finally, the statistical methodology has been applied to analyze the maize data mentioned before. 相似文献

5.

A note on Dean's overdispersion test

Zhao Yang James W. Hardin Cheryl L. Addy 《Journal of statistical planning and inference》2009

This note discusses an extension to the score test statistics for overdispersion in Poisson and binomial regression models [Dean, C.B., 1992. Testing for overdispersion in Poisson and binomial regression models. J. Amer. Statist. Assoc. 87, 451–457]. Examples illustrate the application of the extended results. 相似文献

6.

Comparing two independent incidence rates using conditional and unconditional exact tests

Han C 《Pharmaceutical statistics》2008,7(3):195-201

Several unconditional exact tests, which are constructed to control the Type I error rate at the nominal level, for comparing two independent Poisson rates are proposed and compared to the conditional exact test using a binomial distribution. The unconditional exact test using binomial p-value, likelihood ratio, or efficient score as the test statistic improves the power in general, and are therefore recommended. Unconditional exact tests using Wald statistics, whether on the original or square-root scale, may be substantially less powerful than the conditional exact test, and is not recommended. An example is provided from a cardiovascular trial. 相似文献

7.

The robustness of the likelihood ratio test,the nonparametric sum rank test,and f-ratio tests when the populations are from the negative binomial family

Lawrence I-Kuei Lin Roy L. Sanford 《统计学通讯:模拟与计算》2013,42(5):523-539

In this study, the robustness of power and significance level of several statistical testing methods was evaluated under the assumption that the test populations were from Poisson, negative binomial, or geometric distributions. The F-ratio test, with or without appropriate transformations, was shown to be both safe and robust for all distributions examined. 相似文献

8.

Testing for a finite mixture model with two components

Hanfeng Chen Jiahua Chen John D. Kalbfleisch 《Journal of the Royal Statistical Society. Series B, Statistical methodology》2004,66(1):95-115

Summary. We consider a finite mixture model with k components and a kernel distribution from a general one-parameter family. The problem of testing the hypothesis k =2 versus k 3 is studied. There has been no general statistical testing procedure for this problem. We propose a modified likelihood ratio statistic where under the null and the alternative hypotheses the estimates of the parameters are obtained from a modified likelihood function. It is shown that estimators of the support points are consistent. The asymptotic null distribution of the modified likelihood ratio test proposed is derived and found to be relatively simple and easily applied. Simulation studies for the asymptotic modified likelihood ratio test based on finite mixture models with normal, binomial and Poisson kernels suggest that the test proposed performs well. Simulation studies are also conducted for a bootstrap method with normal kernels. An example involving foetal movement data from a medical study illustrates the testing procedure. 相似文献

9.

A Statistical Reanalysis of the Classical Rutherford's Experiment

V. Voinov E. Voinov 《统计学通讯:模拟与计算》2013,42(1):157-171

Modified chi-squared and some newly developed tests for the Poisson, binomial, and an approximated Feller's distribution are discussed. A reanalysis of the classical Rutherford's experimental data on alpha decay is done. Previous analyses of the data were not correct from the point of view of the theory of statistical testing. Tests used show that the data contradict to both Poisson and binomial distribution and do not contradict to a precise “binomial” approximation of Feller's distribution that takes into account a counter's dead time. This gives a plausible statistically correct confirmation of the well-established exponential law of radioactive decay. 相似文献

10.

A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution

Galit Shmueli Thomas P. Minka Joseph B. Kadane Sharad Borle Peter Boatwright 《Journal of the Royal Statistical Society. Series C, Applied statistics》2005,54(1):127-142

Summary. A useful discrete distribution (the Conway–Maxwell–Poisson distribution) is revived and its statistical and probabilistic properties are introduced and explored. This distribution is a two-parameter extension of the Poisson distribution that generalizes some well-known discrete distributions (Poisson, Bernoulli and geometric). It also leads to the generalization of distributions derived from these discrete distributions (i.e. the binomial and negative binomial distributions). We describe three methods for estimating the parameters of the Conway–Maxwell–Poisson distribution. The first is a fast simple weighted least squares method, which leads to estimates that are sufficiently accurate for practical purposes. The second method, using maximum likelihood, can be used to refine the initial estimates. This method requires iterations and is more computationally intensive. The third estimation method is Bayesian. Using the conjugate prior, the posterior density of the parameters of the Conway–Maxwell–Poisson distribution is easily computed. It is a flexible distribution that can account for overdispersion or underdispersion that is commonly encountered in count data. We also explore two sets of real world data demonstrating the flexibility and elegance of the Conway–Maxwell–Poisson distribution in fitting count data which do not seem to follow the Poisson distribution. 相似文献

11.

Some rememarks on locally most powerful unbiased tests for the detection of mixtures of poisson or binomial distributions

Byung Soo Kim 《统计学通讯:理论与方法》2013,42(11):3733-3741

In t h i s note mixture models are used to represent overdispersion relative to Poisson or binomial distributions. We flnd a sufflclent condition on the mixing distribution underich the detection of mixture departures from the Poisson or binomial adrnits a locally most powerful unbiased test. The conditions specify plynoria: relations between the variance and mean of Le glxing distribution. 相似文献

12.

Pooling means under uncertain prior information with application to discrete distributions

S. E. Ahmed 《Statistics》2013,47(3):265-277

The problem of pooling means is considered based on two samples in presence of the uncertain prior information that these samples are taken from possibly identical populations. Two discrete models, Poisson and binomial are considered in particular. Three estimators, i.e. the unrestricted estimator, shrinkage restricted estimator and estimators based on preliminary test are proposed. Their asymptotic mean squared errors are derived and compared. It is demonstrated via asymptotic results that the range of the parameter space in which shrinkage preliminary test estimator dominates the unrestricted estimator is wider than that of the usual preliminary test estimator. A Monte Carlo study for Poisson model is presented to compare the performance of the estimators for small samples. 相似文献

13.

Zero-inflated and overdispersed: what's one to do?

Suzanne E. Perumean-Chaney Charity Morgan David McDowall Inmaculada Aban 《Journal of Statistical Computation and Simulation》2013,83(9):1671-1683

Zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) models are recommended for handling excessive zeros in count data. For various reasons, researchers may not address zero inflation. This paper helps educate researchers on (1) the importance of accounting for zero inflation and (2) the consequences of misspecifying the statistical model. Using simulations, we found that when the zero inflation in the data was ignored, estimation was poor and statistically significant findings were missed. When overdispersion within the zero-inflated data was ignored, poor estimation and inflated Type I errors resulted. Recommendations on when to use the ZINB and ZIP models are provided. In an illustration using a two-step model selection procedure (likelihood ratio test and the Vuong test), the ZIP model was correctly identified only when the distributions had moderate means and sample sizes and did not correctly identify the ZINB model or the zero inflation in the ZIP and ZINB distributions. 相似文献

14.

Comparisons of some bivariate regression models

《Journal of Statistical Computation and Simulation》2012,82(7):937-949

The bivariate negative binomial regression (BNBR) and the bivariate Poisson log-normal regression (BPLR) models have been used to describe count data that are over-dispersed. In this paper, a new bivariate generalized Poisson regression (BGPR) model is defined. An advantage of the new regression model over the BNBR and BPLR models is that the BGPR can be used to model bivariate count data with either over-dispersion or under-dispersion. In this paper, we carry out a simulation study to compare the three regression models when the true data-generating process exhibits over-dispersion. In the simulation experiment, we observe that the bivariate generalized Poisson regression model performs better than the bivariate negative binomial regression model and the BPLR model. 相似文献

15.

A Bayesian Approach for Zero-Inflated Count Regression Models by Using the Reversible Jump Markov Chain Monte Carlo Method and an Application

İlknur Özmen 《统计学通讯:理论与方法》2013,42(12):2109-2127

In this study, estimation of the parameters of the zero-inflated count regression models and computations of posterior model probabilities of the log-linear models defined for each zero-inflated count regression models are investigated from the Bayesian point of view. In addition, determinations of the most suitable log-linear and regression models are investigated. It is known that zero-inflated count regression models cover zero-inflated Poisson, zero-inflated negative binomial, and zero-inflated generalized Poisson regression models. The classical approach has some problematic points but the Bayesian approach does not have similar flaws. This work points out the reasons for using the Bayesian approach. It also lists advantages and disadvantages of the classical and Bayesian approaches. As an application, a zoological data set, including structural and sampling zeros, is used in the presence of extra zeros. In this work, it is observed that fitting a zero-inflated negative binomial regression model creates no problems at all, even though it is known that fitting a zero-inflated negative binomial regression model is the most problematic procedure in the classical approach. Additionally, it is found that the best fitting model is the log-linear model under the negative binomial regression model, which does not include three-way interactions of factors. 相似文献

16.

Comparison of Shared Frailty Models for Kidney Infection Data under Exponential Power Baseline Distribution

David D. Hanagal Alok D. Dabade 《统计学通讯:理论与方法》2013,42(23):5091-5108

Shared frailty models are often used to model heterogeneity in survival analysis. There are certain assumptions about the baseline distribution and distribution of frailty. In this paper, four shared frailty models with frailty distribution gamma, inverse Gaussian, compound Poisson, and compound negative binomial with exponential power as baseline distribution are proposed. These models are fitted using Markov Chain Monte Carlo methods. These models are illustrated with a real life bivariate survival data set of McGilchrist and Aisbett (1991) related to kidney infection, and the best model is suggested for the data using different model comparison criteria. 相似文献

17.

Testing overdispersion in the zero-inflated Poisson model

Zhao Yang James W. Hardin Cheryl L. Addy 《Journal of statistical planning and inference》2009

The zero-inflated negative binomial (ZINB) model is used to account for commonly occurring overdispersion detected in data that are initially analyzed under the zero-inflated Poisson (ZIP) model. Tests for overdispersion (Wald test, likelihood ratio test [LRT], and score test) based on ZINB model for use in ZIP regression models have been developed. Due to similarity to the ZINB model, we consider the zero-inflated generalized Poisson (ZIGP) model as an alternate model for overdispersed zero-inflated count data. The score test has an advantage over the LRT and the Wald test in that the score test only requires that the parameter of interest be estimated under the null hypothesis. This paper proposes score tests for overdispersion based on the ZIGP model and illustrates that the derived score statistics are exactly the same as the score statistics under the ZINB model. A simulation study indicates the proposed score statistics are preferred to other tests for higher empirical power. In practice, based on the approximate mean–variance relationship in the data, the ZINB or ZIGP model can be considered, and a formal score test based on asymptotic standard normal distribution can be employed for assessing overdispersion in the ZIP model. We provide an example to illustrate the procedures for data analysis. 相似文献

18.

Zero-inflated Poisson and negative binomial integer-valued GARCH models

Fukang Zhu 《Journal of statistical planning and inference》2012,142(4):826-839

Zero inflation means that the proportion of 0's of a model is greater than the proportion of 0's of the corresponding Poisson model, which is a common phenomenon in count data. To model the zero-inflated characteristic of time series of counts, we propose zero-inflated Poisson and negative binomial INGARCH models, which are useful and flexible generalizations of the Poisson and negative binomial INGARCH models, respectively. The stationarity conditions and the autocorrelation function are given. Based on the EM algorithm, the estimating procedure is simple and easy to be implemented. A simulation study shows that the estimation method is accurate and reliable as long as the sample size is reasonably large. A real data example leads to superior performance of the proposed models compared with other competitive models in the literature. 相似文献

19.

Inflated modified power series distributions with applications

Pushpa L. Gupta Ramesh C. Gupta Ram C. Tripathi 《统计学通讯:理论与方法》2013,42(9):2355-2374

In certain applications involving discrete data, it is sometimes found that X = 0 is observed with a frequency significantly higher than predicted by the assumed model. Zero inflated Poisson, binomial and negative binomial models have been employed in some clinical trials and in some regression analysis problems.

In this paper, we study the zero inflated modified power series distributions (IMPSD) which include among others the generalized Poisson and the generalized negative binomial distributions and hence the Poisson, binomial and negative binomial distributions. The structural properties along with the distribution of the sum of independent IMPSD variables are studied. The maximum likelihood estimation of the parameters of the model is examined and the variance-covariance matrix of the estimators is obtained. Finally, examples are presented for the generalized Poisson distribution to illustrate the results. 相似文献

20.

Algebraic Methods for Polynomial Statistical Models

Dinwoodie I. H. 《Statistics and Computing》2002,12(4):307-314

We describe applications of computational algebra to statistical problems of parameter identifiability, sufficiency, and estimation. The methods work for a family of statistical models that includes Poisson and binomial examples in network tomography. 相似文献