首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
This paper is concerned with semiparametric discrete kernel estimators when the unknown count distribution can be considered to have a general weighted Poisson form. The estimator is constructed by multiplying the Poisson estimate with a nonparametric discrete kernel-type estimate of the Poisson weight function. Comparisons are then carried out with the ordinary discrete kernel probability mass function estimators. The Poisson weight function is thus a local multiplicative correction factor, and is considered as the uniform measure to detect departures from the equidispersed Poisson distribution. In this way, the effects of dispersion and zero-proportion with respect to the standard Poisson distribution are also minimized. This method of estimation is also applied to the weighted binomial form for the count distribution having a finite support. The proposed estimators, in addition to being simple, easy-to-implement and effective, also outperform the competing nonparametric and parametric estimators in finite-sample situations. Two examples illustrate this new semiparametric estimation.  相似文献   

2.
Methods for analyzing and modeling count data time series are used in various fields of practice, and they are particularly relevant for applications in finance and economy. We consider the binomial autoregressive (AR(1)) model for count data processes with a first-order AR dependence structure and a binomial marginal distribution. We present four approaches for estimating its model parameters based on given time series data, and we derive expressions for the asymptotic distribution of these estimators. Then we investigate the finite-sample performance of the estimators and of the respective asymptotic approximations in a simulation study, including a discussion of the 2-block jackknife. We illustrate our methods and findings with a real-data example about transactions at the Korea stock market. We conclude with an application of our results for obtaining reliable estimates for process capability indices.  相似文献   

3.
The negative binomial distribution offers an alternative view to the binomial distribution for modeling count data. This alternative view is particularly useful when the probability of success is very small, because, unlike the fixed sampling scheme of the binomial distribution, the inverse sampling approach allows one to collect enough data in order to adequately estimate the proportion of success. However, despite work that has been done on the joint estimation of two binomial proportions from independent samples, there is little, if any, similar work for negative binomial proportions. In this paper, we construct and investigate three confidence regions for two negative binomial proportions based on three statistics: the Wald (W), score (S) and likelihood ratio (LR) statistics. For large-to-moderate sample sizes, this paper finds that all three regions have good coverage properties, with comparable average areas for large sample sizes but with the S method producing the smaller regions for moderate sample sizes. In the small sample case, the LR method has good coverage properties, but often at the expense of comparatively larger areas. Finally, we apply these three regions to some real data for the joint estimation of liver damage rates in patients taking one of two drugs.  相似文献   

4.
Chain binomial models axe commonly used to model the spread of an epidemic through a population. This paper shows that for a flexible class of chain binomial models an approximate maximum likelihood estimator of the infection rate, derived from a Poisson approximation to the binomial distribution, has an asymptotically normal distribution, as do some other related'estimators.  相似文献   

5.

This paper is concerned with properties (bias, standard deviation, mean square error and efficiency) of twenty six estimators of the intraclass correlation in the analysis of binary data. Our main interest is to study these properties when data are generated from different distributions. For data generation we considered three over-dispersed binomial distributions, namely, the beta-binomial distribution, the probit normal binomial distribution and a mixture of two binomial distributions. The findings regarding bias, standard deviation and mean squared error of all these estimators, are that (a) in general, the distributions of biases of most of the estimators are negatively skewed. The biases are smallest when data are generated from the beta-binomial distribution and largest when data are generated from the mixture distribution; (b) the standard deviations are smallest when data are generated from the beta-binomial distribution; and (c) the mean squared errors are smallest when data are generated from the beta-binomial distribution and largest when data are generated from the mixture distribution. Of the 26, nine estimators including the maximum likelihood estimator, an estimator based on the optimal quadratic estimating equations of Crowder (1987), and an analysis of variance type estimator is found to have least amount of bias, standard deviation and mean squared error. Also, the distributions of the bias, standard deviation and mean squared error for each of these estimators are, in general, more symmetric than those of the other estimators. Our findings regarding efficiency are that the estimator based on the optimal quadratic estimating equations has consistently high efficiency and least variability in the efficiency results. In the important range in which the intraclass correlation is small (≤0 5), on the average, this estimator shows best efficiency performance. The analysis of variance type estimator seems to do well for larger values of the intraclass correlation. In general, the estimator based on the optimal quadratic estimating equations seems to show best efficiency performance for data from the beta-binomial distribution and the probit normal binomial distribution, and the analysis of variance type estimator seems to do well for data from the mixture distribution.  相似文献   

6.
In certain applications involving discrete data, it is sometimes found that X = 0 is observed with a frequency significantly higher than predicted by the assumed model. Zero inflated Poisson, binomial and negative binomial models have been employed in some clinical trials and in some regression analysis problems.

In this paper, we study the zero inflated modified power series distributions (IMPSD) which include among others the generalized Poisson and the generalized negative binomial distributions and hence the Poisson, binomial and negative binomial distributions. The structural properties along with the distribution of the sum of independent IMPSD variables are studied. The maximum likelihood estimation of the parameters of the model is examined and the variance-covariance matrix of the estimators is obtained. Finally, examples are presented for the generalized Poisson distribution to illustrate the results.  相似文献   

7.
This paper concerns the calculation of Bayes estimators of ratios of outcome proportions generated by the replication of an arbitrary tree-structured compound Bernoulli experiment under a multinomial-type sampling scheme. Here the compound Bernoulli experiment is treated as a collection of linear sequences of independent generalized Bernoulli trials having Dirichlet type 1 prior probability distributions. A method of obtaining a closed-form expression of the cumulative distribution function of the ratio of proportions – from its Meijer G-function representation – is described. Bayes point and interval estimators are directly obtained from the properties the distribution function as well as its related probability density function. In addition, the density function is used to derive the probability mass function of the predictive distribution any two associated outcome categories of the experiment – under an inverse multinomial-type sampling scheme. An illustrative numerical example concerning a Bayesian analysis of a simple tree-structured mortality model for medical patients who have suffered an acute myocardial infarction (heart attack) is also included.  相似文献   

8.
Count data analysis techniques have been developed in biological and medical research areas. In particular, zero-inflated versions of parametric count distributions have been used to model excessive zeros that are often present in these assays. The most common count distributions for analyzing such data are Poisson and negative binomial. However, a Poisson distribution can only handle equidispersed data and a negative binomial distribution can only cope with overdispersion. However, a Conway–Maxwell–Poisson (CMP) distribution [4] can handle a wide range of dispersion. We show, with an illustrative data set on next-generation sequencing of maize hybrids, that both underdispersion and overdispersion can be present in genomic data. Furthermore, the maize data set consists of clustered observations and, therefore, we develop inference procedures for a zero-inflated CMP regression that incorporates a cluster-specific random effect term. Unlike the Gaussian models, the underlying likelihood is computationally challenging. We use a numerical approximation via a Gaussian quadrature to circumvent this issue. A test for checking zero-inflation has also been developed in our setting. Finite sample properties of our estimators and test have been investigated by extensive simulations. Finally, the statistical methodology has been applied to analyze the maize data mentioned before.  相似文献   

9.
Clinical studies in overactive bladder have traditionally used analysis of covariance or nonparametric methods to analyse the number of incontinence episodes and other count data. It is known that if the underlying distributional assumptions of a particular parametric method do not hold, an alternative parametric method may be more efficient than a nonparametric one, which makes no assumptions regarding the underlying distribution of the data. Therefore, there are advantages in using methods based on the Poisson distribution or extensions of that method, which incorporate specific features that provide a modelling framework for count data. One challenge with count data is overdispersion, but methods are available that can account for this through the introduction of random effect terms in the modelling, and it is this modelling framework that leads to the negative binomial distribution. These models can also provide clinicians with a clearer and more appropriate interpretation of treatment effects in terms of rate ratios. In this paper, the previously used parametric and non‐parametric approaches are contrasted with those based on Poisson regression and various extensions in trials evaluating solifenacin and mirabegron in patients with overactive bladder. In these applications, negative binomial models are seen to fit the data well. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

10.
In survey sampling and in stereology, it is often desirable to estimate the ratio of means θ= E(Y)/E(X) from bivariate count data (X, Y) with unknown joint distribution. We review methods that are available for this problem, with particular reference to stereological applications. We also develop new methods based on explicit statistical models for the data, and associated model diagnostics. The methods are tested on a stereological dataset. For point‐count data, binomial regression and bivariate binomial models are generally adequate. Intercept‐count data are often overdispersed relative to Poisson regression models, but adequately fitted by negative binomial regression.  相似文献   

11.
In this paper, we investigate four existing and three new confidence interval estimators for the negative binomial proportion (i.e., proportion under inverse/negative binomial sampling). An extensive and systematic comparative study among these confidence interval estimators through Monte Carlo simulations is presented. The performance of these confidence intervals are evaluated in terms of their coverage probabilities and expected interval widths. Our simulation studies suggest that the confidence interval estimator based on saddlepoint approximation is more appealing for large coverage levels (e.g., nominal level≤1% ) whereas the score confidence interval estimator is more desirable for those commonly used coverage levels (e.g., nominal level>1% ). We illustrate these confidence interval construction methods with a real data set from a maternal congenital heart disease study.  相似文献   

12.
We consider the problem of assessing prediction for count time series based on either the Poisson distribution or the negative binomial distribution. By a suitable parametrization we employ both distributions with the same mean. We regress the mean on its past values and the values of the response and after obtaining consistent estimators of the regression parameters, regardless of the response distribution, we employ different criteria to study the prediction problem. We show by simulation and data examples that scoring rules and diagnostic graphs that have been proposed for independent but not identically distributed data can be adapted in the setting of count dependent data.  相似文献   

13.
A new class of Bayesian estimators for a proportion in multistage binomial designs is considered. Priors belong to the beta-J distribution family, which is derived from the Fisher information associated with the design. The transposition of the beta parameters of the Haldane and the uniform priors in fixed binomial experiments into the beta-J distribution yields bias-corrected versions of these priors in multistage designs. We show that the estimator of the posterior mean based on the corrected Haldane prior and the estimator of the posterior mode based on the corrected uniform prior have good frequentist properties. An easy-to-use approximation of the estimator of the posterior mode is provided. The new Bayesian estimators are compared to Whitehead's and the uniformly minimum variance estimators through several multistage designs. Last, the bias of the estimator of the posterior mode is derived for a particular case.  相似文献   

14.
This paper introduces an exchangeable negative binomial distribution resulting from relaxing the independence of the Bernoulli sequence associated with a negative binomial distribution to exchangeability. It is demonstrated that the introduced distribution is a mixture of negative binomial distributions and can be characterized by infinitely many parameters that form a completely monotone sequence. The moments of the distribution are derived and a small simulation is conducted to illustrate the distribution. For data analytic purposes, two methods, truncation and completely-monotone links, are given for converting the saturated distribution of infinitely many parameters to parsimonious distributions of finitely many parameters. A full likelihood procedure is described which can be used to investigate correlated and overdispersed count data common in biomedical sciences and teratology. In the end, the introduced distribution is applied to analyze a real clinical data of burn wounds on patients.  相似文献   

15.
对二项分布比例参数p的似然比置信区间,提出一种简便求解方法。在平均覆盖率、平均区间长度及区间长度的95%置信区间准则下与WScore、Plus4、Jeffreys置信区间进行模拟比较。试验表明,在二项分布b(n,p)的参数n≥20且p∈(0.1,0.9)时,该方法获取的似然比置信区间性能优良。当点估计p值不是接近于0或1且n≥20时,推荐使用本方法获取p的置信区间。  相似文献   

16.
The modelling and analysis of count-data time series are areas of emerging interest with various applications in practice. We consider the particular case of the binomial AR(1) model, which is well suited for describing binomial counts with a first-order autoregressive serial dependence structure. We derive explicit expressions for the joint (central) moments and cumulants up to order 4. Then, we apply these results for expressing moments and asymptotic distribution of the squared difference estimator as an alternative to the sample autocovariance. We also analyse the asymptotic distribution of the conditional least-squares estimators of the parameters of the binomial AR(1) model. The finite-sample performance of these estimators is investigated in a simulation study, and we apply them to real data about computerized workstations.  相似文献   

17.
Biological control of pests is an important branch of entomology, providing environmentally friendly forms of crop protection. Bioassays are used to find the optimal conditions for the production of parasites and strategies for application in the field. In some of these assays, proportions are measured and, often, these data have an inflated number of zeros. In this work, six models will be applied to data sets obtained from biological control assays for Diatraea saccharalis , a common pest in sugar cane production. A natural choice for modelling proportion data is the binomial model. The second model will be an overdispersed version of the binomial model, estimated by a quasi-likelihood method. This model was initially built to model overdispersion generated by individual variability in the probability of success. When interest is only in the positive proportion data, a model can be based on the truncated binomial distribution and in its overdispersed version. The last two models include the zero proportions and are based on a finite mixture model with the binomial distribution or its overdispersed version for the positive data. Here, we will present the models, discuss their estimation and compare the results.  相似文献   

18.
A new generalization of the binomial distribution is introduced that allows dependence between trials, nonconstant probabilities of success from trial to trial, and which contains the usual binomial distribution as a special case. Along with the number of trials and an initial probability of ‘success’, an additional parameter that controls the degree of correlation between trials is introduced. The resulting class of distributions includes the binomial, unirnodal distributions, and bimodal distributions. Formulas for the moments, mean, and variance of this distribution are given along with a method for fitting the distribution to sample data.  相似文献   

19.
Data sets with excess zeroes are frequently analyzed in many disciplines. A common framework used to analyze such data is the zero-inflated (ZI) regression model. It mixes a degenerate distribution with point mass at zero with a non-degenerate distribution. The estimates from ZI models quantify the effects of covariates on the means of latent random variables, which are often not the quantities of primary interest. Recently, marginal zero-inflated Poisson (MZIP; Long et al. [A marginalized zero-inflated Poisson regression model with overall exposure effects. Stat. Med. 33 (2014), pp. 5151–5165]) and negative binomial (MZINB; Preisser et al., 2016) models have been introduced that model the mean response directly. These models yield covariate effects that have simple interpretations that are, for many applications, more appealing than those available from ZI regression. This paper outlines a general framework for marginal zero-inflated models where the latent distribution is a member of the exponential dispersion family, focusing on common distributions for count data. In particular, our discussion includes the marginal zero-inflated binomial (MZIB) model, which has not been discussed previously. The details of maximum likelihood estimation via the EM algorithm are presented and the properties of the estimators as well as Wald and likelihood ratio-based inference are examined via simulation. Two examples presented illustrate the advantages of MZIP, MZINB, and MZIB models for practical data analysis.  相似文献   

20.
ABSTRACT

In this paper, we derive the Bayes estimators of functions of parameters of the size-biased generalized power series distribution under squared error loss function and weighted square error loss function. The results of size-biased GPSD are then used to obtain particular cases of the size-biased negative binomial, size-biased logarithmic series, and size-biased Poisson distributions. These estimators are better than the classical minimum variance unbiased estimators in the sense that they increase the range of the estimation. Finally, an example is provided to illustrate the results and a goodness of fit test is done using the maximum likelihood and Bayes estimators.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号