期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Improved two-parameter estimators for the negative binomial and Poisson regression models

Merve Kandemir Çetinkaya Selahattin Kaçıranlar 《Journal of Statistical Computation and Simulation》2019,89(14):2645-2660

Negative binomial regression (NBR) and Poisson regression (PR) applications have become very popular in the analysis of count data in recent years. However, if there is a high degree of relationship between the independent variables, the problem of multicollinearity arises in these models. We introduce new two-parameter estimators (TPEs) for the NBR and the PR models by unifying the two-parameter estimator (TPE) of Özkale and Kaç?ranlar [The restricted and unrestricted two-parameter estimators. Commun Stat Theory Methods. 2007;36:2707–2725]. These new estimators are general estimators which include maximum likelihood (ML) estimator, ridge estimator (RE), Liu estimator (LE) and contraction estimator (CE) as special cases. Furthermore, biasing parameters of these estimators are given and a Monte Carlo simulation is done to evaluate the performance of these estimators using mean square error (MSE) criterion. The benefits of the new TPEs are also illustrated in an empirical application. The results show that the new proposed TPEs for the NBR and the PR models are better than the ML estimator, the RE and the LE. 相似文献

2.

Discriminating between the generalized Rayleigh and Weibull distributions 总被引：1，自引：0，他引：1

Mohammad Z. Raqab 《Journal of applied statistics》2013,40(7):1480-1493

Generalized Rayleigh (GR) and Weibull (WE) distributions are used quite effectively for analysing skewed lifetime data. In this paper, we consider the problem of selecting either GR or WE distribution as a more appropriate fitting model for a given data set. We use the ratio of maximized likelihoods (RML) for discriminating between the two distributions. The asymptotic and simulated distributions of the logarithm of the RML are applied to determine the probability of correctly selecting between these two families of distributions. It is examined numerically that the asymptotic results work quite well even for small sample sizes. A real data set involving the annual rainfall recorded at Los Angeles Civic Center during 25 years is analysed to illustrate the procedures developed here. 相似文献

3.

Discriminating between the log-normal and generalized exponential distributions

《Journal of statistical planning and inference》2005,127(1-2):213-227

The two-parameter generalized exponential distribution was recently introduced by Gupta and Kundu (Austral. New Zealand J. Statist. 40 (1999) 173). It is observed that the Generalized Exponential distribution can be used quite effectively to analyze skewed data set as an alternative to the more popular log-normal distribution. In this paper, we use the ratio of the maximized likelihoods in choosing between the log-normal and generalized exponential distributions. We obtain asymptotic distributions of the logarithm of the ratio of the maximized likelihoods and use them to determine the required sample size to discriminate between the two distributions for a user specified probability of correct selection and tolerance limit. 相似文献

4.

Bivariate negative binomial regression model with excess zeros and right censoring: an application to Indonesian data

Seyed Ehsan Saffari John Carson Allen 《Journal of applied statistics》2020,47(10):1901

We propose a bivariate hurdle negative binomial (BHNB) regression model with right censoring to model correlated bivariate count data with excess zeros and few extreme observations. The parameters of the BHNB regression model are obtained using maximum likelihood with conjugate gradient optimization. The proposed model is applied to actual survey data where the bivariate outcome is number of days missed from primary activities and number of days spent in bed due to illness during the 4-week period preceding the inquiry date. We compared the right censored BHNB model to the right censored bivariate negative binomial (BNB) model. A simulation study is conducted to discuss some properties of the BHNB model. Our proposed model demonstrated superior performance in goodness-of-fit of estimated frequencies.KEYWORDS: Zero inflation, over-dispersion, parameter estimation, model selection, right censoring 相似文献

5.

Discriminating between the generalized Rayleigh and Weibull distributions: Some comparative studies

Murad A. Ahmad Debasis Kundu 《统计学通讯:模拟与计算》2017,46(6):4880-4895

The generalized Rayleigh distribution was introduced and studied quite effectively in the literature. The closeness and separation between the distributions are extremely important for analyzing any lifetime data. In this spirit, both the generalized Rayleigh and Weibull distributions can be used for analyzing skewed datasets. In this article, we compare these two distributions based on the Fisher information measures and use it for discrimination purposes. It is evident that the Fisher information measures play an important role in separating between the distributions. The total information measures and the variances of the different percentile estimators are computed and presented. A real life dataset is analyzed for illustration purposes and a numerical comparison study is performed to assess our procedures in separating between these two distributions. 相似文献

6.

A note on LeCam’s bound for the distance between the Poisson binomial and the Poisson distribution

Michael Weba 《Statistical Papers》2002,43(3):445-452

n possibly different success probabilities p ₁, p ₂, ..., p _n is frequently approximated by a Poisson distribution with parameter λ = p ₁ + p ₂ + ... + p _n. LeCam's bound p ² ₁ + p ² ₂ + ... + p _n ² for the total variation distance between both distributions is particularly useful provided the success probabilities are small. The paper presents an improved version of LeCam's bound if a generalized d-dimensional Poisson binomial distribution is to be approximated by a compound Poisson distribution. Received: May 10, 2000; revised version: January 15, 2001 相似文献

7.

Discriminating between the Weibull and log-normal distributions for Type-II censored data

Arabin Kumar Dey 《Statistics》2013,47(2):197-214

Log-normal and Weibull distributions are the two most popular distributions for analysing lifetime data. In this paper, we consider the problem of discriminating between the two distribution functions. It is assumed that the data are coming either from log-normal or Weibull distributions and that they are Type-II censored. We use the difference of the maximized log-likelihood functions, in discriminating between the two distribution functions. We obtain the asymptotic distribution of the discrimination statistic. It is used to determine the probability of correct selection in this discrimination process. We perform some simulation studies to observe how the asymptotic results work for different sample sizes and for different censoring proportions. It is observed that the asymptotic results work quite well even for small sizes if the censoring proportions are not very low. We further suggest a modified discrimination procedure. Two real data sets are analysed for illustrative purposes. 相似文献

8.

An empirical approach to determine a threshold for assessing overdispersion in Poisson and negative binomial models for count data

Elizabeth H. Payne Mulugeta Gebregziabher James W. Hardin Viswanathan Ramakrishnan Leonard E. Egede 《统计学通讯:模拟与计算》2018,47(6):1722-1738

Overdispersion is a problem encountered in the analysis of count data that can lead to invalid inference if unaddressed. Decision about whether data are overdispersed is often reached by checking whether the ratio of the Pearson chi-square statistic to its degrees of freedom is greater than one; however, there is currently no fixed threshold for declaring the need for statistical intervention. We consider simulated cross-sectional and longitudinal datasets containing varying magnitudes of overdispersion caused by outliers or zero inflation, as well as real datasets, to determine an appropriate threshold value of this statistic which indicates when overdispersion should be addressed. 相似文献

9.

Developing a Liu estimator for the negative binomial regression model: method and application

Kristofer Månsson 《Journal of Statistical Computation and Simulation》2013,83(9):1773-1780

This paper introduces a new shrinkage estimator for the negative binomial regression model that is a generalization of the estimator proposed for the linear regression model by Liu [A new class of biased estimate in linear regression, Comm. Stat. Theor. Meth. 22 (1993), pp. 393–402]. This shrinkage estimator is proposed in order to solve the problem of an inflated mean squared error of the classical maximum likelihood (ML) method in the presence of multicollinearity. Furthermore, the paper presents some methods of estimating the shrinkage parameter. By means of Monte Carlo simulations, it is shown that if the Liu estimator is applied with these shrinkage parameters, it always outperforms ML. The benefit of the new estimation method is also illustrated in an empirical application. Finally, based on the results from the simulation study and the empirical application, a recommendation regarding which estimator of the shrinkage parameter that should be used is given. 相似文献

10.

The analysis of incontinence episodes and other count data in patients with overactive bladder by Poisson and negative binomial regression

下载免费PDF全文

R. Martina R. Kay R. van Maanen A. Ridder 《Pharmaceutical statistics》2015,14(2):151-160

Clinical studies in overactive bladder have traditionally used analysis of covariance or nonparametric methods to analyse the number of incontinence episodes and other count data. It is known that if the underlying distributional assumptions of a particular parametric method do not hold, an alternative parametric method may be more efficient than a nonparametric one, which makes no assumptions regarding the underlying distribution of the data. Therefore, there are advantages in using methods based on the Poisson distribution or extensions of that method, which incorporate specific features that provide a modelling framework for count data. One challenge with count data is overdispersion, but methods are available that can account for this through the introduction of random effect terms in the modelling, and it is this modelling framework that leads to the negative binomial distribution. These models can also provide clinicians with a clearer and more appropriate interpretation of treatment effects in terms of rate ratios. In this paper, the previously used parametric and non‐parametric approaches are contrasted with those based on Poisson regression and various extensions in trials evaluating solifenacin and mirabegron in patients with overactive bladder. In these applications, negative binomial models are seen to fit the data well. Copyright © 2014 John Wiley & Sons, Ltd. 相似文献

11.

On the recoding of continuous and bounded indexes to a binomial form: an application to quality-of-life scores

Inmaculada Arostegui Vicente Núñez-Antón José M. Quintana 《Journal of applied statistics》2013,40(3):563-582

The statistical analysis of patient-reported outcomes (PROs) as endpoints has shown to be of great practical relevance. The resulting scores or indexes from the questionnaires used to measure PROs could be treated as continuous or ordinal. The goal of this study is to propose and evaluate a recoding process of the scores, so that they can be treated as binomial outcomes and, therefore, analyzed using logistic regression with random effects. The general methodology of recoding is based on the observable values of the scores. In order to obtain an optimal recoding, the evaluation of the recoding method is tested for different values of the parameters of the binomial distribution and different probability distributions of the random effects. We illustrate, evaluate and validate the proposed method of recoding with the Short Form-36 (SF-36) Survey and real data. The optimal recoding approach is very useful and flexible. Moreover, it has a natural interpretation, not only for ordinal scores, but also for questionnaires with many dimensions and different profiles, where a common method of analysis is desired, such as the SF-36. 相似文献

12.

Modelling the evolution of distributions: an application to Major League baseball

Gary Koop 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2004,167(4):639-655

Summary. We develop Bayesian techniques for modelling the evolution of entire distributions over time and apply them to the distribution of team performance in Major League baseball for the period 1901–2000. Such models offer insight into many key issues (e.g. competitive balance) in a way that regression-based models cannot. The models involve discretizing the distribution and then modelling the evolution of the bins over time through transition probability matrices. We allow for these matrices to vary over time and across teams. We find that, with one exception, the transition probability matrices (and, hence, competitive balance) have been remarkably constant across time and over teams. The one exception is the Yankees, who have outperformed all other teams. 相似文献

13.

Empirical Bayes estimates of finite mixture of negative binomial regression models and its application to highway safety

Yajie Zou John E. Ash Dominique Lord Lingtao Wu 《Journal of applied statistics》2018,45(9):1652-1669

The empirical Bayes (EB) method is commonly used by transportation safety analysts for conducting different types of safety analyses, such as before–after studies and hotspot analyses. To date, most implementations of the EB method have been applied using a negative binomial (NB) model, as it can easily accommodate the overdispersion commonly observed in crash data. Recent studies have shown that a generalized finite mixture of NB models with K mixture components (GFMNB-K) can also be used to model crash data subjected to overdispersion and generally offers better statistical performance than the traditional NB model. So far, nobody has developed how the EB method could be used with finite mixtures of NB models. The main objective of this study is therefore to use a GFMNB-K model in the calculation of EB estimates. Specifically, GFMNB-K models with varying weight parameters are developed to analyze crash data from Indiana and Texas. The main finding shows that the rankings produced by the NB and GFMNB-2 models for hotspot identification are often quite different, and this was especially noticeable with the Texas dataset. Finally, a simulation study designed to examine which model formulation can better identify the hotspot is recommended as our future research. 相似文献

14.

Expectation identity for the binomial distribution and its application in the calculations of high-order binomial moments

Ying-Ying Zhang Teng-Zhong Rong Man-Man Li 《统计学通讯:理论与方法》2013,42(22):5467-5476

ABSTRACT

For the exponential families normal, gamma, beta, Poisson, and negative binomial, there exists an expectation identity for each of the family. For the binomial family, we discover an expectation identity, which is useful in analytical calculations of its high-order moments. 相似文献

15.

Minimum chi-squared estimation of stable distributions parameters: an application to the Warsaw Stock Exchange

Zbigniew Kominek 《Journal of applied statistics》2002,29(5):729-744

This paper derives an application of the minimum chi-squared (MCS) methodology to estimate the parameters of the unimodal symmetric stable distribution. The proposed method is especially suitable for large, both regular and non-standard, data sets. Monte Carlo simulations are performed to compare the efficiency of the MCS estimation with the efficiency of the McCulloch quantile algorithm. In the case of grouped observations, evidence in favour of the MCS method is reported. For the ungrouped data the MCS estimation generally performs better than McCulloch's quantile method for samples larger than 400 observations and for high alphas. The relative advantage of the MCS over the McCulloch estimators increases for larger samples. The empirical example analyses the highly irregular distributions of returns on the selected securities from the Warsaw Stock Exchange. The quantile and maximum likelihood estimates of characteristic exponents are generally smaller than the MCS ones. This reflects the bias in the traditional methods, which is due to a lack of adjustment for censored and clustered observations, and shows the flexibility of the proposed MCS approach. 相似文献

16.

Bayesian robustness of the compound Poisson distribution under bidimensional prior: an application to the collective risk model

Agustín Hernández Bastida José María Pérez Sánchez 《Journal of applied statistics》2009,36(8):853-869

The distribution of the aggregate claims in one year plays an important role in Actuarial Statistics for computing, for example, insurance premiums when both the number and size of the claims must be implemented into the model. When the number of claims follows a Poisson distribution the aggregated distribution is called the compound Poisson distribution. In this article we assume that the claim size follows an exponential distribution and later we make an extensive study of this model by assuming a bidimensional prior distribution for the parameters of the Poisson and exponential distribution with marginal gamma. This study carries us to obtain expressions for net premiums, marginal and posterior distributions in terms of some well-known special functions used in statistics. Later, a Bayesian robustness study of this model is made. Bayesian robustness on bidimensional models was deeply treated in the 1990s, producing numerous results, but few applications dealing with this problem can be found in the literature. 相似文献

17.

Longitudinal Poisson modeling: an application for CD4 counting in HIV-infected patients

Emílio Augusto Coelho-Barros Jorge Alberto Achcar Josmar Mazucheli 《Journal of applied statistics》2010,37(5):865-880

In this paper, we present different “frailty” models to analyze longitudinal data in the presence of covariates. These models incorporate the extra-Poisson variability and the possible correlation among the repeated counting data for each individual. Assuming a CD4 counting data set in HIV-infected patients, we develop a hierarchical Bayesian analysis considering the different proposed models and using Markov Chain Monte Carlo methods. We also discuss some Bayesian discrimination aspects for the choice of the best model. 相似文献

18.

Conditionally gaussian distributions and an application to kalman filtering with stochastic regressors

Heikki Ruskeepä 《统计学通讯:理论与方法》2013,42(12):2919-2942

The work reviews theory of conditionally Gaussian distributions, especially so called theorems on normal correlation. Three theorems are given: the basic, the recursive, and the conditional theorem on normal correlation. They assume that (a,y), (a,x,y), or (a,y,z) has a Gaussian distribution, ussert that (a,y), (a,x,y), and (a,y,z), respectively, are Gaussian, and give formulas for the corresponding conditional mean vectors and variance covariance matrices. A proof is presented for the recursive and the conditional theorem. 相似文献

19.

Restricted ridge estimator in generalized linear models: Monte Carlo simulation studies on Poisson and binomial distributed responses

Fikriye Kurtoğlu M. Revan Özkale 《统计学通讯:模拟与计算》2019,48(4):1191-1218

It is known that collinearity among the explanatory variables in generalized linear models (GLMs) inflates the variance of maximum likelihood estimators. To overcome multicollinearity in GLMs, ordinary ridge estimator and restricted estimator were proposed. In this study, a restricted ridge estimator is introduced by unifying the ordinary ridge estimator and the restricted estimator in GLMs and its mean squared error (MSE) properties are discussed. The MSE comparisons are done in the context of first-order approximated estimators. The results are illustrated by a numerical example and two simulation studies are conducted with Poisson and binomial responses. 相似文献

20.

Modified ridge-type for the Poisson regression model: simulation and application

Adewale F. Lukman Benedicta Aladeitan Kayode Ayinde Mohamed R. Abonazel 《Journal of applied statistics》2022,49(8):2124

The Poisson regression model (PRM) is employed in modelling the relationship between a count variable (y) and one or more explanatory variables. The parameters of PRM are popularly estimated using the Poisson maximum likelihood estimator (PMLE). There is a tendency that the explanatory variables grow together, which results in the problem of multicollinearity. The variance of the PMLE becomes inflated in the presence of multicollinearity. The Poisson ridge regression (PRRE) and Liu estimator (PLE) have been suggested as an alternative to the PMLE. However, in this study, we propose a new estimator to estimate the regression coefficients for the PRM when multicollinearity is a challenge. We perform a simulation study under different specifications to assess the performance of the new estimator and the existing ones. The performance was evaluated using the scalar mean square error criterion and the mean squared error prediction error. The aircraft damage data was adopted for the application study and the estimators’ performance judged by the SMSE and the mean squared prediction error. The theoretical comparison shows that the proposed estimator outperforms other estimators. This is further supported by the simulation study and the application result.KEYWORDS: Poisson regression model, Poisson maximum likelihood estimator, multicollinearity, Poisson ridge regression, Liu estimator, simulation 相似文献