期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Exact Bayesian modeling for bivariate Poisson data and extensions

Dimitris Karlis Panagiotis Tsiamyrtzis 《Statistics and Computing》2008,18(1):27-40

Bivariate count data arise in several different disciplines (epidemiology, marketing, sports statistics just to name a few) and the bivariate Poisson distribution being a generalization of the Poisson distribution plays an important role in modelling such data. In the present paper we present a Bayesian estimation approach for the parameters of the bivariate Poisson model and provide the posterior distributions in closed forms. It is shown that the joint posterior distributions are finite mixtures of conditionally independent gamma distributions for which their full form can be easily deduced by a recursively updating scheme. Thus, the need of applying computationally demanding MCMC schemes for Bayesian inference in such models will be removed, since direct sampling from the posterior will become available, even in cases where the posterior distribution of functions of the parameters is not available in closed form. In addition, we define a class of prior distributions that possess an interesting conjugacy property which extends the typical notion of conjugacy, in the sense that both prior and posteriors belong to the same family of finite mixture models but with different number of components. Extension to certain other models including multivariate models or models with other marginal distributions are discussed. 相似文献

2.

A mixture model with Poisson and zero-truncated Poisson components to analyze road traffic accidents in Turkey

Hande Konuk Ünlü Derek S. Young Ayten Yiiter L. Hilal zcebe 《Journal of applied statistics》2022,49(4):1003

The analysis of traffic accident data is crucial to address numerous concerns, such as understanding contributing factors in an accident''s chain-of-events, identifying hotspots, and informing policy decisions about road safety management. The majority of statistical models employed for analyzing traffic accident data are logically count regression models (commonly Poisson regression) since a count – like the number of accidents – is used as the response. However, features of the observed data frequently do not make the Poisson distribution a tenable assumption. For example, observed data rarely demonstrate an equal mean and variance and often times possess excess zeros. Sometimes, data may have heterogeneous structure consisting of a mixture of populations, rather than a single population. In such data analyses, mixtures-of-Poisson-regression models can be used. In this study, the number of injuries resulting from casualties of traffic accidents registered by the General Directorate of Security (Turkey, 2005–2014) are modeled using a novel mixture distribution with two components: a Poisson and zero-truncated-Poisson distribution. Such a model differs from existing mixture models in literature where the components are either all Poisson distributions or all zero-truncated Poisson distributions. The proposed model is compared with the Poisson regression model via simulation and in the analysis of the traffic data. 相似文献

3.

Finite mixtures of multivariate Poisson distributions with application

Dimitris Karlis Loukia Meligkotsidou 《Journal of statistical planning and inference》2007

In the present paper we examine finite mixtures of multivariate Poisson distributions as an alternative class of models for multivariate count data. The proposed models allow for both overdispersion in the marginal distributions and negative correlation, while they are computationally tractable using standard ideas from finite mixture modelling. An EM type algorithm for maximum likelihood (ML) estimation of the parameters is developed. The identifiability of this class of mixtures is proved. Properties of ML estimators are derived. A real data application concerning model based clustering for multivariate count data related to different types of crime is presented to illustrate the practical potential of the proposed class of models. 相似文献

4.

Marginal zero-inflated regression models for count data

Jacob Martin Daniel B. Hall 《Journal of applied statistics》2017,44(10):1807-1826

Data sets with excess zeroes are frequently analyzed in many disciplines. A common framework used to analyze such data is the zero-inflated (ZI) regression model. It mixes a degenerate distribution with point mass at zero with a non-degenerate distribution. The estimates from ZI models quantify the effects of covariates on the means of latent random variables, which are often not the quantities of primary interest. Recently, marginal zero-inflated Poisson (MZIP; Long et al. [A marginalized zero-inflated Poisson regression model with overall exposure effects. Stat. Med. 33 (2014), pp. 5151–5165]) and negative binomial (MZINB; Preisser et al., 2016) models have been introduced that model the mean response directly. These models yield covariate effects that have simple interpretations that are, for many applications, more appealing than those available from ZI regression. This paper outlines a general framework for marginal zero-inflated models where the latent distribution is a member of the exponential dispersion family, focusing on common distributions for count data. In particular, our discussion includes the marginal zero-inflated binomial (MZIB) model, which has not been discussed previously. The details of maximum likelihood estimation via the EM algorithm are presented and the properties of the estimators as well as Wald and likelihood ratio-based inference are examined via simulation. Two examples presented illustrate the advantages of MZIP, MZINB, and MZIB models for practical data analysis. 相似文献

5.

Modified Sarhan–Balakrishnan singular bivariate distribution

Debasis Kundu Rameshwar D. Gupta 《Journal of statistical planning and inference》2010

Recently Sarhan and Balakrishnan [2007. A new class of bivariate distribution and its mixture. Journal of Multivariate Analysis 98, 1508–1527] introduced a new bivariate distribution using generalized exponential and exponential distributions. They discussed several interesting properties of this new distribution. Unfortunately, they did not discuss any estimation procedure of the unknown parameters. In this paper using the similar idea as of Sarhan and Balakrishnan [2007. A new class of bivariate distribution and its mixture. Journal of Multivariate Analysis 98, 1508–1527], we have proposed a singular bivariate distribution, which has an extra shape parameter. It is observed that the marginal distributions of the proposed bivariate distribution are more flexible than the corresponding marginal distributions of the Marshall–Olkin bivariate exponential distribution, Sarhan–Balakrishnan's bivariate distribution or the bivariate generalized exponential distribution. Different properties of this new distribution have been discussed. We provide the maximum likelihood estimators of the unknown parameters using EM algorithm. We reported some simulation results and performed two data analysis for illustrative purposes. Finally we propose some generalizations of this bivariate model. 相似文献

6.

A new aging concept derived from the increasing convex ordering

T. H. M. Abouelmagd A. A. E. Ahmed Enayat M. Abd Elrazik Mahmoud M. Mansour A-Hadi N. Ahmed 《统计学通讯:理论与方法》2019,48(12):2904-2916

In this paper, a comparison between the life distribution of a new unit with that of a used unit in the increasing convex order is made leading to a new class of life distributions which we call “new better than used in convex ordering of second order”. This class includes as subclasses the NBU and the NBUC and is a subclass of the NBUCA class. Preservation properties under convolution, random maxima, mixing and formation of coherent structures are established. Stochastic comparisons of the excess lifetime when the inter-arrival times belong to the NBUC(2) class are developed. Some applications of Poisson shock models and a test of exponentiality against NBUC(2) alternative are presented. 相似文献

7.

BIVARIATE DISTRIBUTION WITH TRUNCATED POISSON MARGINAL DISTRIBUTIONS

《统计学通讯:理论与方法》2013,42(4):527-534

ABSTRACT

A bivariate distribution, whose marginal distributions are truncated Poisson distributions, is developed as a product of truncated Poisson distributions and a multiplicative factor. The multiplicative factor takes into account the correlation, either positive or negative, between the two random variables. The distributional properties of this model are studied and the model is fitted to a real life bivariate data. 相似文献

8.

Multivariate Poisson hidden Markov models with a case study of modelling seismicity

下载免费PDF全文

K. Orfanogiannaki D. Karlis 《Australian & New Zealand Journal of Statistics》2018,60(3):301-322

While the literature on multivariate models for continuous data flourishes, there is a lack of models for multivariate counts. We aim to contribute to this framework by extending the well known class of univariate hidden Markov models to the multidimensional case, by introducing multivariate Poisson hidden Markov models. Each state of the extended model is associated with a different multivariate discrete distribution. We consider different distributions with Poisson marginals, starting from the multivariate Poisson distribution and then extending to copula based distributions to allow flexible dependence structures. An EM type algorithm is developed for maximum likelihood estimation. A real data application is presented to illustrate the usefulness of the proposed models. In particular, we apply the models to the occurrence of strong earthquakes (surface wave magnitude ≥5), in three seismogenic subregions in the broad region of the North Aegean Sea for the time period from 1 January 1981 to 31 December 2008. Earthquakes occurring in one subregion may trigger events in adjacent ones and hence the observed time series of events are cross‐correlated. It is evident from the results that the three subregions interact with each other at times differing by up to a few months. This migration of seismic activity is captured by the model as a transition to a state of higher seismicity. 相似文献

9.

Bivariate survival modeling: a Bayesian approach based on Copulas

Romeo JS Tanaka NI Pedroso-de-Lima AC 《Lifetime data analysis》2006,12(2):205-222

相似文献

10.

Bivariate Hofmann distributions

J. F. Walhin 《Journal of applied statistics》2003,30(9):1033-1046

The aim of this paper is to develop some bivariate generalizations of the Hofmann distribution. The Hofmann distribution is known to give nice fits for overdispersed data sets. Two bivariate models are proposed. Recursive formulae are given for the evaluation of the probability function. Moments, conditional distributions and marginal distributions are studied. Two data sets are fitted based on the proposed models. Parameters are estimated by maximum likelihood. 相似文献

11.

A class of bivariate regression models for discrete and/or continuous responses

Willian Luís de Oliveira Carlos Alberto Ribeiro Diniz Maria Durbán 《统计学通讯:模拟与计算》2013,42(8):2359-2383

ABSTRACT

A general class of models for discrete and/or continuous responses is proposed in which joint distributions are constructed via the conditional approach. It is assumed that the distributions of one response and of the other response given the first one belong to exponential family of distributions. Furthermore, the marginal means are related to the covariates by link functions and a dependency structure between the responses is inserted into the model. Estimation methods, diagnostic analysis and a simulation study considering a Bernoulli-exponential model, a particular case of the class, are presented. Finally, this model is used in a real data set. 相似文献

12.

Construction of bivariate and multivariate weighted distributions via conditioning

Barry C. Arnold Indranil Ghosh Ayman Alzaatreh 《统计学通讯:理论与方法》2017,46(18):8897-8912

Weighted distributions (univariate and bivariate) have received widespread attention over the last two decades because of their flexibility for analyzing skewed data. In this article, we propose an alternative method to construct a new family of bivariate and multivariate weighted distributions. For illustrative purposes, some examples of the proposed method are presented. Several structural properties of the bivariate weighted distributions including marginal distributions together with distributions of the minimum and maximum, evaluation of the reliability parameter, and verification of total positivity of order two are also presented. In addition, we provide some multivariate extensions of the proposed models. A real-life data set is used to show the applicability of these bivariate weighted distributions. 相似文献

13.

Discrete and continuous bivariate lifetime models in presence of cure rate: a comparative study under Bayesian approach

Ricardo Puziol de Oliveira Jorge Alberto Achcar Danielle Peralta Josmar Mazucheli 《Journal of applied statistics》2019,46(3):449-467

The modeling and analysis of lifetime data in which the main endpoints are the times when an event of interest occurs is of great interest in medical studies. In these studies, it is common that two or more lifetimes associated with the same unit such as the times to deterioration levels or the times to reaction to a treatment in pairs of organs like lungs, kidneys, eyes or ears. In medical applications, it is also possible that a cure rate is present and needed to be modeled with lifetime data with long-term survivors. This paper presented a comparative study under a Bayesian approach among some existing continuous and discrete bivariate distributions such as the bivariate exponential distributions and the bivariate geometric distributions in presence of cure rate, censored data and covariates. In presence of lifetimes related to cured patients, it is assumed standard mixture cure rate models in the data analysis. The posterior summaries of interest are obtained using Markov Chain Monte Carlo methods. To illustrate the proposed methodology two real medical data sets are considered. 相似文献

14.

Bivariate zero-inflated negative binomial regression model with applications

Pouya Faroughi 《Journal of Statistical Computation and Simulation》2017,87(3):457-477

Count data often display excessive number of zero outcomes than are expected in the Poisson regression model. The zero-inflated Poisson regression model has been suggested to handle zero-inflated data, whereas the zero-inflated negative binomial (ZINB) regression model has been fitted for zero-inflated data with additional overdispersion. For bivariate and zero-inflated cases, several regression models such as the bivariate zero-inflated Poisson (BZIP) and bivariate zero-inflated negative binomial (BZINB) have been considered. This paper introduces several forms of nested BZINB regression model which can be fitted to bivariate and zero-inflated count data. The mean–variance approach is used for comparing the BZIP and our forms of BZINB regression model in this study. A similar approach was also used by past researchers for defining several negative binomial and zero-inflated negative binomial regression models based on the appearance of linear and quadratic terms of the variance function. The nested BZINB regression models proposed in this study have several advantages; the likelihood ratio tests can be performed for choosing the best model, the models have flexible forms of marginal mean–variance relationship, the models can be fitted to bivariate zero-inflated count data with positive or negative correlations, and the models allow additional overdispersion of the two dependent variables. 相似文献

15.

On the Correlation Structure of Gaussian Copula Models for Geostatistical Count Data

下载免费PDF全文

Zifei Han Victor De Oliveira 《Australian & New Zealand Journal of Statistics》2016,58(1):47-69

We describe a class of random field models for geostatistical count data based on Gaussian copulas. Unlike hierarchical Poisson models often used to describe this type of data, Gaussian copula models allow a more direct modelling of the marginal distributions and association structure of the count data. We study in detail the correlation structure of these random fields when the family of marginal distributions is either negative binomial or zero‐inflated Poisson; these represent two types of overdispersion often encountered in geostatistical count data. We also contrast the correlation structure of one of these Gaussian copula models with that of a hierarchical Poisson model having the same family of marginal distributions, and show that the former is more flexible than the latter in terms of range of feasible correlation, sensitivity to the mean function and modelling of isotropy. An exploratory analysis of a dataset of Japanese beetle larvae counts illustrate some of the findings. All of these investigations show that Gaussian copula models are useful alternatives to hierarchical Poisson models, specially for geostatistical count data that display substantial correlation and small overdispersion. 相似文献

16.

Information geodesics for gamma models of communication clustering

《Journal of Statistical Computation and Simulation》2012,82(1-4):133-146

Recently, Akyildiz called for further work on non-Poisson models for communication arrivals in distributed networks such as cellular phone systems. The basic ‘random’ model for stochastic events is the Poisson process; for events on a line this resuits in an exponential disiribuuon of intervals between events. Network designers and managers need too monotor and quantify call clustering in order to optimize resaurce usage; the natural reference state from which to measure departures is that arising from a Poisson, process of calls. Here we consider gamma distributions, which contain exponential distributions as a special case. The surface representing gamma models has a natural Riemannian information metric and we obtain some geodesic sprays for this metric. The exponential distributions form a 1-dimensional subspace of the 2-dimensional space of all gamma distributions, so we have an isometric embedding of the random model as a subspace of the gamma models. This geometry may provide an appropriate structure on which to represent clustering as quantifiable departures from randomness and on which to impose dynamic control algorithms to optimize traffic at receiving nodes in distributed communication networks. In practice, we may expect correlation between call arrival times and call duration, reflecting for example peaks of different users of internet services. This would give rise to a twisted product of two surfaces with the twisting controlled by the correlation. Though bivariate gamma models do exist, such as Kibble's, none has tractabie information geometry nor sufficiently general marginal gammas,but a simulation method of approach is suggested. 相似文献

17.

Comparisons of some bivariate regression models

《Journal of Statistical Computation and Simulation》2012,82(7):937-949

The bivariate negative binomial regression (BNBR) and the bivariate Poisson log-normal regression (BPLR) models have been used to describe count data that are over-dispersed. In this paper, a new bivariate generalized Poisson regression (BGPR) model is defined. An advantage of the new regression model over the BNBR and BPLR models is that the BGPR can be used to model bivariate count data with either over-dispersion or under-dispersion. In this paper, we carry out a simulation study to compare the three regression models when the true data-generating process exhibits over-dispersion. In the simulation experiment, we observe that the bivariate generalized Poisson regression model performs better than the bivariate negative binomial regression model and the BPLR model. 相似文献

18.

Flexible Bivariate INAR(1) Processes Using Copulas

Dimitris Karlis Xanthi Pedeli 《统计学通讯:理论与方法》2013,42(4):723-740

Multivariate count time series data occur in many different disciplines. The class of INteger-valued AutoRegressive (INAR) processes has the great advantage to consider explicitly both the discreteness and autocorrelation characterizing this type of data. Moreover, extensions of the simple INAR(1) model to the multi-dimensional space make it possible to model more than one series simultaneously. However, existing models do not offer great flexibility for dependence modelling, allowing only for positive correlation. In this work, we consider a bivariate INAR(1) (BINAR(1)) process where cross-correlation is introduced through the use of copulas for the specification of the joint distribution of the innovations. We mainly emphasize on the parametric case that arises under the assumption of Poisson marginals. Other marginal distributions are also considered. A short application on a bivariate financial count series illustrates the model. 相似文献

19.

Dependent models for observations which include angular ones

Shogo Kato Kunio Shimizu 《Journal of statistical planning and inference》2008

This paper discusses some stochastic models for dependence of observations which include angular ones. First, we provide a theorem which constructs four-dimensional distributions with specified bivariate marginals on certain manifolds such as two tori, cylinders or discs. Some properties of the submodel of the proposed models are investigated. The theorem is also applicable to the construction of a related Markov process, models for incomplete observations, and distributions with specified marginals on the disc. Second, two maximum entropy distributions on the cylinder are discussed. The circular marginal of each model is distributed as the generalized von Mises distribution which represents a symmetric or asymmetric, unimodal or bimodal shape. The proposed cylindrical model is applied to two data sets. 相似文献

20.

A bivariate regression model for matched paired survival data: local influence and residual analysis

Gladys D. C. Barriga Francisco Louzada-Neto Edwin M. M. Ortega Vicente G. Cancho 《Statistical Methods and Applications》2010,19(4):477-495

The use of bivariate distributions plays a fundamental role in survival and reliability studies. In this paper, we consider a location scale model for bivariate survival times based on the proposal of a copula to model the dependence of bivariate survival data. For the proposed model, we consider inferential procedures based on maximum likelihood. Gains in efficiency from bivariate models are also examined in the censored data setting. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the bivariate regression model for matched paired survival data. Sensitivity analysis methods such as local and total influence are presented and derived under three perturbation schemes. The martingale marginal and the deviance marginal residual measures are used to check the adequacy of the model. Furthermore, we propose a new measure which we call modified deviance component residual. The methodology in the paper is illustrated on a lifetime data set for kidney patients. 相似文献