首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Dependent multivariate count data occur in several research studies. These data can be modelled by a multivariate Poisson or Negative binomial distribution constructed using copulas. However, when some of the counts are inflated, that is, the number of observations in some cells are much larger than other cells, then the copula-based multivariate Poisson (or Negative binomial) distribution may not fit well and it is not an appropriate statistical model for the data. There is a need to modify or adjust the multivariate distribution to account for the inflated frequencies. In this article, we consider the situation where the frequencies of two cells are higher compared to the other cells and develop a doubly inflated multivariate Poisson distribution function using multivariate Gaussian copula. We also discuss procedures for regression on covariates for the doubly inflated multivariate count data. For illustrating the proposed methodologies, we present real data containing bivariate count observations with inflations in two cells. Several models and linear predictors with log link functions are considered, and we discuss maximum likelihood estimation to estimate unknown parameters of the models.  相似文献   

2.
Copulas and frailty models are important tools to model bivariate survival data. Equivalence between Archimedean copula models and shared frailty models, e.g. between the Clayton-Oakes copula model and the shared gamma frailty model, has often been claimed in the literature. In this note we show that, in both the models, there is indeed a well-known equivalence between the copula functions; the modeling of the marginal survival functions, however, is quite different. The latter fact leads to different joint survival functions.  相似文献   

3.
In this paper we study estimating the joint conditional distributions of multivariate longitudinal outcomes using regression models and copulas. For the estimation of marginal models, we consider a class of time-varying transformation models and combine the two marginal models using nonparametric empirical copulas. Our models and estimation method can be applied in many situations where the conditional mean-based models are not good enough. Empirical copulas combined with time-varying transformation models may allow quite flexible modelling for the joint conditional distributions for multivariate longitudinal data. We derive the asymptotic properties for the copula-based estimators of the joint conditional distribution functions. For illustration we apply our estimation method to an epidemiological study of childhood growth and blood pressure.  相似文献   

4.
Multivariate count time series data occur in many different disciplines. The class of INteger-valued AutoRegressive (INAR) processes has the great advantage to consider explicitly both the discreteness and autocorrelation characterizing this type of data. Moreover, extensions of the simple INAR(1) model to the multi-dimensional space make it possible to model more than one series simultaneously. However, existing models do not offer great flexibility for dependence modelling, allowing only for positive correlation. In this work, we consider a bivariate INAR(1) (BINAR(1)) process where cross-correlation is introduced through the use of copulas for the specification of the joint distribution of the innovations. We mainly emphasize on the parametric case that arises under the assumption of Poisson marginals. Other marginal distributions are also considered. A short application on a bivariate financial count series illustrates the model.  相似文献   

5.
Copulas are powerful explanatory tools for studying dependence patterns in multivariate data. While the primary use of copula models is in multivariate dependence modelling, they also offer predictive value for regression analysis. This article investigates the utility of copula models for model‐based predictions from two angles. We assess whether, where, and by how much various copula models differ in their predictions of a conditional mean and conditional quantiles. From a model selection perspective, we then evaluate the predictive discrepancy between copula models using in‐sample and out‐of‐sample predictions both in bivariate and higher‐dimensional settings. Our findings suggest that some copula models are more difficult to distinguish in terms of their overall predictive power than others, and depending on the quantity of interest, the differences in predictions can be detected only in some targeted regions. The situations where copula‐based regression approaches would be advantageous over traditional ones are discussed using simulated and real data. The Canadian Journal of Statistics 47: 8–26; 2019 © 2018 Statistical Society of Canada  相似文献   

6.
Missing covariates data is a common issue in generalized linear models (GLMs). A model-based procedure arising from properly specifying joint models for both the partially observed covariates and the corresponding missing indicator variables represents a sound and flexible methodology, which lends itself to maximum likelihood estimation as the likelihood function is available in computable form. In this paper, a novel model-based methodology is proposed for the regression analysis of GLMs when the partially observed covariates are categorical. Pair-copula constructions are used as graphical tools in order to facilitate the specification of the high-dimensional probability distributions of the underlying missingness components. The model parameters are estimated by maximizing the weighted log-likelihood function by using an EM algorithm. In order to compare the performance of the proposed methodology with other well-established approaches, which include complete-cases and multiple imputation, several simulation experiments of Binomial, Poisson and Normal regressions are carried out under both missing at random and non-missing at random mechanisms scenarios. The methods are illustrated by modeling data from a stage III melanoma clinical trial. The results show that the methodology is rather robust and flexible, representing a competitive alternative to traditional techniques.  相似文献   

7.
We consider semiparametric multivariate data models based on copula representation of the common distribution function. A copula is characterized by a parameter of association and marginal distribution functions. This parameter and the marginal distributions are unknown. In this article, we study the estimator of the parameter of association in copulas with the marginal distribution functions assumed as nuisance parameters restricted by the assumption that the components are identically distributed. Results of this work could be used to construct special kinds of tests of homogeneity for random vectors having dependent components.  相似文献   

8.
Multivariate Dispersion Models Generated From Gaussian Copula   总被引:5,自引:0,他引:5  
In this paper a class of multivariate dispersion models generated from the multivariate Gaussian copula is presented. Being a multivariate extension of Jørgensen's (1987a) dispersion models, this class of multivariate models is parametrized by marginal position, dispersion and dependence parameters, producing a large variety of multivariate discrete and continuous models including the multivariate normal as a special case. Properties of the multivariate distributions are investigated, some of which are similar to those of the multivariate normal distribution, which makes these models potentially useful for the analysis of correlated non-normal data in a way analogous to that of multivariate normal data. As an example, we illustrate an application of the models to the regression analysis of longitudinal data, and establish an asymptotic relationship between the likelihood equation and the generalized estimating equation of Liang & Zeger (1986).  相似文献   

9.
10.
Some parametric families of multivariate extreme-value distributions have been proposed in recent years; several additional parametric families are derived here. The parametric models are fitted, using numerical maximum likelihood, to some environmental multivariate extreme data sets consisting of extreme concentrations of a pollutant at several monitoring stations in a region. Some multivariate nonnormal data analysis techniques are proposed to aid in the likelihood analysis. The new models, together with previous models, appear to be adequate for inferences in that they cover a wide range of possible dependence patterns.  相似文献   

11.
Specific efficacy criteria were defined by the International Headache Society for controlled clinical trials on acute migraine. They are derived from the pain profile and the timing of rescue medication intake. We present a methodology to improve the analysis of such trials. Instead of analysing each endpoint separately, we model the joint distribution and derive success rates in any criteria as predictions. We use cumulative regression models for each response at a time and a multivariate normal copula to model the dependence between responses. Parameters are estimated using maximum likelihood. Benefits of the method include a reduction in the number of tests performed and an increase in their power. The method is well suited to dose–response trials from which predictions can be used to select doses and optimize the design of subsequent trials. More generally, our method permits a very flexible modelling of longitudinal series of ordinal data. Copyright © 2003 John Wiley & Sons, Ltd.  相似文献   

12.
A bivariate model of claim frequencies and severities   总被引:1,自引:1,他引:0  
Bivariate claim data come from a population that consists of insureds who may claim either one, both or none of the two types of benefits covered by a policy. In the present paper, we develop a statistical procedure to fit bivariate distributions of claims in presence of covariates. This allows for a more accurate study of insureds' choice and size in the frequency and severity of the two types of claims. A generalised logistic model is employed to examine the frequency probabilities, whilst the three parameter Burr distribution is suggested to model the underlying severity distributions. The bivariate copula model is exploited in such a way that it allows us to adjust for a range of frequency dependence structures; a method for assessing the adequacy of the fitted severity model is outlined. A health claims dataset illustrates the methods; we describe the use of orthogonal polynomials for characterising the relationship between age and the frequency and severity models.  相似文献   

13.
Vine copula provides a flexible tool to capture asymmetry in modeling multivariate distributions. Nevertheless, its flexibility is achieved at the expense of exponentially increasing complexity of the model. To alleviate this issue, the simplifying assumption (SA) is commonly adapted in specific applications of vine copula models. In this paper, generalized linear models (GLMs) are proposed for the parameters in conditional bivariate copulas to relax the SA. In the spirit of the principle of parsimony, a regularization methodology is developed to control the number of parameters, leading to sparse vine copula models. The conventional vine copula with the SA, the proposed GLM-based vine copula, and the sparse vine copula are applied to several financial datasets, and the results show that our proposed models outperform the one with SA significantly in terms of the Bayesian information criterion.  相似文献   

14.
In this paper, we study the properties of a special class of frailty models when the frailty is common to several failure times. The models are closely linked to Archimedean copula models. We establish a useful formula for cumulative baseline hazard functions and develop a new estimator for cumulative baseline hazard functions in bivariate frailty regression models. Based on our proposed estimator, we present a graphical model checking procedure. We fit a leukemia data set using our model and end our paper with some discussions.  相似文献   

15.
We propose exploratory, easily implemented methods for diagnosing the appropriateness of an underlying copula model for bivariate failure time data, allowing censoring in either or both failure times. It is found that the proposed approach effectively distinguishes gamma from positive stable copula models when the sample is moderately large or the association is strong. Data from the Womens Health and Aging Study (WHAS, Guralnik et al., The Womenss Health and Aging Study: Health and Social Characterisitics of Older Women with Disability. National Institute on Aging: Bethesda, Mayland, 1995) are analyzed to demonstrate the proposed diagnostic methodology. The positive stable model gives a better overall fit to these data than the gamma frailty model, but it tends to underestimate association at the later time points. The finding is consistent with recent theory differentiating catastrophic from progressive disability onset in older adults. The proposed methods supply an interpretable quantity for copula diagnosis. We hope that they will usefully inform practitioners as to the reasonableness of their modeling choices.  相似文献   

16.
在非寿险损失预测的广义线性模型中,通常假设损失次数与损失强度相互独立,事实上二者之间往往存在一定的相依关系,可通过copula函数来刻画.在损失已经发生的条件下,假设损失次数服从零截断泊松分布,损失强度服从伽玛分布,可以建立损失次数与损失强度相互依赖的copula回归模型.把损失强度的分布扩展到逆高斯分布,并将此模型应用于一组车险保单数据进行实证研究.结果表明:该模型不但在损失预测方面优于独立假设下的广义线性模型,而且也优于损失强度服从伽马分布假设下的copula回归模型.  相似文献   

17.
Most existing reduced-form macroeconomic multivariate time series models employ elliptical disturbances, so that the forecast densities produced are symmetric. In this article, we use a copula model with asymmetric margins to produce forecast densities with the scope for severe departures from symmetry. Empirical and skew t distributions are employed for the margins, and a high-dimensional Gaussian copula is used to jointly capture cross-sectional and (multivariate) serial dependence. The copula parameter matrix is given by the correlation matrix of a latent stationary and Markov vector autoregression (VAR). We show that the likelihood can be evaluated efficiently using the unique partial correlations, and estimate the copula using Bayesian methods. We examine the forecasting performance of the model for four U.S. macroeconomic variables between 1975:Q1 and 2011:Q2 using quarterly real-time data. We find that the point and density forecasts from the copula model are competitive with those from a Bayesian VAR. During the recent recession the forecast densities exhibit substantial asymmetry, avoiding some of the pitfalls of the symmetric forecast densities from the Bayesian VAR. We show that the asymmetries in the predictive distributions of GDP growth and inflation are similar to those found in the probabilistic forecasts from the Survey of Professional Forecasters. Last, we find that unlike the linear VAR model, our fitted Gaussian copula models exhibit nonlinear dependencies between some macroeconomic variables. This article has online supplementary material.  相似文献   

18.
In this paper, we consider that the degradation of two performance characteristics of a product can be modelled by stochastic processes and jointly by copula functions, but different stochastic processes govern the behaviour of each performance characteristic (PC) degradation. Different heterogeneous and homogeneous models are presented considering copula functions and different combinations of the most used stochastic processes in degradation analysis as marginal distributions. This is an important aspect to consider because the behaviour of the degradation of each PC may be different in its nature. As the joint distributions of the proposed models result in complex distributions, the estimation of the parameters of interest is performed via MCMC. A simulation study is performed to compare heterogeneous and homogeneous models. In addition, the proposed models are implemented to crack propagation data of two terminals of an electronic device, and some insights are provided about the product reliability under heterogeneous models.  相似文献   

19.
我们应该选用什么样的相关性指标?   总被引:42,自引:0,他引:42       下载免费PDF全文
张尧庭 《统计研究》2002,19(9):41-44
现在大家在实际中使用的、教科书上见到的 ,度量两个变量之间相关性的指标 ,往往都是皮尔逊的相关系数 ,即使在多元分析教材中 ,谈到典型相关系数 ,它实际上也是基于二元的相关系数。用相关系数来反映变量之间的相关性有什么不好呢 ?我们都已用了很长的时间 ,用它说明了许多问题 ,现在为什么要提出选用什么样的相关性指标 ,难道除它之外 ,还有别的相关性指标吗 ?还有哪些可以选用的指标呢 ?凭什么准则来选用什么样的相关性指标呢 ?这些正是我们这篇文章要探讨的问题。金融市场的风险分析这几年有很快的发展 ,原来的一些分析方法很不适应这一…  相似文献   

20.
Vine copulas are a highly flexible class of dependence models, which are based on the decomposition of the density into bivariate building blocks. For applications one usually makes the simplifying assumption that copulas of conditional distributions are independent of the variables on which they are conditioned. However this assumption has been criticised for being too restrictive. We examine both simplified and non‐simplified vine copulas in three dimensions and investigate conceptual differences. We show and compare contour surfaces of three‐dimensional vine copula models, which prove to be much more informative than the contour lines of the bivariate marginals. Our investigation shows that non‐simplified vine copulas can exhibit arbitrarily irregular shapes, whereas simplified vine copulas appear to be smooth extrapolations of their bivariate margins to three dimensions. In addition to a variety of constructed examples, we also investigate a three‐dimensional subset of the well‐known uranium data set and visually detect the fact that a non‐simplified vine copula is necessary to capture its complex dependence structure.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号