首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Hea-Jung Kim 《Statistics》2013,47(1):89-106
This article introduces a class of weighted multivariate t-distributions, which includes the multivariate generalized Student t and multivariate skew t as its special members. This class is defined as the marginal distribution of a doubly truncated multivariate generalized Student t-distribution and studied from several aspects such as weighting of probability density functions, inequality constrained multivariate Student t-distributions, scale mixtures of multivariate normal and probabilistic representations. The relationships among these aspects are given, and various properties of the class are also discussed. Necessary theories and two applications are provided.  相似文献   

2.
Cluster analysis is the automated search for groups of homogeneous observations in a data set. A popular modeling approach for clustering is based on finite normal mixture models, which assume that each cluster is modeled as a multivariate normal distribution. However, the normality assumption that each component is symmetric is often unrealistic. Furthermore, normal mixture models are not robust against outliers; they often require extra components for modeling outliers and/or give a poor representation of the data. To address these issues, we propose a new class of distributions, multivariate t distributions with the Box-Cox transformation, for mixture modeling. This class of distributions generalizes the normal distribution with the more heavy-tailed t distribution, and introduces skewness via the Box-Cox transformation. As a result, this provides a unified framework to simultaneously handle outlier identification and data transformation, two interrelated issues. We describe an Expectation-Maximization algorithm for parameter estimation along with transformation selection. We demonstrate the proposed methodology with three real data sets and simulation studies. Compared with a wealth of approaches including the skew-t mixture model, the proposed t mixture model with the Box-Cox transformation performs favorably in terms of accuracy in the assignment of observations, robustness against model misspecification, and selection of the number of components.  相似文献   

3.
The classical bivariate F distribution arises from ratios of chi-squared random variables with common denominators. A consequent disadvantage is that its univariate F marginal distributions have one degree of freedom parameter in common. In this paper, we add a further independent chi-squared random variable to the denominator of one of the ratios and explore the extended bivariate F distribution, with marginals on arbitrary degrees of freedom, that results. Transformations linking F, beta and skew t distributions are then applied componentwise to produce bivariate beta and skew t distributions which also afford marginal (beta and skew t) distributions with arbitrary parameter values. We explore a variety of properties of these distributions and give an example of a potential application of the bivariate beta distribution in Bayesian analysis.  相似文献   

4.
Abstract. For probability distributions on ? q, a detailed study of the breakdown properties of some multivariate M‐functionals related to Tyler's [Ann. Statist. 15 (1987) 234] ‘distribution‐free’ M‐functional of scatter is given. These include a symmetrized version of Tyler's M‐functional of scatter, and the multivariate t M‐functionals of location and scatter. It is shown that for ‘smooth’ distributions, the (contamination) breakdown point of Tyler's M‐functional of scatter and of its symmetrized version are 1/q and , respectively. For the multivariate t M‐functional which arises from the maximum likelihood estimate for the parameters of an elliptical t distribution on ν ≥ 1 degrees of freedom the breakdown point at smooth distributions is 1/( q + ν). Breakdown points are also obtained for general distributions, including empirical distributions. Finally, the sources of breakdown are investigated. It turns out that breakdown can only be caused by contaminating distributions that are concentrated near low‐dimensional subspaces.  相似文献   

5.
We consider here a generalization of the skew-normal distribution, GSN(λ1,λ2,ρ), defined through a standard bivariate normal distribution with correlation ρ, which is a special case of the unified multivariate skew-normal distribution studied recently by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574]. We then present some simple and useful properties of this distribution and also derive its moment generating function in an explicit form. Next, we show that distributions of order statistics from the trivariate normal distribution are mixtures of these generalized skew-normal distributions; thence, using the established properties of the generalized skew-normal distribution, we derive the moment generating functions of order statistics, and also present expressions for means and variances of these order statistics.Next, we introduce a generalized skew-tν distribution, which is a special case of the unified multivariate skew-elliptical distribution presented by Arellano-Valle and Azzalini [2006. On the unification of families of skew-normal distributions. Scand. J. Statist. 33, 561–574] and is in fact a three-parameter generalization of Azzalini and Capitanio's [2003. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution. J. Roy. Statist. Soc. Ser. B 65, 367–389] univariate skew-tν form. We then use the relationship between the generalized skew-normal and skew-tν distributions to discuss some properties of generalized skew-tν as well as distributions of order statistics from bivariate and trivariate tν distributions. We show that these distributions of order statistics are indeed mixtures of generalized skew-tν distributions, and then use this property to derive explicit expressions for means and variances of these order statistics.  相似文献   

6.
Simultaneous estimation of scale parameters is considered in mixture distributions under squared-error loss. A general class of estimators is obtained which dominates the componentwise best multiple estimators and the moment estimators. As special cases, improved estimators are obtained for the multivariate t-distribution and the p-variate Lomax distribution.  相似文献   

7.
A class of weighted elliptical models useful for analyzing nonnormal and bimodal multivariate data is introduced. It is obtained from the marginal distribution of a centrally truncated multivariate elliptical distribution. As a special case, a finite mixture of weighted multinormal distribution is examined in detail, establishing connections with the multinormal and the finite mixture of multinormal. The special class of distributions is studied from several aspects such as weighting of probability density functions, association with centrally truncated distributions, and a finite scale mixture scheme. The relationships among these aspects are given, and various properties of the class are also discussed. For the inference of the class, an MCMC procedure and its numerical example are provided.  相似文献   

8.
This note mainly aims to illustrate that some quadratic problems are robust in a sense with respect to the probabilistic distributions involved. The secondary moments of the quadratic forms of a multivariate t distribution are calculated. Then, the resulting formulae are applied to the quadratic problems of quadratic sufficiency and quadratic prediction. It is shown by revisiting the two problems that the same conclusions hold when the multivariate normal distribution is replaced with a multivariate t distribution.  相似文献   

9.
For observable indicators with ordered categories one can assume underlying latent variables following certain marginal distributions. Transforming the latent variables changes its marginal distributions but not the observable qualitative indicators. The joint distribution of the latent variables can be constructed from the marginal distributions. There is a broad class of multivariate distributions for which the observable indicators are equivalent. By choosing the multivariate normal distribution from this class we can analyse a linear relationship between the transformed latent variables. This leads to latent structural equation models. Estimation of these latter models is therefore more general than the distributional assumption might initially suggest. Robustness of the estimation procedure is also discussed for deviations from this distribution family. Using ordinal business survey data of the German Ifo-institute we test the efficiency of firms' price expectations implied by the rational expectation hypothesis.  相似文献   

10.
In this article we introduce a new generalization of skew-t distributions, which contains the standard skew-t distribution, as a special case. This new class of distributions is an adequate model for modeling some dataset rather than the standard skew-t distributions. This kind of distributions can be represented as a scale-shape mixture of the extended skew-normal distributions. The main properties of this family of distributions are studied and a recurrence relation for the cumulative distribution functions (cdf) of them is presented. We derive the distribution of the order statistics from the trivariate exchangeable t-distribution in terms of our distribution and then an exact expression for the cdf of order statistics is derived. Likelihood inference for this distribution is also examined. The method is illustrated with a numerical example via a simulation study.  相似文献   

11.
Abstract. The entropy and mutual information index are important concepts developed by Shannon in the context of information theory. They have been widely studied in the case of the multivariate normal distribution. We first extend these tools to the full symmetric class of multivariate elliptical distributions and then to the more flexible families of multivariate skew‐elliptical distributions. We study in detail the cases of the multivariate skew‐normal and skew‐t distributions. We implement our findings to the application of the optimal design of an ozone monitoring station network in Santiago de Chile.  相似文献   

12.
Often the dependence in multivariate survival data is modeled through an individual level effect called the frailty. Due to its mathematical simplicity, the gamma distribution is often used as the frailty distribution for hazard modeling. However, it is well known that the gamma frailty distribution has many drawbacks. For example, it weakens the effect of covariates. In addition, in the presence of a multilevel model, overall frailty comes from several levels. To overcome such drawbacks, more heavy-tailed distributions are needed to model the frailty distribution in order to incorporate extra variability. In this article, we develop a class of log-skew-t distributions for the frailty. This class includes the log-normal distribution along with many other heavy tailed distributions, e.g., log-Cauchy, log normal, and log-t as special cases.

Conditional on the frailty, the survival times are assumed to be independent with proportional hazard structure. The modeling process is then completed by assuming multilevel frailty-effects. Instead of tuning a strict parameterization of the baseline hazard function, we consider the partial likelihood approach and thus leave the baseline function unspecified. By eliminating the hazard, the pre-specification and computation are simplified considerably.  相似文献   

13.
This article studies the minima stable property of the general multivariate Pareto distributions MP(k)(I), MP(k)(II), MP(k)(III), MP(k)(IV) which can be applied to characterize the MP(k) distribution via its weighted ordered coordinates minima and marginal distribution. Also, the multivariate semi-Pareto distribution (denoted by MSP) is discerned in the class of geometric minima infinite divisible and geometric minima stable distributions. If the exponent measure is satisfied by some functional equation, then the geometric minima stable property can be used to characterize the MSP distribution. Finally, the finite sample minima infinite divisible property of the MP(k)(I), (II), and (IV) distributions is also discussed.  相似文献   

14.
ABSTRACT

Elsewhere, I have promoted (univariate continuous) “transformation of scale” (ToS) distributions having densities of the form 2g?1(x)) where g is a symmetric distribution and Π is a transformation function with a special property. Here, I develop bivariate (readily multivariate) ToS distributions. Univariate ToS distributions have a transformation of random variable relationship with Azzalini-type skew-symmetric distributions; the bivariate ToS distribution here arises from marginal variable transformation of a particular form of bivariate skew-symmetric distribution. Examples are given, as are basic properties—unimodality, a covariance property, random variate generation—and connections with a bivariate inverse Gaussian distribution are pointed out.  相似文献   

15.
Skew-normal/independent distributions are a class of asymmetric thick-tailed distributions that include the skew-normal distribution as a special case. In this paper, we explore the use of Markov Chain Monte Carlo (MCMC) methods to develop a Bayesian analysis in multivariate measurement errors models. We propose the use of skew-normal/independent distributions to model the unobserved value of the covariates (latent variable) and symmetric normal/independent distributions for the random errors term, providing an appealing robust alternative to the usual symmetric process in multivariate measurement errors models. Among the distributions that belong to this class of distributions, we examine univariate and multivariate versions of the skew-normal, skew-t, skew-slash and skew-contaminated normal distributions. The results and methods are applied to a real data set.  相似文献   

16.
This article makes two contributions. First, we outline a simple simulation-based framework for constructing conditional distributions for multifactor and multidimensional diffusion processes, for the case where the functional form of the conditional density is unknown. The distributions can be used, for example, to form predictive confidence intervals for time period t + τ, given information up to period t. Second, we use the simulation-based approach to construct a test for the correct specification of a diffusion process. The suggested test is in the spirit of the conditional Kolmogorov test of Andrews. However, in the present context the null conditional distribution is unknown and is replaced by its simulated counterpart. The limiting distribution of the test statistic is not nuisance parameter-free. In light of this, asymptotically valid critical values are obtained via appropriate use of the block bootstrap. The suggested test has power against a larger class of alternatives than tests that are constructed using marginal distributions/densities. The findings of a small Monte Carlo experiment underscore the good finite sample properties of the proposed test, and an empirical illustration underscores the ease with which the proposed simulation and testing methodology can be applied.  相似文献   

17.
This paper considers multiple regression model with multivariate spherically symmetric errors to determine optimal β-expectation tolerance regions for the future regression vector (FRV) and future residual sum of squares (FRSS) by using the prediction distributions of some appropriate functions of future responses. The prediction distribution of the FRV, conditional on the observed responses, is multivariate Student-t distribution. Similarly, the prediction distribution of the FRSS is a beta distribution. The optimal β-expectation tolerance regions for the FRV and FRSS have been obtained based on the F -distribution and beta distribution, respectively. The results in this paper are applicable for multiple regression model with normal and Student-t errors.   相似文献   

18.
This paper presents a class of generalized Wald, generalized score and generalized likelihood ratio statistics for hypothesis testing and model selection for multivariate failure time data. These statistics are based on a marginal hazard model with a common baseline hazard function. The large sample distributions of these statistics are examined. It is shown that the proposed test statistics follow asymptotically a weighted sum of independent χ12 distributions.  相似文献   

19.
The distributions of the time from Human Immunodeficiency Virus (HIV) infection to the onset of Acquired Immune Deficiency Syndrome (AIDS) and of the residual time to AIDS diagnosis are important for modeling the growth of the AIDS epidemic and for predicting onset of the disease in an individual. Markers such as CD4 counts carry valuable information about disease progression and therefore about the two survival distributions. Building on the framework set out by Jewell and Kalbfleisch (1992), we study these two survival distributions based on stochastic models for the marker process (X(t)) and a marker-dependent hazard (h()). We examine various plausible CD4 marker processes and marker-dependent hazard functions for AIDS proposed in recent literature. For a random effects plus Brownian motion marker process X(t)=(a+bt+BM(t))4, where a has a normal distribution, b<0 is an unknown parameter and BM(t) is Brownian motion, and marker-dependent hazard h(X(t)), we prove that, given CD4 cell count X(t), the residual time to AIDS distribution does not depend on the time since infection t. Using simulation and numerical integration, we find the marginal incubation period distribution, the marginal hazard and the residual time distribution for several combinations of marker processes and marker-dependent hazards. An example using data from the Multicenter AIDS Cohort Study is given. A simple regression model relating the cube root of residual time to AIDS to CD4 count is suggested.  相似文献   

20.
Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, for achieving flexible trade-off between estimation robustness and efficiency. Simulation studies and an application on analysing lung growth longitudinal data showcase the efficacy of the proposed approach.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号