期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Finite mixtures of multivariate skew t-distributions: some recent and new results

Sharon Lee Geoffrey J. McLachlan 《Statistics and Computing》2014,24(2):181-202

Finite mixtures of multivariate skew t (MST) distributions have proven to be useful in modelling heterogeneous data with asymmetric and heavy tail behaviour. Recently, they have been exploited as an effective tool for modelling flow cytometric data. A number of algorithms for the computation of the maximum likelihood (ML) estimates for the model parameters of mixtures of MST distributions have been put forward in recent years. These implementations use various characterizations of the MST distribution, which are similar but not identical. While exact implementation of the expectation-maximization (EM) algorithm can be achieved for ‘restricted’ characterizations of the component skew t-distributions, Monte Carlo (MC) methods have been used to fit the ‘unrestricted’ models. In this paper, we review several recent fitting algorithms for finite mixtures of multivariate skew t-distributions, at the same time clarifying some of the connections between the various existing proposals. In particular, recent results have shown that the EM algorithm can be implemented exactly for faster computation of ML estimates for mixtures with unrestricted MST components. The gain in computational time is effected by noting that the semi-infinite integrals on the E-step of the EM algorithm can be put in the form of moments of the truncated multivariate non-central t-distribution, similar to the restricted case, which subsequently can be expressed in terms of the non-truncated form of the central t-distribution function for which fast algorithms are available. We present comparisons to illustrate the relative performance of the restricted and unrestricted models, and demonstrate the usefulness of the recently proposed methodology for the unrestricted MST mixture, by some applications to three real datasets. 相似文献

2.

Robust mixture modeling using the skew <Emphasis Type="Italic">t</Emphasis> distribution

Tsung I. Lin Jack C. Lee Wan J. Hsieh 《Statistics and Computing》2007,17(2):81-92

A finite mixture model using the Student's t distribution has been recognized as a robust extension of normal mixtures. Recently, a mixture of skew normal distributions has been found to be effective in the treatment of heterogeneous data involving asymmetric behaviors across subclasses. In this article, we propose a robust mixture framework based on the skew t distribution to efficiently deal with heavy-tailedness, extra skewness and multimodality in a wide range of settings. Statistical mixture modeling based on normal, Student's t and skew normal distributions can be viewed as special cases of the skew t mixture model. We present analytically simple EM-type algorithms for iteratively computing maximum likelihood estimates. The proposed methodology is illustrated by analyzing a real data example. 相似文献

3.

Parameter estimation for mixtures of skew Laplace normal distributions and application in mixture regression modeling

Fatma Zehra Doğru Olcay Arslan 《统计学通讯:理论与方法》2017,46(21):10879-10896

In this article, we propose mixtures of skew Laplace normal (SLN) distributions to model both skewness and heavy-tailedness in the neous data set as an alternative to mixtures of skew Student-t-normal (STN) distributions. We give the expectation–maximization (EM) algorithm to obtain the maximum likelihood (ML) estimators for the parameters of interest. We also analyze the mixture regression model based on the SLN distribution and provide the ML estimators of the parameters using the EM algorithm. The performance of the proposed mixture model is illustrated by a simulation study and two real data examples. 相似文献

4.

Robust mixture modeling using multivariate skew <Emphasis Type="Italic">t</Emphasis> distributions

Tsung-I Lin 《Statistics and Computing》2010,20(3):343-356

This paper presents a robust mixture modeling framework using the multivariate skew t distributions, an extension of the multivariate Student’s t family with additional shape parameters to regulate skewness. The proposed model results in a very complicated likelihood. Two variants of Monte Carlo EM algorithms are developed to carry out maximum likelihood estimation of mixture parameters. In addition, we offer a general information-based method for obtaining the asymptotic covariance matrix of maximum likelihood estimates. Some practical issues including the selection of starting values as well as the stopping criterion are also discussed. The proposed methodology is applied to a subset of the Australian Institute of Sport data for illustration. 相似文献

5.

The skew generalized t distribution as the scale mixture of a skew exponential power distribution and its applications in robust estimation

Olcay Arslan Ali İ. Genç 《Statistics》2013,47(5):481-498

In this paper, we consider the family of skew generalized t (SGT) distributions originally introduced by Theodossiou [P. Theodossiou, Financial data and the skewed generalized t distribution, Manage. Sci. Part 1 44 (12) ( 1998), pp. 1650–1661] as a skew extension of the generalized t (GT) distribution. The SGT distribution family warrants special attention, because it encompasses distributions having both heavy tails and skewness, and many of the widely used distributions such as Student's t, normal, Hansen's skew t, exponential power, and skew exponential power (SEP) distributions are included as limiting or special cases in the SGT family. We show that the SGT distribution can be obtained as the scale mixture of the SEP and generalized gamma distributions. We investigate several properties of the SGT distribution and consider the maximum likelihood estimation of the location, scale, and skewness parameters under the assumption that the shape parameters are known. We show that if the shape parameters are estimated along with the location, scale, and skewness parameters, the influence function for the maximum likelihood estimators becomes unbounded. We obtain the necessary conditions to ensure the uniqueness of the maximum likelihood estimators for the location, scale, and skewness parameters, with known shape parameters. We provide a simple iterative re-weighting algorithm to compute the maximum likelihood estimates for the location, scale, and skewness parameters and show that this simple algorithm can be identified as an EM-type algorithm. We finally present two applications of the SGT distributions in robust estimation. 相似文献

6.

Maximum likelihood inference for mixtures of skew Student-t-normal distributions through practical EM-type algorithms

Hsiu J. Ho Saumyadipta Pyne Tsung I. Lin 《Statistics and Computing》2012,22(1):287-299

This paper deals with the problem of maximum likelihood estimation for a mixture of skew Student-t-normal distributions, which is a novel model-based tool for clustering heterogeneous (multiple groups) data in the presence of skewed and heavy-tailed outcomes. We present two analytically simple EM-type algorithms for iteratively computing the maximum likelihood estimates. The observed information matrix is derived for obtaining the asymptotic standard errors of parameter estimates. A small simulation study is conducted to demonstrate the superiority of the skew Student-t-normal distribution compared to the skew t distribution. The proposed methodology is particularly useful for analyzing multimodal asymmetric data as produced by major biotechnological platforms like flow cytometry. We provide such an application with the help of an illustrative example. 相似文献

7.

A class of weighted multivariate distributions related to doubly truncated multivariate t-distribution

Hea-Jung Kim 《Statistics》2013,47(1):89-106

This article introduces a class of weighted multivariate t-distributions, which includes the multivariate generalized Student t and multivariate skew t as its special members. This class is defined as the marginal distribution of a doubly truncated multivariate generalized Student t-distribution and studied from several aspects such as weighting of probability density functions, inequality constrained multivariate Student t-distributions, scale mixtures of multivariate normal and probabilistic representations. The relationships among these aspects are given, and various properties of the class are also discussed. Necessary theories and two applications are provided. 相似文献

8.

Bayesian analysis of mixture modelling using the multivariate t distribution

Lin Tsung I. Lee Jack C. Ni Huey F. 《Statistics and Computing》2004,14(2):119-130

A finite mixture model using the multivariate t distribution has been shown as a robust extension of normal mixtures. In this paper, we present a Bayesian approach for inference about parameters of t-mixture models. The specifications of prior distributions are weakly informative to avoid causing nonintegrable posterior distributions. We present two efficient EM-type algorithms for computing the joint posterior mode with the observed data and an incomplete future vector as the sample. Markov chain Monte Carlo sampling schemes are also developed to obtain the target posterior distribution of parameters. The advantages of Bayesian approach over the maximum likelihood method are demonstrated via a set of real data. 相似文献

9.

Robust multivariate mixture regression models with incomplete data

Hwa Kyung Lim Naveen N. Narisetty 《Journal of Statistical Computation and Simulation》2017,87(2):328-347

Multivariate mixture regression models can be used to investigate the relationships between two or more response variables and a set of predictor variables by taking into consideration unobserved population heterogeneity. It is common to take multivariate normal distributions as mixing components, but this mixing model is sensitive to heavy-tailed errors and outliers. Although normal mixture models can approximate any distribution in principle, the number of components needed to account for heavy-tailed distributions can be very large. Mixture regression models based on the multivariate t distributions can be considered as a robust alternative approach. Missing data are inevitable in many situations and parameter estimates could be biased if the missing values are not handled properly. In this paper, we propose a multivariate t mixture regression model with missing information to model heterogeneity in regression function in the presence of outliers and missing values. Along with the robust parameter estimation, our proposed method can be used for (i) visualization of the partial correlation between response variables across latent classes and heterogeneous regressions, and (ii) outlier detection and robust clustering even under the presence of missing values. We also propose a multivariate t mixture regression model using MM-estimation with missing information that is robust to high-leverage outliers. The proposed methodologies are illustrated through simulation studies and real data analysis. 相似文献

10.

Estimation of a scale parameter in mixture models with unknown location

《Journal of statistical planning and inference》2005,128(1):191-218

Estimation of the scale parameter in mixture models with unknown location is considered under Stein's loss. Under certain conditions, the inadmissibility of the “usual” estimator is established by exhibiting better estimators. In addition, robust improvements are found for a specified submodel of the original model. The results are applied to mixtures of normal distributions and mixtures of exponential distributions. Improved estimators of the variance of a normal distribution are shown to be robust under any scale mixture of normals having variance greater than the variance of that normal distribution. In particular, Stein's (Ann. Inst. Statist. Math. 16 (1964) 155) and Brewster's and Zidek's (Ann. Statist. 2 (1974) 21) estimators obtained under the normal model are robust under the t model, for arbitrary degrees of freedom, and under the double-exponential model. Improved estimators for the variance of a t distribution with unknown and arbitrary degrees of freedom are also given. In addition, improved estimators for the scale parameter of the multivariate Lomax distribution (which arises as a certain mixture of exponential distributions) are derived and the robustness of Zidek's (Ann. Statist. 1 (1973) 264) and Brewster's (Ann. Statist. 2 (1974) 553) estimators of the scale parameter of an exponential distribution is established under a class of modified Lomax distributions. 相似文献

11.

The generalized Gudermannian distribution: inference and volatility modelling

Emrah Altun 《Statistics》2019,53(2):364-386

In this paper, we introduce a new distribution, called generalized Gudermannian (GG) distribution, and its skew extension for GARCH models in modelling daily Value-at-Risk (VaR). Basic structural properties of the proposed distribution are obtained including probability density and cumulative distribution functions, moments, and stochastic representation. The maximum likelihood method is used to estimate unknown parameters of the proposed model and finite sample performance of maximum likelihood estimates are evaluated by means of Monte-Carlo simulation study. The real data application on Nikkei 225 index is given to demonstrate the performance of GARCH model specified under skew extension of GG innovation distribution against normal, Student's-t, skew normal and generalized error and skew generalized error distributions in terms of the accuracy of VaR forecasts. The empirical results show that the GARCH model with GG innovation distribution produces the most accurate VaR forecasts for all confidence levels. 相似文献

12.

On a nonlinear Birnbaum–Saunders model based on a bivariate construction and its characteristics

Mohsen Khosravi Ahad Jamalizadeh Emilio Porcu 《统计学通讯:理论与方法》2013,42(3):772-793

Abstract

The Birnbaum-Saunders (BS) distribution is an asymmetric probability model that is receiving considerable attention. In this article, we propose a methodology based on a new class of BS models generated from the Student-t distribution. We obtain a recurrence relationship for a BS distribution based on a nonlinear skew–t distribution. Model parameters estimators are obtained by means of the maximum likelihood method, which are evaluated by Monte Carlo simulations. We illustrate the obtained results by analyzing two real data sets. These data analyses allow the adequacy of the proposed model to be shown and discussed by applying model selection tools. 相似文献

13.

Mixture of linear mixed models using multivariate t distribution

《Journal of Statistical Computation and Simulation》2012,82(4):771-787

Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, for achieving flexible trade-off between estimation robustness and efficiency. Simulation studies and an application on analysing lung growth longitudinal data showcase the efficacy of the proposed approach. 相似文献

14.

A generalized skew two-piece skew-elliptical distribution

Mahdi Salehi Ahad Jamalizadeh Mahdi Doostparast 《Statistical Papers》2014,55(2):409-429

We present a new generalized family of skew two-piece skew-elliptical (GSTPSE) models and derive some its statistical properties. It is shown that the new family of distribution may be written as a mixture of generalized skew elliptical distributions. Also, a new representation theorem for a special case of GSTPSE-distribution is given. Next, we will focus on t kernel density and prove that it is a scale mixture of the generalized skew two-piece skew normal distributions. An explicit expression for the central moments as well as a recurrence relations for its cumulative distribution function and density are obtained. Since, this special case is a uni-/bimodal distribution, a sufficient condition for each cases is given. A real data set on heights of Australian females athletes is analysed. Finally, some concluding remarks and open problems are discussed. 相似文献

15.

Bayesian analysis of skew-normal independent linear mixed models with heterogeneity in the random-effects population

Celso Rômulo Barbosa CabralVíctor Hugo Lachos Maria Regina Madruga 《Journal of statistical planning and inference》2012,142(1):181-200

We present a new class of models to fit longitudinal data, obtained with a suitable modification of the classical linear mixed-effects model. For each sample unit, the joint distribution of the random effect and the random error is a finite mixture of scale mixtures of multivariate skew-normal distributions. This extension allows us to model the data in a more flexible way, taking into account skewness, multimodality and discrepant observations at the same time. The scale mixtures of skew-normal form an attractive class of asymmetric heavy-tailed distributions that includes the skew-normal, skew-Student-t, skew-slash and the skew-contaminated normal distributions as special cases, being a flexible alternative to the use of the corresponding symmetric distributions in this type of models. A simple efficient MCMC Gibbs-type algorithm for posterior Bayesian inference is employed. In order to illustrate the usefulness of the proposed methodology, two artificial and two real data sets are analyzed. 相似文献

16.

Flexible mixture modeling via the multivariate t distribution with?the?Box-Cox transformation: an?alternative to?the?skew-t distribution

Lo K Gottardo R 《Statistics and Computing》2012,22(1):33-52

Cluster analysis is the automated search for groups of homogeneous observations in a data set. A popular modeling approach for clustering is based on finite normal mixture models, which assume that each cluster is modeled as a multivariate normal distribution. However, the normality assumption that each component is symmetric is often unrealistic. Furthermore, normal mixture models are not robust against outliers; they often require extra components for modeling outliers and/or give a poor representation of the data. To address these issues, we propose a new class of distributions, multivariate t distributions with the Box-Cox transformation, for mixture modeling. This class of distributions generalizes the normal distribution with the more heavy-tailed t distribution, and introduces skewness via the Box-Cox transformation. As a result, this provides a unified framework to simultaneously handle outlier identification and data transformation, two interrelated issues. We describe an Expectation-Maximization algorithm for parameter estimation along with transformation selection. We demonstrate the proposed methodology with three real data sets and simulation studies. Compared with a wealth of approaches including the skew-t mixture model, the proposed t mixture model with the Box-Cox transformation performs favorably in terms of accuracy in the assignment of observations, robustness against model misspecification, and selection of the number of components. 相似文献

17.

Constrained monotone EM algorithms for mixtures of multivariate <Emphasis Type="Italic">t</Emphasis> distributions 总被引：1，自引：0，他引：1

F. Greselin S. Ingrassia 《Statistics and Computing》2010,20(1):9-22

Mixtures of multivariate t distributions provide a robust parametric extension to the fitting of data with respect to normal mixtures. In presence of some noise component, potential outliers or data with longer-than-normal tails, one way to broaden the model can be provided by considering t distributions. In this framework, the degrees of freedom can act as a robustness parameter, tuning the heaviness of the tails, and downweighting the effect of the outliers on the parameters estimation. The aim of this paper is to extend to mixtures of multivariate elliptical distributions some theoretical results about the likelihood maximization on constrained parameter spaces. Further, a constrained monotone algorithm implementing maximum likelihood mixture decomposition of multivariate t distributions is proposed, to achieve improved convergence capabilities and robustness. Monte Carlo numerical simulations and a real data study illustrate the better performance of the algorithm, comparing it to earlier proposals. 相似文献

18.

A multivariate two-factor skew model

Arjun K. Gupta J. Tang 《Statistics》2013,47(4):301-309

It is well known that many data, such as the financial or demographic data, exhibit asymmetric distributions. In recent years, researchers have concentrated their efforts to model this asymmetry. Skew normal model is one of such models that are skew and yet possess many properties of the normal model. In this paper, a new multivariate skew model is proposed, along with its statistical properties. It includes the multivariate normal distribution and multivariate skew normal distribution as special cases. The quadratic form of this random vector follows a χ² distribution. The roles of the parameters in the model are investigated using contour plots of bivariate densities. 相似文献

19.

Two-way ANOVA when the distribution of the error terms is skew t

Nuri Celik Birdal Senoglu 《统计学通讯:模拟与计算》2019,48(1):287-301

In this article, we assume that the distribution of the error terms is skew t in two-way analysis of variance (ANOVA). Skew t distribution is very flexible for modeling the symmetric and the skew datasets, since it reduces to the well-known normal, skew normal, and Student's t distributions. We obtain the estimators of the model parameters by using the maximum likelihood (ML) and the modified maximum likelihood (MML) methodologies. We also propose new test statistics based on these estimators for testing the equality of the treatment and the block means and also the interaction effect. The efficiencies of the ML and the MML estimators and the power values of the test statistics based on them are compared with the corresponding normal theory results via Monte Carlo simulation study. Simulation results show that the proposed methodologies are more preferable. We also show that the test statistics based on the ML estimators are more powerful than the test statistics based on the MML estimators as expected. However, power values of the test statistics based on the MML estimators are very close to the corresponding test statistics based on the ML estimators. At the end of the study, a real life example is given to show the implementation of the proposed methodologies. 相似文献

20.

Objective Bayesian Analysis of Skew‐t Distributions

MARCIA D'ELIA BRANCO MARC G. GENTON BRUNERO LISEO 《Scandinavian Journal of Statistics》2013,40(1):63-85

Abstract. We study the Jeffreys prior and its properties for the shape parameter of univariate skew‐t distributions with linear and nonlinear Student's t skewing functions. In both cases, we show that the resulting priors for the shape parameter are symmetric around zero and proper. Moreover, we propose a Student's t approximation of the Jeffreys prior that makes an objective Bayesian analysis easy to perform. We carry out a Monte Carlo simulation study that demonstrates an overall better behaviour of the maximum a posteriori estimator compared with the maximum likelihood estimator. We also compare the frequentist coverage of the credible intervals based on the Jeffreys prior and its approximation and show that they are similar. We further discuss location‐scale models under scale mixtures of skew‐normal distributions and show some conditions for the existence of the posterior distribution and its moments. Finally, we present three numerical examples to illustrate the implications of our results on inference for skew‐t distributions. 相似文献