首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 937 毫秒
1.
The characteristic function of the multivariate Student t-distribution is obtained, and it is shown that this characteristic function has the pedogogical virtue of reducing the multivariate problem to the analogous univariate problem. Applications of the characteristic function are discussed.  相似文献   

2.
This article proposes a class of multivariate bilateral selection t distributions useful for analyzing non-normal (skewed and/or bimodal) multivariate data. The class is associated with a bilateral selection mechanism, and it is obtained from a marginal distribution of the centrally truncated multivariate t. It is flexible enough to include the multivariate t and multivariate skew-t distributions and mathematically tractable enough to account for central truncation of a hidden t variable. The class, closed under linear transformation, marginal, and conditional operations, is studied from several aspects such as shape of the probability density function, conditioning of a distribution, scale mixtures of multivariate normal, and a probabilistic representation. The relationships among these aspects are given, and various properties of the class are also discussed. Necessary theories and two applications are provided.  相似文献   

3.
The multivariate skew-t distribution (J Multivar Anal 79:93–113, 2001; J R Stat Soc, Ser B 65:367–389, 2003; Statistics 37:359–363, 2003) includes the Student t, skew-Cauchy and Cauchy distributions as special cases and the normal and skew–normal ones as limiting cases. In this paper, we explore the use of Markov Chain Monte Carlo (MCMC) methods to develop a Bayesian analysis of repeated measures, pretest/post-test data, under multivariate null intercept measurement error model (J Biopharm Stat 13(4):763–771, 2003) where the random errors and the unobserved value of the covariate (latent variable) follows a Student t and skew-t distribution, respectively. The results and methods are numerically illustrated with an example in the field of dentistry.  相似文献   

4.
Finite mixtures of multivariate skew t (MST) distributions have proven to be useful in modelling heterogeneous data with asymmetric and heavy tail behaviour. Recently, they have been exploited as an effective tool for modelling flow cytometric data. A number of algorithms for the computation of the maximum likelihood (ML) estimates for the model parameters of mixtures of MST distributions have been put forward in recent years. These implementations use various characterizations of the MST distribution, which are similar but not identical. While exact implementation of the expectation-maximization (EM) algorithm can be achieved for ‘restricted’ characterizations of the component skew t-distributions, Monte Carlo (MC) methods have been used to fit the ‘unrestricted’ models. In this paper, we review several recent fitting algorithms for finite mixtures of multivariate skew t-distributions, at the same time clarifying some of the connections between the various existing proposals. In particular, recent results have shown that the EM algorithm can be implemented exactly for faster computation of ML estimates for mixtures with unrestricted MST components. The gain in computational time is effected by noting that the semi-infinite integrals on the E-step of the EM algorithm can be put in the form of moments of the truncated multivariate non-central t-distribution, similar to the restricted case, which subsequently can be expressed in terms of the non-truncated form of the central t-distribution function for which fast algorithms are available. We present comparisons to illustrate the relative performance of the restricted and unrestricted models, and demonstrate the usefulness of the recently proposed methodology for the unrestricted MST mixture, by some applications to three real datasets.  相似文献   

5.
This study proposes a class of non-linear realized stochastic volatility (SV) model by applying the Box–Cox (BC) transformation, instead of the logarithmic transformation, to the realized estimator. The non-Gaussian distributions such as Student's t, non-central Student's t, and generalized hyperbolic skew Student's t-distributions are applied to accommodate heavy-tailedness and skewness in returns. The proposed models are fitted to daily returns and realized kernel of six stocks: SP500, FTSE100, Nikkei225, Nasdaq100, DAX, and DJIA using an Markov chain Monte Carlo Bayesian method, in which the Hamiltonian Monte Carlo (HMC) algorithm updates BC parameter and the Riemann manifold HMC algorithm updates latent variables and other parameters that are unable to be sampled directly. Empirical studies provide evidence against both the logarithmic transformation and raw versions of realized SV model.  相似文献   

6.
This paper presents a robust probabilistic mixture model based on the multivariate skew-t-normal distribution, a skew extension of the multivariate Student’s t distribution with more powerful abilities in modelling data whose distribution seriously deviates from normality. The proposed model includes mixtures of normal, t and skew-normal distributions as special cases and provides a flexible alternative to recently proposed skew t mixtures. We develop two analytically tractable EM-type algorithms for computing maximum likelihood estimates of model parameters in which the skewness parameters and degrees of freedom are asymptotically uncorrelated. Standard errors for the parameter estimates can be obtained via a general information-based method. We also present a procedure of merging mixture components to automatically identify the number of clusters by fitting piecewise linear regression to the rescaled entropy plot. The effectiveness and performance of the proposed methodology are illustrated by two real-life examples.  相似文献   

7.
Papers on the analysis of means (ANOM) have been circulating in the quality control literature for decades, routinely describing it as a statistical stand-alone concept. Therefore, we clarify that ANOM should rather be regarded as a special case of a much more universal approach known as multiple contrast tests (MCTs). Perceiving ANOM as a grand-mean-type MCT paves the way for implementing it in the open-source software R. We give a brief tutorial on how to exploit R's versatility and introduce the R package ANOM for drawing the familiar decision charts. Beyond that, we illustrate two practical aspects of data analysis with ANOM: firstly, we compare merits and drawbacks of ANOM-type MCTs and ANOVA F-test and assess their respective statistical powers, and secondly, we show that the benefit of using critical values from multivariate t-distributions for ANOM instead of simple Bonferroni quantiles is oftentimes negligible.  相似文献   

8.
ABSTRACT

The properties of a family of distributions generalizing the secant hyperbolic are developed. This family consists of symmetric distributions, with kurtosis ranging from 1.8 to infinity, and includes the logistic as a special case, the uniform as a limiting case, and closely approximates the normal and Student's t-distributions with corresponding kurtosis. A significant difference between this family and Student's t is that for any member of the generalized secant hyperbolic family, all moments are finite. Further, technical difficulties associated with evaluating moments of Student's t (especially for fractional degrees of freedom) are not present with this family. The properties of the maximum likelihood and modified maximum likelihood estimates of the location and scale parameters for complete samples are considered. Examples illustrate the methods developed in this work.  相似文献   

9.
This article presents an optimization-based approach for the design of acceptance sampling plans by variables for controlling nonconforming proportions when the standard deviation is unknown. The variables are described by rigorous noncentral Student’s t-distributions. Single and double acceptance sampling (AS) plans are addressed. The optimal design results from minimizing the average sampling number (ASN), subject to conditions holding at producer’s and consumer’s required quality levels. The problem is then solved employing a nonlinear programming solver. The results obtained are in close agreement with previous sampling plans found in the literature, outperforming them regarding the feasibility.  相似文献   

10.
A single parametric form is given for the symmetric distributions in the Pearson system with finite variance. In effect, these are Student's t-distributions with ν > 2 and all centered symmetric beta distributions. A different parametrization allows the inclusion of the t-distributions with ν ≤2 at the expense of symmetric beta distributions with a low shape parameter.  相似文献   

11.
Clustering gene expression time course data is an important problem in bioinformatics because understanding which genes behave similarly can lead to the discovery of important biological information. Statistically, the problem of clustering time course data is a special case of the more general problem of clustering longitudinal data. In this paper, a very general and flexible model-based technique is used to cluster longitudinal data. Mixtures of multivariate t-distributions are utilized, with a linear model for the mean and a modified Cholesky-decomposed covariance structure. Constraints are placed upon the covariance structure, leading to a novel family of mixture models, including parsimonious models. In addition to model-based clustering, these models are also used for model-based classification, i.e., semi-supervised clustering. Parameters, including the component degrees of freedom, are estimated using an expectation-maximization algorithm and two different approaches to model selection are considered. The models are applied to simulated data to illustrate their efficacy; this includes a comparison with their Gaussian analogues—the use of these Gaussian analogues with a linear model for the mean is novel in itself. Our family of multivariate t mixture models is then applied to two real gene expression time course data sets and the results are discussed. We conclude with a summary, suggestions for future work, and a discussion about constraining the degrees of freedom parameter.  相似文献   

12.
ABSTRACT

This article investigates the robustness of the shrinkage Bayesian estimator for the relative potency parameter in the combinations of multivariate bioassays proposed in Chen et al. (1999 Chen, D.G., Carter, E.M., Hubert, J.J., Kim, P.T. (1999). Empirical Bayesian estimation for combinations of multivariate bioassays. Biometrics 55(4):10351043. [Google Scholar]), which incorporated prior information on the model parameters based on Jeffreys’ rules. This investigation is carried out for the families of t-distribution and Cauchy-distribution based on the characteristics of bioassay theory since the t-distribution approaches the normal distribution which is the most commonly used distribution in the applications of bioassay as the degrees of freedom increases and the t-distribution approaches the Cauchy-distribution as the degrees of freedom approaches 1 which is also an important distribution in bioassay. A real data is used to illustrate the application of this investigation. This analysis further supports the application of the shrinkage Bayesian estimator to the theory of bioassay along with the empirical Bayesian estimator.  相似文献   

13.
This paper investigates improved testing inferences under a general multivariate elliptical regression model. The model is very flexible in terms of the specification of the mean vector and the dispersion matrix, and of the choice of the error distribution. The error terms are allowed to follow a multivariate distribution in the class of the elliptical distributions, which has the multivariate normal and Student-t distributions as special cases. We obtain Skovgaard's adjusted likelihood ratio (LR) statistics and Barndorff-Nielsen's adjusted signed LR statistics and we compare the methods through simulations. The simulations suggest that the proposed tests display superior finite sample behaviour as compared to the standard tests. Two applications are presented in order to illustrate the methods.  相似文献   

14.
In this study we investigate the problem of estimation and testing of hypotheses in multivariate linear regression models when the errors involved are assumed to be non-normally distributed. We consider the class of heavy-tailed distributions for this purpose. Although our method is applicable for any distribution in this class, we take the multivariate t-distribution for illustration. This distribution has applications in many fields of applied research such as Economics, Business, and Finance. For estimation purpose, we use the modified maximum likelihood method in order to get the so-called modified maximum likelihood estimates that are obtained in a closed form. We show that these estimates are substantially more efficient than least-square estimates. They are also found to be robust to reasonable deviations from the assumed distribution and also many data anomalies such as the presence of outliers in the sample, etc. We further provide test statistics for testing the relevant hypothesis regarding the regression coefficients.  相似文献   

15.
Multivariate mixture regression models can be used to investigate the relationships between two or more response variables and a set of predictor variables by taking into consideration unobserved population heterogeneity. It is common to take multivariate normal distributions as mixing components, but this mixing model is sensitive to heavy-tailed errors and outliers. Although normal mixture models can approximate any distribution in principle, the number of components needed to account for heavy-tailed distributions can be very large. Mixture regression models based on the multivariate t distributions can be considered as a robust alternative approach. Missing data are inevitable in many situations and parameter estimates could be biased if the missing values are not handled properly. In this paper, we propose a multivariate t mixture regression model with missing information to model heterogeneity in regression function in the presence of outliers and missing values. Along with the robust parameter estimation, our proposed method can be used for (i) visualization of the partial correlation between response variables across latent classes and heterogeneous regressions, and (ii) outlier detection and robust clustering even under the presence of missing values. We also propose a multivariate t mixture regression model using MM-estimation with missing information that is robust to high-leverage outliers. The proposed methodologies are illustrated through simulation studies and real data analysis.  相似文献   

16.
This paper presents a new theorem, as a substitute for existing results which are shown to have some errors, for evaluating the exact one-sided percentage points of the multivariate normal distribution with a singular negative product correlation structure. By extending the result from the multivariate normal distribution to the multivariate t-distribution with corresponding singular correlation structure, we tabulate the one-sided critical points for the Analysis of Means procedure.  相似文献   

17.
The studentized range test is a widely applied statistical procedure to compare several normal means within the analysis of variance. However, up to now no general methodology is available to perform the all-pair comparisons precisely, such as the computation of p-values or quantiles in the simple unbalanced one-way layout. Instead, a variety of approximations have been proposed in the past. This article focuses on exact computations of simultaneous confidence intervals and exact sample size determinations for all-pair comparisons in the analysis of variance involving arbitrary variance-covariance matrices. General power expressions in closed form are developed and numerical issues concerning the arising multivariate central and noncentral t-distributions are discussed. An application to the usual fixed effects analysis of covariance illustrates the use of the obtained results.  相似文献   

18.
In this paper, the hypothesis testing and confidence region construction for a linear combination of mean vectors for K independent multivariate normal populations are considered. A new generalized pivotal quantity and a new generalized test variable are derived based on the concepts of generalized p-values and generalized confidence regions. When only two populations are considered, our results are equivalent to those proposed by Gamage et al. [Generalized p-values and confidence regions for the multivariate Behrens–Fisher problem and MANOVA, J. Multivariate Aanal. 88 (2004), pp. 117–189] in the bivariate case, which is also known as the bivariate Behrens–Fisher problem. However, in some higher dimension cases, these two results are quite different. The generalized confidence region is illustrated with two numerical examples and the merits of the proposed method are numerically compared with those of the existing methods with respect to their expected areas, coverage probabilities under different scenarios.  相似文献   

19.
A critical issue in modeling binary response data is the choice of the links. We introduce a new link based on the Student’s t-distribution (t-link) for correlated binary data. The t-link relates to the common probit-normal link adding one additional parameter which controls the heaviness of the tails of the link. We propose an interesting EM algorithm for computing the maximum likelihood for generalized linear mixed t-link models for correlated binary data. In contrast with recent developments (Tan et al. in J. Stat. Comput. Simul. 77:929–943, 2007; Meza et al. in Comput. Stat. Data Anal. 53:1350–1360, 2009), this algorithm uses closed-form expressions at the E-step, as opposed to Monte Carlo simulation. Our proposed algorithm relies on available formulas for the mean and variance of a truncated multivariate t-distribution. To illustrate the new method, a real data set on respiratory infection in children and a simulation study are presented.  相似文献   

20.
Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, for achieving flexible trade-off between estimation robustness and efficiency. Simulation studies and an application on analysing lung growth longitudinal data showcase the efficacy of the proposed approach.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号