1.

A Class of Multivariate Bilateral Selection t Distributions and Its Properties

Hea-Jung Kim 《Communications in Statistics - Theory and Methods》2013,42(12):2136-2154

This article proposes a class of multivariate bilateral selection t distributions useful for analyzing non-normal (skewed and/or bimodal) multivariate data. The class is associated with a bilateral selection mechanism, and it is obtained from a marginal distribution of the centrally truncated multivariate t. It is flexible enough to include the multivariate t and multivariate skew-t distributions and mathematically tractable enough to account for central truncation of a hidden t variable. The class, closed under linear transformation, marginal, and conditional operations, is studied from several aspects such as shape of the probability density function, conditioning of a distribution, scale mixtures of multivariate normal, and a probabilistic representation. The relationships among these aspects are given, and various properties of the class are also discussed. Necessary theories and two applications are provided.

2.

Sensitivity analysis of the t-distribution under truncated normal populations

《Journal of Statistical Computation and Simulation》2012,82(5):723-729

In recent literature, the truncated normal distribution has been used to model the stochastic structure for a variety of random structures. In this paper, the sensitivity of the t-random variable under a left-truncated normal population is explored. Simulation results are used to assess the errors incurred when applying the Student t-distribution to the case of an underlying left-truncated normal population. The maximum errors are modelled as a linear function of the magnitude of the truncation and sample size. In the case of a left-truncated normal population, adjustments to standard inferences for the mean, namely confidence intervals and observed significance levels, based on the t-random variable are introduced.
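The kind of simulation this abstract describes can be sketched as follows. This is only an illustration, not the authors' procedure: the standard normal parent, the sample size of 10, the truncation points, and all function names are assumptions chosen for the example. It estimates how often a nominal 95% Student t interval covers the true mean when the data come from a left-truncated normal population.

```python
import math
import random
import statistics

N = 10               # assumed sample size for the illustration
T_CRIT_DF9 = 2.262   # two-sided 95% Student t critical value, 9 degrees of freedom


def truncated_normal_mean(mu, sigma, a):
    """Mean of N(mu, sigma^2) conditioned on X > a (left truncation at a)."""
    alpha = (a - mu) / sigma
    pdf = math.exp(-0.5 * alpha * alpha) / math.sqrt(2.0 * math.pi)
    upper_tail = 0.5 * math.erfc(alpha / math.sqrt(2.0))
    return mu + sigma * pdf / upper_tail


def draw_left_truncated(rng, mu, sigma, a):
    """Rejection sampler: redraw until the value exceeds the truncation point."""
    while True:
        x = rng.gauss(mu, sigma)
        if x > a:
            return x


def t_interval_coverage(a, reps=20000, seed=0):
    """Fraction of nominal 95% t intervals that cover the true truncated mean."""
    rng = random.Random(seed)
    true_mean = truncated_normal_mean(0.0, 1.0, a)
    hits = 0
    for _ in range(reps):
        sample = [draw_left_truncated(rng, 0.0, 1.0, a) for _ in range(N)]
        xbar = sum(sample) / N
        se = statistics.stdev(sample) / math.sqrt(N)
        if abs(xbar - true_mean) <= T_CRIT_DF9 * se:
            hits += 1
    return hits / reps


if __name__ == "__main__":
    # Truncation points are in standard-deviation units of the parent normal.
    for a in (-3.0, -1.0, 0.0):
        print(a, t_interval_coverage(a))
```

Comparing the printed coverage with the nominal 0.95 at each truncation point gives a rough view of how sensitive the t interval is to the truncation.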

3.

A note on the linear discriminant function when group means are equal

Gregory T. Schwemer M Ray Mickey 《Communications in Statistics - Simulation and Computation》2013,42(6):633-638

The performance of the sample linear discriminant function with known, proportional, covariance matrices and equal but unknown mean vectors is considered. Unconditional misclassification rates are obtained from the Student-t distribution. These results can be used as an aid in verifying simulation programs incorporating the linear discriminant function when Gaussian densities with unequal covariance matrices are used.

4.

The studentized location linear discriminant function

C.Y. Leung 《Communications in Statistics - Theory and Methods》2013,42(11):3977-3990

The location linear discriminant function is used in a two-population classification problem when the available data are generated from both binary and continuous random variables. The asymptotic distribution of the studentized location linear discriminant function is derived directly without the inversion of the corresponding characteristic function. The resulting plug-in estimate of the overall error of misclassification consists of the estimate based on the limiting distribution of the discriminant plus a correction term up to the second order. By comparison, our estimate avoids exact knowledge of the Mahalanobis distances, which is necessary when the expansions of Vlachonikolis (1985) are used in the case of an arbitrary cut-off point. An example is re-examined and analysed in the present context.

5.

An Optimal Semiparametric Method for Two‐group Classification

《Scandinavian Journal of Statistics》2018,45(3):806-846

In the classical discriminant analysis, when two multivariate normal distributions with equal variance–covariance matrices are assumed for two groups, the classical linear discriminant function is optimal with respect to maximizing the standardized difference between the means of two groups. However, for a typical case‐control study, the distributional assumption for the case group often needs to be relaxed in practice. Komori et al. (Generalized t‐statistic for two‐group classification. Biometrics 2015, 71: 404–416) proposed the generalized t‐statistic to obtain a linear discriminant function, which allows for heterogeneity of the case group. Their procedure has an optimality property in the class of consideration. We perform a further study of the problem and show that additional improvement is achievable. The approach we propose does not require a parametric distributional assumption on the case group. We further show that the new estimator is efficient, in that no further improvement is possible to construct the linear discriminant function more efficiently. We conduct simulation studies and real data examples to illustrate the finite sample performance and the gain that it produces in comparison with existing methods.

6.

A class of weighted multivariate distributions related to doubly truncated multivariate t-distribution

Hea-Jung Kim 《Statistics》2013,47(1):89-106

This article introduces a class of weighted multivariate t-distributions, which includes the multivariate generalized Student t and multivariate skew t as its special members. This class is defined as the marginal distribution of a doubly truncated multivariate generalized Student t-distribution and studied from several aspects such as weighting of probability density functions, inequality constrained multivariate Student t-distributions, scale mixtures of multivariate normal and probabilistic representations. The relationships among these aspects are given, and various properties of the class are also discussed. Necessary theories and two applications are provided.

7.

Interpretation of Canonical Discriminant Functions, Canonical Variates, and Principal Components (total citations: 1; self-citations: 0; citations by others: 1)

Alvin C. Rencher 《The American statistician》2013,67(3):217-225

Canonical discriminant functions are defined here as linear combinations that separate groups of observations, and canonical variates are defined as linear combinations associated with canonical correlations between two sets of variables. In standardized form, the coefficients in either type of canonical function provide information about the joint contribution of the variables to the canonical function. The standardized coefficients can be converted to correlations between the variables and the canonical function. These correlations generally alter the interpretation of the canonical functions. For canonical discriminant functions, the standardized coefficients are compared with the correlations, with partial t and F tests, and with rotated coefficients. For canonical variates, the discussion includes standardized coefficients, correlations between variables and the function, rotation, and redundancy analysis. Various approaches to interpretation of principal components are compared: the choice between the covariance and correlation matrices, the conversion of coefficients to correlations, the rotation of the coefficients, and the effect of special patterns in the covariance and correlation matrices.

8.

Non‐Gaussian geostatistical modeling using (skew) t processes

Moreno Bevilacqua Christian Caamaño‐Carrillo Reinaldo B. Arellano‐Valle Víctor Morales‐Oñate 《Scandinavian Journal of Statistics》2021,48(1):212-245

We propose a new model for regression and dependence analysis when addressing spatial data with possibly heavy tails and an asymmetric marginal distribution. We first propose a stationary process with t marginals obtained through scale mixing of a Gaussian process with an inverse square root process with Gamma marginals. We then generalize this construction by considering a skew‐Gaussian process, thus obtaining a process with skew‐t marginal distributions. For the proposed (skew) t process, we study the second‐order and geometrical properties and, in the t case, we provide analytic expressions for the bivariate distribution. In an extensive simulation study, we investigate the use of the weighted pairwise likelihood as a method of estimation for the t process. Moreover, we compare the performance of the optimal linear predictor of the t process versus the optimal Gaussian predictor. Finally, the effectiveness of our methodology is illustrated by analyzing a georeferenced dataset on maximum temperatures in Australia.
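The scale-mixing idea in this abstract can be illustrated at a single location. The sketch below covers only the marginal construction, not the spatial process of the paper; the function names and the choice of a Gamma variable with shape ν/2 and mean 1 are assumptions for the example. Dividing a standard normal draw by the square root of such a Gamma draw yields a Student t variable with ν degrees of freedom.

```python
import math
import random


def draw_t_by_scale_mixture(rng, nu):
    """One Student t(nu) draw as Z / sqrt(G), with Z ~ N(0, 1) and G ~ Gamma(nu/2, scale 2/nu)."""
    # gammavariate takes (shape, scale); shape nu/2 and scale 2/nu give mean 1,
    # so nu * G follows a chi-square distribution with nu degrees of freedom.
    g = rng.gammavariate(nu / 2.0, 2.0 / nu)
    z = rng.gauss(0.0, 1.0)
    return z / math.sqrt(g)


def sample_second_moment(nu, n, seed=0):
    """Average of squared draws; for nu > 2 this estimates the variance nu / (nu - 2)."""
    rng = random.Random(seed)
    return sum(draw_t_by_scale_mixture(rng, nu) ** 2 for _ in range(n)) / n


if __name__ == "__main__":
    nu = 10.0
    print("simulated:", sample_second_moment(nu, 200000))
    print("theoretical:", nu / (nu - 2.0))
```

In the spatial setting, both the Gaussian draw and the Gamma draw would be replaced by correlated processes over locations, which this sketch does not attempt.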

9.

An asymptotic approximation for EPMC in linear discriminant analysis based on monotone missing data

Nobumichi Shutoh 《Journal of statistical planning and inference》2012,142(1):110-125

In this paper, we propose an asymptotic approximation for the expected probabilities of misclassification (EPMC) in the linear discriminant function on the basis of k-step monotone missing training data for general k. We derive certain relations of the statistics in order to obtain the approximation. Finally, we perform Monte Carlo simulation to evaluate the accuracy of our result and to compare it with existing approximations.

10.

Linear discrimination for three known normal populations

Mark J. Schervish 《Journal of statistical planning and inference》1984,10(2):167-175

A random vector is assumed to have one of three known multivariate normal distributions with equal covariance matrices. It is desired to separate the three distributions by means of a single linear discriminant function. Such a function can lead to a classification rule. The function whose classification rule minimizes the average of the three probabilities of misclassification is found, as is the function whose rule minimizes the maximum of the three probabilities of misclassification.

11.

A nonparametric allocation scheme for classification based on transvariation probabilities

《Journal of Statistical Computation and Simulation》2012,82(8):977-987

In this paper, a nonparametric discriminant analysis procedure that is less sensitive than traditional procedures to deviations from the usual assumptions is proposed. The procedure uses the projection pursuit methodology where the projection index is the two-group transvariation probability. Montanari [A. Montanari, Linear discriminant analysis and transvariation, J. Classification 21 (2004), pp. 71–88] proposed and used this projection index to measure group separation but allocated the new observation using distances. Our procedure employs a method of allocation based on group–group transvariation probability to classify the new observation. A simulation study shows that the procedure proposed in this paper provides lower misclassification error rates than classical procedures like linear discriminant analysis and quadratic discriminant analysis and recent procedures like maximum depth and Montanari's transvariation-based classifiers, when the underlying distributions are skewed and/or the prior probabilities are unequal.

12.

Discriminant analysis of survey data

Ching-Ho Leu Kam-Wah Tsui 《Journal of statistical planning and inference》1997,60(2):1115-290

We consider the problem of the effect of sample designs on discriminant analysis. The selection of the learning sample is assumed to depend on the population values of auxiliary variables. Under a superpopulation model with a multivariate normal distribution, unbiasedness and consistency are examined for the conventional estimators (derived under the assumptions of simple random sampling), maximum likelihood estimators, probability-weighted estimators and conditionally unbiased estimators of parameters. Four corresponding sampled linear discriminant functions are examined. The rates of misclassification of these four discriminant functions and the effect of sample design on these four rates of misclassification are discussed. The performances of these four discriminant functions are assessed in a simulation study.

13.

A Central Limit Theorem in Non‐parametric Regression with Truncated, Censored and Dependent Data

Han‐Ying Liang Jacobo de Uña‐Álvarez María del Carmen Iglesias‐Pérez 《Scandinavian Journal of Statistics》2015,42(1):256-269

On the basis of the idea of the Nadaraya–Watson (NW) kernel smoother and the technique of the local linear (LL) smoother, we construct the NW and LL estimators of conditional mean functions and their derivatives for a left‐truncated and right‐censored model. The target function includes the regression function, the conditional moment and the conditional distribution function as special cases. It is assumed that the lifetime observations with covariates form a stationary α‐mixing sequence. Asymptotic normality of the estimators is established. Finite sample behaviour of the estimators is investigated via simulations. A real data illustration is included too.

14.

Risk comparison of the Stein-rule estimator in a linear regression model with omitted relevant regressors and multivariate t errors under the Pitman nearness criterion

Akio Namba Kazuhiro Ohtani 《Statistical Papers》2007,48(1):151-162

In this paper we consider a linear regression model with omitted relevant regressors and multivariate t error terms. The explicit formula for the Pitman nearness criterion of the Stein-rule (SR) estimator relative to the ordinary least squares (OLS) estimator is derived. It is shown numerically that the dominance of the SR estimator over the OLS estimator under the Pitman nearness criterion can be extended to the case of the multivariate t error distribution when the specification error is not severe. It is also shown that the dominance of the SR estimator over the OLS estimator cannot be extended to the case of the multivariate t error distribution when the specification error is severe. This research is partially supported by the Grants-in-Aid for 21st Century COE program.

15.

Robust mixture modeling using the skew t distribution

Tsung I. Lin Jack C. Lee Wan J. Hsieh 《Statistics and Computing》2007,17(2):81-92

A finite mixture model using the Student's t distribution has been recognized as a robust extension of normal mixtures. Recently, a mixture of skew normal distributions has been found to be effective in the treatment of heterogeneous data involving asymmetric behaviors across subclasses. In this article, we propose a robust mixture framework based on the skew t distribution to efficiently deal with heavy-tailedness, extra skewness and multimodality in a wide range of settings. Statistical mixture modeling based on normal, Student's t and skew normal distributions can be viewed as special cases of the skew t mixture model. We present analytically simple EM-type algorithms for iteratively computing maximum likelihood estimates. The proposed methodology is illustrated by analyzing a real data example.

16.

Characteristic function of the SGT distribution

Saralees Nadarajah 《Statistics》2013,47(4):437-439

An explicit closed form is derived for the characteristic function for the skew generalized t distribution studied by Arslan and Genç [The skew generalized t (SGT) distribution as the scale mixture of a skew exponential power distribution and its applications in robust estimation, Statistics 43(5) (2009), pp. 481–498]. The expression involves the Wright generalized hypergeometric Ψ-function.

17.

Allowance for skewness in maximum-likelihood estimation with application to the location-scale model

Román Viveros David A. Sprott 《Revue canadienne de statistique》1987,15(4):349-361

The use of maximum-likelihood estimation as discussed by Sprott and Viveros (1984) is extended to include the log F distribution to accommodate skewness. The role played by linear pivotals in relation to likelihood and efficiency is discussed. Normal, t, and log F likelihoods are defined and used to generate possible normal, t, and log F linear pivotal quantities. The results are applied to the location-scale family, where exact results are available to assess the numerical accuracy of the proposed procedure. Refinements using saddlepoint approximations are obtained.

18.

On approximating distribution of the quadratic discriminant function

G. Rekabdar R. Chinipardaz B. Mansouri 《Communications in Statistics - Simulation and Computation》2017,46(5):3614-3626

The quadratic discriminant function (QDF) with known parameters has been represented in terms of a weighted sum of independent noncentral chi-square variables. To approximate the density function of the QDF as an m-dimensional exponential family, its moments of each order have been calculated. This is done using the recursive formula for the moments via Stein's identity in the exponential family. We validate the performance of our method using a simulation study and compare it with other methods in the literature on real data. The results show better estimation of misclassification probabilities and less computation time with our method.

19.

Mixture of linear mixed models using multivariate t distribution

《Journal of Statistical Computation and Simulation》2012,82(4):771-787

Linear mixed models are widely used when multiple correlated measurements are made on each unit of interest. In many applications, the units may form several distinct clusters, and such heterogeneity can be more appropriately modelled by a finite mixture linear mixed model. The classical estimation approach, in which both the random effects and the error parts are assumed to follow a normal distribution, is sensitive to outliers, and failure to accommodate outliers may greatly jeopardize the model estimation and inference. We propose a new mixture linear mixed model using the multivariate t distribution. For each mixture component, we assume the response and the random effects jointly follow a multivariate t distribution, to conveniently robustify the estimation procedure. An efficient expectation conditional maximization algorithm is developed for conducting maximum likelihood estimation. The degrees of freedom parameters of the t distributions are chosen data adaptively, to achieve a flexible trade-off between estimation robustness and efficiency. Simulation studies and an application to lung growth longitudinal data showcase the efficacy of the proposed approach.

20.

Fitting parametric frailty and mixture models under biased sampling

P. Economou 《Journal of applied statistics》2009,36(1):53-66

Biased sampling from an underlying distribution with p.d.f. f(t), t>0, implies that observations follow the weighted distribution with p.d.f. f^w(t)=w(t)f(t)/E[w(T)] for a known weight function w. In particular, the function w(t)=t^α has important applications, including length-biased sampling (α=1) and area-biased sampling (α=2). We first consider here the maximum likelihood estimation of the parameters of a distribution f(t) under biased sampling from a censored population in a proportional hazards frailty model where a baseline distribution (e.g. Weibull) is mixed with a continuous frailty distribution (e.g. Gamma). A right-censored observation contributes a term proportional to w(t)S(t) to the likelihood; this is not the same as S^w(t), so the problem of fitting the model does not simply reduce to fitting the weighted distribution. We present results on the distribution of frailty in the weighted distribution and develop an EM algorithm for estimating the parameters of the model in the important Weibull–Gamma case. We also give results for the case where f(t) is a finite mixture distribution. Results are presented for uncensored data and for Type I right censoring. Simulation results are presented, and the methods are illustrated on a set of lifetime data.
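A small worked case may make the distinction between w(t)S(t) and S^w(t) concrete. The exponential baseline with rate λ used below is an assumption chosen for the illustration and is not taken from the paper; the weight is the length-biased case w(t)=t.

```latex
% Illustrative assumption: exponential baseline f(t) = \lambda e^{-\lambda t}, weight w(t) = t.
\begin{align*}
  f(t) &= \lambda e^{-\lambda t}, \qquad S(t) = e^{-\lambda t}, \qquad E[w(T)] = E[T] = \frac{1}{\lambda}, \\
  f^{w}(t) &= \frac{w(t)\, f(t)}{E[w(T)]} = \lambda^{2}\, t\, e^{-\lambda t}, \\
  S^{w}(t) &= \int_{t}^{\infty} \lambda^{2} u\, e^{-\lambda u}\, du = (1 + \lambda t)\, e^{-\lambda t}, \\
  w(t)\, S(t) &= t\, e^{-\lambda t}.
\end{align*}
```

Here the weighted density is a Gamma density with shape 2 and rate λ, and the ratio of w(t)S(t) to S^w(t) is t/(1+λt), which depends on t. The two terms are therefore not proportional, in line with the abstract's remark that fitting under censoring does not reduce to fitting the weighted distribution.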