期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Testing diagonality of high-dimensional covariance matrix under non-normality

Kai Xu 《Journal of Statistical Computation and Simulation》2017,87(16):3208-3224

Under non-normality, this article is concerned with testing diagonality of high-dimensional covariance matrix, which is more practical than testing sphericity and identity in high-dimensional setting. The existing testing procedure for diagonality is not robust against either the data dimension or the data distribution, producing tests with distorted type I error rates much larger than nominal levels. This is mainly due to bias from estimating some functions of high-dimensional covariance matrix under non-normality. Compared to the sphericity and identity hypotheses, the asymptotic property of the diagonality hypothesis would be more involved and we should be more careful to deal with bias. We develop a correction that makes the existing test statistic robust against both the data dimension and the data distribution. We show that the proposed test statistic is asymptotically normal without the normality assumption and without specifying an explicit relationship between the dimension p and the sample size n. Simulations show that it has good size and power for a wide range of settings. 相似文献

2.

Tests for high-dimensional covariance matrices using the theory of U-statistics

M. Rauf Ahmad D. von Rosen 《Journal of Statistical Computation and Simulation》2015,85(13):2619-2631

Test statistics for sphericity and identity of the covariance matrix are presented, when the data are multivariate normal and the dimension, p, can exceed the sample size, n. Under certain mild conditions mainly on the traces of the unknown covariance matrix, and using the asymptotic theory of U-statistics, the test statistics are shown to follow an approximate normal distribution for large p, also when p?n. The accuracy of the statistics is shown through simulation results, particularly emphasizing the case when p can be much larger than n. A real data set is used to illustrate the application of the proposed test statistics. 相似文献

3.

Testing homogeneity of several covariance matrices and multi-sample sphericity for high-dimensional data under non-normality

M. Rauf Ahmad 《统计学通讯:理论与方法》2017,46(8):3738-3753

A test for homogeneity of g ? 2 covariance matrices is presented when the dimension, p, may exceed the sample size, n_i, i = 1, …, g, and the populations may not be normal. Under some mild assumptions on covariance matrices, the asymptotic distribution of the test is shown to be normal when n_i, p → ∞. Under the null hypothesis, the test is extended for common covariance matrix to be of a specified structure, including sphericity. Theory of U-statistics is employed in constructing the tests and deriving their limits. Simulations are used to show the accuracy of tests. 相似文献

4.

Bayesian Inference on Multivariate Normal Covariance and Precision Matrices in a Star-Shaped Model with Missing Data

Xiaoqian Sun Dongchu Sun Zhuoqiong He 《统计学通讯:理论与方法》2013,42(4):642-666

In this article, we study Bayesian estimation for the covariance matrix Σ and the precision matrix Ω (the inverse of the covariance matrix) in the star-shaped model with missing data. Based on a Cholesky-type decomposition of the precision matrix Ω = Ψ′Ψ, where Ψ is a lower triangular matrix with positive diagonal elements, we develop the Jeffreys prior and a reference prior for Ψ. We then introduce a class of priors for Ψ, which includes the invariant Haar measures, Jeffreys prior, and reference prior. The posterior properties are discussed and the closed-form expressions for Bayesian estimators for the covariance matrix Σ and the precision matrix Ω are derived under the Stein loss, entropy loss, and symmetric loss. Some simulation results are given for illustration. 相似文献

5.

The formal posterior of a standard flat prior in MANOVA is incoherent

Morris L. Eaton William D. Sudderth 《Statistical Methods and Applications》1995,4(2):251-270

Summary A standard improper prior for the parameters of a MANOVA model is shown to yield an inference that is incoherent in the sense of Heath and Sudderth. The proof of incoherence is based on the fact that the formal Bayes estimate, sayδ ₀, of the covariance matrix based on the improper prior and a certain bounded loss function is uniformly inadmissible in that there is another estimatorδ _l and an ɛ>0 such that the risk functions satisfyR(δ _l,Σ)⩽R δ ₀,Σ)−ε for all values of the covariance matrix Σ. The estimatorδ _I is formal Bayes for an alternative improper prior which leads to a coherent inference. Research supported by National Science Foundation grants DMS-89-22607 (for Eaton) and DMS-9123358 (for Sudderth). 相似文献

6.

Central limit theorems for functionals of large sample covariance matrix and mean vector in matrix‐variate location mixture of normal distributions

Taras Bodnar Stepan Mazur Nestor Parolya 《Scandinavian Journal of Statistics》2019,46(2):636-660

In this paper, we consider the asymptotic distributions of functionals of the sample covariance matrix and the sample mean vector obtained under the assumption that the matrix of observations has a matrix‐variate location mixture of normal distributions. The central limit theorem is derived for the product of the sample covariance matrix and the sample mean vector. Moreover, we consider the product of the inverse sample covariance matrix and the mean vector for which the central limit theorem is established as well. All results are obtained under the large‐dimensional asymptotic regime, where the dimension p and the sample size n approach infinity such that p/n→c ∈ [0, + ∞) when the sample covariance matrix does not need to be invertible and p/n→c ∈ [0,1) otherwise. 相似文献

7.

Testing identity of high-dimensional covariance matrix

Hao Wang Baisen Liu Ning-Zhong Shi Shurong Zheng 《Journal of Statistical Computation and Simulation》2018,88(13):2600-2611

Two new statistics are proposed for testing the identity of high-dimensional covariance matrix. Applying the large dimensional random matrix theory, we study the asymptotic distributions of our proposed statistics under the situation that the dimension p and the sample size n tend to infinity proportionally. The proposed tests can accommodate the situation that the data dimension is much larger than the sample size, and the situation that the population distribution is non-Gaussian. The numerical studies demonstrate that the proposed tests have good performance on the empirical powers for a wide range of dimensions and sample sizes. 相似文献

8.

ASYMPTOTIC DISTRIBUTION OF THE LARGEST EIGENVALUE

《统计学通讯:模拟与计算》2013,42(3):357-373

ABSTRACT

This paper studies the asymptotic distribution of the largest eigenvalue of the sample covariance matrix. The multivariate distribution for the population is assumed to be elliptical with finite kurtosis 3κ. An expression as an expectation is obtained for the distribution function of the largest eigenvalue regardless of the multiplicity, m, of the population's largest eigenvalue. The asymptotic distribution function and density function are evaluated numerically for m = 2,3,4,5. The bootstrap of the average of the m largest eigenvalues is shown to be consistent for any underlying distribution with finite fourth-order cumulants. 相似文献

9.

A Bayesian analysis for the Wilcoxon signed-rank statistic

Richard A. Chechile 《统计学通讯:理论与方法》2018,47(21):5241-5254

A Bayesian analysis is provided for the Wilcoxon signed-rank statistic (T⁺). The Bayesian analysis is based on a sign-bias parameter φ on the (0, 1) interval. For the case of a uniform prior probability distribution for φ and for small sample sizes (i.e., 6 ? n ? 25), values for the statistic T⁺ are computed that enable probabilistic statements about φ. For larger sample sizes, approximations are provided for the asymptotic likelihood function P(T⁺|φ) as well as for the posterior distribution P(φ|T⁺). Power analyses are examined both for properly specified Gaussian sampling and for misspecified non Gaussian models. The new Bayesian metric has high power efficiency in the range of 0.9–1 relative to a standard t test when there is Gaussian sampling. But if the sampling is from an unknown and misspecified distribution, then the new statistic still has high power; in some cases, the power can be higher than the t test (especially for probability mixtures and heavy-tailed distributions). The new Bayesian analysis is thus a useful and robust method for applications where the usual parametric assumptions are questionable. These properties further enable a way to do a generic Bayesian analysis for many non Gaussian distributions that currently lack a formal Bayesian model. 相似文献

10.

Asymptotic calculations of functions of expected values and covariances of order statistics

M. A. Stephens 《Revue canadienne de statistique》1990,18(3):265-270

Suppose m and V are respectively the vector of expected values and the covariance matrix of the order statistics of a sample of size n from a continuous distribution F. A method is presented to calculate asymptotic values of functions of m and V ^–1, for distributions F which are sufficiently regular. Values are given for the normal, logistic, and extreme-value distributions; also, for completeness, for the uniform and exponential distributions, although for these other methods must be used. 相似文献

11.

A Comparison of Bayesian and Frequentist Interval Estimators in Regression that Utilize Uncertain Prior Information

下载免费PDF全文

Paul Kabaila Gayan Dharmarathne 《Australian & New Zealand Journal of Statistics》2015,57(1):99-118

相似文献

12.

Bayes and Empirical Bayes Estimation of the Parameter n in a Binomial Distribution

Husam Awni Bayoud 《统计学通讯:模拟与计算》2013,42(9):1422-1433

The problem of estimating the total number of trials n in a binomial distribution is reconsidered in this article for both cases of known and unknown probability of success p from the Bayesian viewpoint. Bayes and empirical Bayes point estimates for n are proposed under the assumption of a left-truncated prior distribution for n and a beta prior distribution for p. Simulation studies are provided in this article in order to compare the proposed estimate with the most familiar n estimates. 相似文献

13.

Bayesian predictive power: choice of prior and some recommendations for its use as probability of success in drug development

下载免费PDF全文

Kaspar Rufibach Hans Ulrich Burger Markus Abt 《Pharmaceutical statistics》2016,15(5):438-446

Bayesian predictive power, the expectation of the power function with respect to a prior distribution for the true underlying effect size, is routinely used in drug development to quantify the probability of success of a clinical trial. Choosing the prior is crucial for the properties and interpretability of Bayesian predictive power. We review recommendations on the choice of prior for Bayesian predictive power and explore its features as a function of the prior. The density of power values induced by a given prior is derived analytically and its shape characterized. We find that for a typical clinical trial scenario, this density has a u‐shape very similar, but not equal, to a β‐distribution. Alternative priors are discussed, and practical recommendations to assess the sensitivity of Bayesian predictive power to its input parameters are provided. Copyright © 2016 John Wiley & Sons, Ltd. 相似文献

14.

O(N 2)-Operation Approximation of Covariance Matrix Inverse in Gaussian Process Regression Based on Quasi-Newton BFGS Method

W. E. Leithead Yunong Zhang 《统计学通讯:模拟与计算》2013,42(2):367-380

Gaussian process (GP) is a Bayesian nonparametric regression model, showing good performance in various applications. However, during its model-tuning procedure, the GP implementation suffers from numerous covariance-matrix inversions of expensive O(N³) operations, where N is the matrix dimension. In this article, we propose using the quasi-Newton BFGS O(N²)-operation formula to approximate/replace recursively the inverse of covariance matrix at every iteration. The implementation accuracy is guaranteed carefully by a matrix-trace criterion and by the restarts technique to generate good initial guesses. A number of numerical tests are then performed based on the sinusoidal regression example and the Wiener–Hammerstein identification example. It is shown that by using the proposed implementation, more than 80% O(N³) operations could be eliminated, and a typical speedup of 5–9 could be achieved as compared to the standard maximum-likelihood-estimation (MLE) implementation commonly used in Gaussian process regression. 相似文献

15.

Parsimonious Estimation of the Covariance Matrix in Multinomial Probit Models

Edward Cripps Denzil G. Fiebig 《Econometric Reviews》2013,32(2):146-157

This article presents a Bayesian analysis of a multinomial probit model by building on previous work that specified priors on identified parameters. The main contribution of our article is to propose a prior on the covariance matrix of the latent utilities that permits elements of the inverse of the covariance matrix to be identically zero. This allows a parsimonious representation of the covariance matrix when such parsimony exists. The methodology is applied to both simulated and real data, and its ability to obtain more efficient estimators of the covariance matrix and regression coefficients is assessed using simulated data. 相似文献

16.

Selection of Variables in Multivariate Regression Models for Large Dimensions

《统计学通讯:理论与方法》2012,41(13-14):2465-2489

The Akaike information criterion, AIC, and Mallows’ C _p statistic have been proposed for selecting a smaller number of regressors in the multivariate regression models with fully unknown covariance matrix. All of these criteria are, however, based on the implicit assumption that the sample size is substantially larger than the dimension of the covariance matrix. To obtain a stable estimator of the covariance matrix, it is required that the dimension of the covariance matrix is much smaller than the sample size. When the dimension is close to the sample size, it is necessary to use ridge-type estimators for the covariance matrix. In this article, we use a ridge-type estimators for the covariance matrix and obtain the modified AIC and modified C _p statistic under the asymptotic theory that both the sample size and the dimension go to infinity. It is numerically shown that these modified procedures perform very well in the sense of selecting the true model in large dimensional cases. 相似文献

17.

Near Optimal Prediction from Relevant Components

INGE S. HELLAND SOLVE SAEBØ HA˚KON TJELMELAND 《Scandinavian Journal of Statistics》2012,39(4):695-713

Abstract. The random x regression model is approached through the group of rotations of the eigenvectors for the x ‐covariance matrix together with scale transformations for each of the corresponding regression coefficients. The partial least squares model can be constructed from the orbits of this group. A generalization of Pitman's Theorem says that the best equivariant estimator under a group is given by the Bayes estimator with the group's invariant measure as the prior. A straightforward application of this theorem turns out to be impossible since the relevant invariant prior leads to a non‐defined posterior. Nevertheless we can devise an approximate scale group with a proper invariant prior leading to a well‐defined posterior distribution with a finite mean. This Bayes estimator is explored using Markov chain Monte Carlo technique. The estimator seems to require heavy computations, but can be argued to have several nice properties. It is also a valid estimator when p>n. 相似文献

18.

Prediction intervals with a Dirichlet-process prior distribution

Gregory Campbell Myles Hollander 《Revue canadienne de statistique》1982,10(2):103-111

Let X₁, …,X_n, and Y₁, … Y_n be consecutive samples from a distribution function F which itself is randomly chosen according to the Ferguson (1973) Dirichlet-process prior distribution on the space of distribution functions. Typically, prediction intervals employ the observations X₁,…, X_n in the first sample in order to predict a specified function of the future sample Y₁, …, Y_n. Here one- and two-sided prediction intervals for at least q of N future observations are developed for the situation in which, in addition to the previous sample, there is prior information available. The information is specified via the parameter α of the Dirichlet process prior distribution. 相似文献

19.

Approximate reference priors for Gaussian random fields

Victor De Oliveira Zifei Han 《Scandinavian Journal of Statistics》2023,50(1):296-326

Reference priors are theoretically attractive for the analysis of geostatistical data since they enable automatic Bayesian analysis and have desirable Bayesian and frequentist properties. But their use is hindered by computational hurdles that make their application in practice challenging. In this work, we derive a new class of default priors that approximate reference priors for the parameters of some Gaussian random fields. It is based on an approximation to the integrated likelihood of the covariance parameters derived from the spectral approximation of stationary random fields. This prior depends on the structure of the mean function and the spectral density of the model evaluated at a set of spectral points associated with an auxiliary regular grid. In addition to preserving the desirable Bayesian and frequentist properties, these approximate reference priors are more stable, and their computations are much less onerous than those of exact reference priors. Unlike exact reference priors, the marginal approximate reference prior of correlation parameter is always proper, regardless of the mean function or the smoothness of the correlation function. This property has important consequences for covariance model selection. An illustration comparing default Bayesian analyses is provided with a dataset of lead pollution in Galicia, Spain. 相似文献

20.

Limiting distributions of Kolmogorov-Lévy-type statistics under the alternative

Andrzej Kozek 《Revue canadienne de statistique》1987,15(1):77-85

Let X_i, 1 ≤ i ≤ n, be independent identically distributed random variables with a common distribution function F, and let G be a smooth distribution function. We derive the limit distribution of □{ρ_α(F_n, G) - _α(F, G)}, where F_n is the empirical distribution function based on X₁,…,X_n and _α is a Kolmogorov-Lévy-type metric between distribution functions. For α ≤ 0 and two distribution functions F and G the metric p_α is given by p_α(F, G) = inf {? ≤ 0: G(x - α?) - ? F(x) ≤ G(x + α?) + ? for all x ?}. 相似文献