期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A distribution-free approach to inducing rank correlation among input variables

Ronald L. Iman W. J. Conover 《统计学通讯:模拟与计算》2013,42(3):311-334

A method for inducing a desired rank correlation matrix on a multivariate input random variable for use in a simulation study is introduced in this paper. This method is simple to use, is distribution free, preserves the exact form of the marginal distributions on the input variables, and may be used with any type of sampling scheme for which correlation of input variables is a meaningful concept. A Monte Carlo study provides an estimate of the bias and variability associated with the method. Input variables used in a model for study of geologic disposal of radioactive waste provide an example of the usefulness of this procedure. A textbook example shows how the output may be affected by the method presented in this paper. 相似文献

2.

Modeling Dependence in High Dimensions With Factor Copulas

Dong Hwan Oh Andrew J. Patton 《商业与经济统计学杂志》2017,35(1):139-154

This article presents flexible new models for the dependence structure, or copula, of economic variables based on a latent factor structure. The proposed models are particularly attractive for relatively high-dimensional applications, involving 50 or more variables, and can be combined with semiparametric marginal distributions to obtain flexible multivariate distributions. Factor copulas generally lack a closed-form density, but we obtain analytical results for the implied tail dependence using extreme value theory, and we verify that simulation-based estimation using rank statistics is reliable even in high dimensions. We consider “scree” plots to aid the choice of the number of factors in the model. The model is applied to daily returns on all 100 constituents of the S&P 100 index, and we find significant evidence of tail dependence, heterogeneous dependence, and asymmetric dependence, with dependence being stronger in crashes than in booms. We also show that factor copula models provide superior estimates of some measures of systemic risk. Supplementary materials for this article are available online. 相似文献

3.

Distributional aspects in latent variable models

M. Kukuk 《Statistical Papers》1994,35(1):231-242

For observable indicators with ordered categories one can assume underlying latent variables following certain marginal distributions. Transforming the latent variables changes its marginal distributions but not the observable qualitative indicators. The joint distribution of the latent variables can be constructed from the marginal distributions. There is a broad class of multivariate distributions for which the observable indicators are equivalent. By choosing the multivariate normal distribution from this class we can analyse a linear relationship between the transformed latent variables. This leads to latent structural equation models. Estimation of these latter models is therefore more general than the distributional assumption might initially suggest. Robustness of the estimation procedure is also discussed for deviations from this distribution family. Using ordinal business survey data of the German Ifo-institute we test the efficiency of firms' price expectations implied by the rational expectation hypothesis. 相似文献

4.

Posterior moments of scalar functions of a covariance matrix

Heiberger M. Richard 《统计学通讯:模拟与计算》2013,42(2-3):107-140

Given multivariate normal data and a certain spherically invariant prior distribution on the covariance matrix, it is desired to estimate the moments of the posterior marginal distributions of some scalar functions of the covariance matrix by importance sampling. To this end a family of distributions is defined on the group of orthogonal matrices and a procedure is proposed for selecting one of these distributions for use as a weighting distribution in the importance sampling process. In an example estimates are calculated for the posterior mean and variance of each element in the covariance matrix expressed in the original coordinates, for the posterior mean of each element in the correlation matrix expressed in the original coordinates, and for the posterior mean of each element in the covariance matrix expressed in the coordinates of the principal variables. 相似文献

5.

Modeling the Agreement of Discrete Bivariate Survival Times using Kappa Coefficient

Guo Y Manatunga AK 《Lifetime data analysis》2005,11(3):309-332

Using local kappa coefficients, we develop a method to assess the agreement between two discrete survival times that are measured on the same subject by different raters or methods. We model the marginal distributions for the two event times and local kappa coefficients in terms of covariates. An estimating equation is used for modeling the marginal distributions and a pseudo-likelihood procedure is used to estimate the parameters in the kappa model. The performance of the estimation procedure is examined through simulations. The proposed method can be extended to multivariate discrete survival distributions. 相似文献

6.

Empirical likelihood method for the multivariate accelerated failure time models

Ming Zheng Wen Yu 《Journal of statistical planning and inference》2011,141(2):972-983

In applications, multivariate failure time data appears when each study subject may potentially experience several types of failures or recurrences of a certain phenomenon, or failure times may be clustered. Three types of marginal accelerated failure time models dealing with multiple events data, recurrent events data and clustered events data are considered. We propose a unified empirical likelihood inferential procedure for the three types of models based on rank estimation method. The resulting log-empirical likelihood ratios are shown to possess chi-squared limiting distributions. The properties can be applied to do tests and construct confidence regions without the need to solve the rank estimating equations nor to estimate the limiting variance-covariance matrices. The related computation is easy to implement. The proposed method is illustrated by extensive simulation studies and a real example. 相似文献

7.

A strategy for constructing multivariate distributions

Abderrahmane Chakak Kenneth J. Koehler 《统计学通讯:模拟与计算》2013,42(3):537-550

Given a random vector (X₁,…, X_n) for which the univariate and bivariate marginal distributions belong to some specified families of distributions, we present a procedure for constructing families of multivariate distributions with the specified univariate and bivariate margins. Some general properties of the resulting families of multivariate distributions are reviewed. This procedure is illustrated by generalizing the bivariate Plackett (1965) and Clayton (1978) distributions to three dimensions. In addition to providing rich families of models for data analysis, this method of construction provides a convenient way of simulating observations from multivariate distributions with specific types of univariate and bivariate marginal distributions. A general algorithm for simulating random observations from these families of multivariate distributions is presented 相似文献

8.

In Memoriam: Bernard George Greenberg 1919-1985

James R. Abernathy Pranab K. Sen 《The American statistician》2013,67(3):183-184

The bivariate normal density with unit variance and correlation ρ is well known. We show that by integrating out ρ, the result is a function of the maximum norm. The Bayesian interpretation of this result is that if we put a uniform prior over ρ, then the marginal bivariate density depends only on the maximal magnitude of the variables. The square-shaped isodensity contour of this resulting marginal bivariate density can also be regarded as the equally weighted mixture of bivariate normal distributions over all possible correlation coefficients. This density links to the Khintchine mixture method of generating random variables. We use this method to construct the higher dimensional generalizations of this distribution. We further show that for each dimension, there is a unique multivariate density that is a differentiable function of the maximum norm and is marginally normal, and the bivariate density from the integral over ρ is its special case in two dimensions. 相似文献

9.

A comparative study of tests for paired lifetime data

Wang Z Ng HK 《Lifetime data analysis》2006,12(4):505-522

In this paper, we investigate different procedures for testing the equality of two mean survival times in paired lifetime studies. We consider Owen’s M-test and Q-test, a likelihood ratio test, the paired t-test, the Wilcoxon signed rank test and a permutation test based on log-transformed survival times in the comparative study. We also consider the paired t-test, the Wilcoxon signed rank test and a permutation test based on original survival times for the sake of comparison. The size and power characteristics of these tests are studied by means of Monte Carlo simulations under a frailty Weibull model. For less skewed marginal distributions, the Wilcoxon signed rank test based on original survival times is found to be desirable. Otherwise, the M-test and the likelihood ratio test are the best choices in terms of power. In general, one can choose a test procedure based on information about the correlation between the two survival times and the skewness of the marginal survival distributions. 相似文献

10.

Eliciting Dirichlet and Gaussian copula prior distributions for multinomial models

Fadlalla G. Elfadaly Paul H. Garthwaite 《Statistics and Computing》2017,27(2):449-467

In this paper, we propose novel methods of quantifying expert opinion about prior distributions for multinomial models. Two different multivariate priors are elicited using median and quartile assessments of the multinomial probabilities. First, we start by eliciting a univariate beta distribution for the probability of each category. Then we elicit the hyperparameters of the Dirichlet distribution, as a tractable conjugate prior, from those of the univariate betas through various forms of reconciliation using least-squares techniques. However, a multivariate copula function will give a more flexible correlation structure between multinomial parameters if it is used as their multivariate prior distribution. So, second, we use beta marginal distributions to construct a Gaussian copula as a multivariate normal distribution function that binds these marginals and expresses the dependence structure between them. The proposed method elicits a positive-definite correlation matrix of this Gaussian copula. The two proposed methods are designed to be used through interactive graphical software written in Java. 相似文献

11.

A comparison of methods for simulating correlated binary variables with specified marginal means and correlations

《Journal of Statistical Computation and Simulation》2012,82(11):2441-2452

Simulation studies employed to study properties of estimators for parameters in population-average models for clustered or longitudinal data require suitable algorithms for data generation. Methods for generating correlated binary data that allow general specifications of the marginal mean and correlation structures are particularly useful. We compare an algorithm based on dichotomizing multi-normal variates to one based on a conditional linear family (CLF) of distributions [Qaqish BF. A family of multivariate binary distributions for simulating correlated binary variables with specified marginal means and correlations. Biometrika. 2003;90:455–463] with respect to range restrictions induced on correlations. Examples include generating longitudinal binary data and generating correlated binary data compatible with specified marginal means and covariance structures for bivariate, overdispersed binomial outcomes. Results show the CLF method gives a wider range of correlations for longitudinal data having autocorrelated within-subject associations, while the multivariate probit method gives a wider range of correlations for clustered data having exchangeable-type correlations. In the case of a decaying-product correlation structure, it is shown that the CLF method achieves the nonparametric limits on the range of correlations, which cannot be surpassed by any method. 相似文献

12.

PARTIAL CORRELATION AND CONDITIONAL CORRELATION AS MEASURES OF CONDITIONAL INDEPENDENCE 总被引：1，自引：0，他引：1

Kunihiro Baba Ritei Shibata Masaaki Sibuya 《Australian & New Zealand Journal of Statistics》2004,46(4):657-664

This paper investigates the roles of partial correlation and conditional correlation as measures of the conditional independence of two random variables. It first establishes a sufficient condition for the coincidence of the partial correlation with the conditional correlation. The condition is satisfied not only for multivariate normal but also for elliptical, multivariate hypergeometric, multivariate negative hypergeometric, multinomial and Dirichlet distributions. Such families of distributions are characterized by a semigroup property as a parametric family of distributions. A necessary and sufficient condition for the coincidence of the partial covariance with the conditional covariance is also derived. However, a known family of multivariate distributions which satisfies this condition cannot be found, except for the multivariate normal. The paper also shows that conditional independence has no close ties with zero partial correlation except in the case of the multivariate normal distribution; it has rather close ties to the zero conditional correlation. It shows that the equivalence between zero conditional covariance and conditional independence for normal variables is retained by any monotone transformation of each variable. The results suggest that care must be taken when using such correlations as measures of conditional independence unless the joint distribution is known to be normal. Otherwise a new concept of conditional independence may need to be introduced in place of conditional independence through zero conditional correlation or other statistics. 相似文献

13.

Robust multivariate mixture regression models with incomplete data

Hwa Kyung Lim Naveen N. Narisetty 《Journal of Statistical Computation and Simulation》2017,87(2):328-347

Multivariate mixture regression models can be used to investigate the relationships between two or more response variables and a set of predictor variables by taking into consideration unobserved population heterogeneity. It is common to take multivariate normal distributions as mixing components, but this mixing model is sensitive to heavy-tailed errors and outliers. Although normal mixture models can approximate any distribution in principle, the number of components needed to account for heavy-tailed distributions can be very large. Mixture regression models based on the multivariate t distributions can be considered as a robust alternative approach. Missing data are inevitable in many situations and parameter estimates could be biased if the missing values are not handled properly. In this paper, we propose a multivariate t mixture regression model with missing information to model heterogeneity in regression function in the presence of outliers and missing values. Along with the robust parameter estimation, our proposed method can be used for (i) visualization of the partial correlation between response variables across latent classes and heterogeneous regressions, and (ii) outlier detection and robust clustering even under the presence of missing values. We also propose a multivariate t mixture regression model using MM-estimation with missing information that is robust to high-leverage outliers. The proposed methodologies are illustrated through simulation studies and real data analysis. 相似文献

14.

HIGH-DIMENSIONAL PARAMETRIC MODELLING OF MULTIVARIATE EXTREME EVENTS

Alec G. Stephenson 《Australian & New Zealand Journal of Statistics》2009,51(1):77-88

Multivariate extreme events are typically modelled using multivariate extreme value distributions. Unfortunately, there exists no finite parametrization for the class of multivariate extreme value distributions. One common approach is to model extreme events using some flexible parametric subclass. This approach has been limited to only two or three dimensions, primarily because suitably flexible high-dimensional parametric models have prohibitively complex density functions. We present an approach that allows a number of popular flexible models to be used in arbitrarily high dimensions. The approach easily handles missing and censored data, and can be employed when modelling componentwise maxima and multivariate threshold exceedances. The approach is based on a representation using conditionally independent marginal components, conditioning on positive stable random variables. We use Bayesian inference, where the conditioning variables are treated as auxiliary variables within Markov chain Monte Carlo simulations. We demonstrate these methods with an application to sea-levels, using data collected at 10 sites on the east coast of England. 相似文献

15.

Latent Variable Models in Heterogeneous Spaces for Observations of Mixed Types

Ernest Fokoué 《统计学通讯:理论与方法》2013,42(1):42-53

The maximum likelihood approach to the estimation of factor analytic model parameters most commonly deals with outcomes that are assumed to be multivariate Gaussian random variables in a homogeneous input space. In many practical settings, however, many studies needing factor analytic modeling involve data that, not only are not multivariate Gaussian variables, but also come from a partitioned input space. This article introduces an extension of the maximum likelihood factor analysis that handles multivariate outcomes made up of attributes with different probability distributions, and originating from a partitioned input space. An EM Algorithm combined with Fisher Scoring is used to estimate the parameters of the derived model. 相似文献

16.

Small sample sensitivity analysis techniques for computer models.with an application to risk assessment

Ronald L. Iman W.J. Conover 《统计学通讯:理论与方法》2013,42(17):1749-1842

As modeling efforts expand to a broader spectrum of areas the amount of computer time required to exercise the corresponding computer codes has become quite costly (several hours for a single run is not uncommon). This costly process can be directly tied to the complexity of the modeling and to the large number of input variables (often numbering in the hundreds) Further, the complexity of the modeling (usually involving systems of differential equations) makes the relationships among the input variables not mathematically tractable. In this setting it is desired to perform sensitivity studies of the input-output relationships. Hence, a judicious selection procedure for the choic of values of input variables is required, Latin hypercube sampling has been shown to work well on this type of problem.

However, a variety of situations require that decisions and judgments be made in the face of uncertainty. The source of this uncertainty may be lack ul knowledge about probability distributions associated with input variables, or about different hypothesized future conditions, or may be present as a result of different strategies associated with a decision making process In this paper a generalization of Latin hypercube sampling is given that allows these areas to be investigated without making additional computer runs. In particular it is shown how weights associated with Latin hypercube input vectors may be rhangpd to reflect different probability distribution assumptions on key input variables and yet provide: an unbiased estimate of the cumulative distribution function of the output variable. This allows for different distribution assumptions on input variables to be studied without additional computer runs and without fitting a response surface. In addition these same weights can be used in a modified nonparametric Friedman test to compare treatments, Sample size requirements needed to apply the results of the work are also considered. The procedures presented in this paper are illustrated using a model associated with the risk assessment of geologic disposal of radioactive waste. 相似文献

17.

Some contributions on the multivariate Poisson–Skellam probability distribution

Blache Paul Akpoue Jean-François Angers 《统计学通讯:理论与方法》2017,46(1):49-68

In this article, we introduce a new form of distribution whose components have the Poisson or Skellam marginal distributions. This new specification allows the incorporation of relevant information on the nature of the correlations between every component. In addition, we present some properties of this distribution. Unlike the multivariate Poisson distribution, it can handle variables with positive and negative correlations. It should be noted that we are only interested in modeling covariances of order 2, which means between all pairs of variables. Some simulations are presented to illustrate the estimation methods. Finally, an application of soccer teams data will highlight the relationship between number of points per season and the goal differential by some covariates. 相似文献

18.

The Gaussian rank correlation estimator: robustness properties

Kris Boudt Jonathan Cornelissen Christophe Croux 《Statistics and Computing》2012,22(2):471-483

The Gaussian rank correlation equals the usual correlation coefficient computed from the normal scores of the data. Although its influence function is unbounded, it still has attractive robustness properties. In particular, its breakdown point is above 12%. Moreover, the estimator is consistent and asymptotically efficient at the normal distribution. The correlation matrix obtained from pairwise Gaussian rank correlations is always positive semidefinite, and very easy to compute, also in high dimensions. We compare the properties of the Gaussian rank correlation with the popular Kendall and Spearman correlation measures. A simulation study confirms the good efficiency and robustness properties of the Gaussian rank correlation. In the empirical application, we show how it can be used for multivariate outlier detection based on robust principal component analysis. 相似文献

19.

The Multivariate Randomized Complete Block Design: A Novel Permutation Solution in Case of Ordered Categorical Variables

《统计学通讯:理论与方法》2012,41(16-17):3094-3109

In this article, multivariate extensions of the combination-based test statistics for the comparison of several treatments in the multivariate Randomized Complete Block designs are introduced in case of categorical response variables. Several tests for the multivariate Randomized Complete Block designs, including MANOVA procedure, are compared with the method proposed via a Monte Carlo simulation study. The method has also been applied to a real case study in the field of sensorial testing studies. Results suggest that in each experimental situation where normality of the supposed underlying continuous model is hard to justify and especially when errors have heavy-tailed distributions, the proposed nonparametric procedure can be considered as a valid solution. 相似文献

20.

Dynamic Graphics for Exploring Spatial Data with Application to Locating Global and Local Anomalies

John Haslett Ronan Bradley Peter Craig Antony Unwin Graham Wills 《The American statistician》2013,67(3):234-242

We explore the application of dynamic graphics to the exploratory analysis of spatial data. We introduce a number of new tools and illustrate their use with prototype software, developed at Trinity College, Dublin. These tools are used to examine local variability—anomalies—through plots of the data that display its marginal and multivariate distributions, through interactive smoothers, and through plots motivated by the spatial auto-covariance ideas implicit in the variogram. We regard these as alternative and linked views of the data. We conclude that the most important single view of the data is the Map View: All other views must be cross-referred to this, and the software must encourage this. The view can be enriched by overlaying on other pertinent spatial information. We draw attention to the possibilities of one-many linking, and to the use of line-objects to link pairs of data points. We draw attention to the parallels with work on Geographical Information Systems. 相似文献