期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

An RKHS framework for functional data analysis

Ana Kupresanin Hyejin Shin David King R.L. Eubank 《Journal of statistical planning and inference》2010

Linear combinations of random variables play a crucial role in multivariate analysis. Two extension of this concept are considered for functional data and shown to coincide using the Loève–Parzen reproducing kernel Hilbert space representation of a stochastic process. This theory is then used to provide an extension of the multivariate concept of canonical correlation. A solution to the regression problem of best linear unbiased prediction is obtained from this abstract canonical correlation formulation. The classical identities of Lawley and Rao that lead to canonical factor analysis are also generalized to the functional data setting. Finally, the relationship between Fisher's linear discriminant analysis and canonical correlation analysis for random vectors is extended to include situations with function-valued random elements. This allows for classification using the canonical Y scores and related distance measures. 相似文献

2.

Variable selection and interpretation in canonical correlation analysis

Noriah M. Al-Kandari Ian T Jolliffe 《统计学通讯:模拟与计算》2013,42(3):873-900

The canonical variates in canonical correlation analysis are often interpreted by looking at the weights or loadings of the variables in each canonical variate and effectively ignoring those variables whose weights or loadings are small. It is shown that such a procedure can be misleading. The related problem of selecting a subset of the original variables which preserves the information in the most important canonical variates is also examined. Because of different possible definitions of ‘the information in canonical variates’, any such subset selection needs very careful consideration. 相似文献

3.

企业年报数据中工业增加值异常情况的诊断

下载免费PDF全文

孙海燕《统计研究》1999,16(10):39-44

长期以来,我国政府对统计数据的质量极为重视,并通过各级统计人员的不懈努力,已经探索出一套较为完整的、准确的和有效的控制数据质量的措施和方法,积累了相当丰富的实际操作经验,对提高统计数据质量起到了必不可少的作用。但是,这些措施和方法更偏重于具体细则的实施和实际经验的运用,缺少控制数据质量方法的介入。特别是在不同经济模式下,统计数据摆动相当显著,仅凭经验判断虚假数据就远远不能满足要求。形势迫切地要求我们除了加大政策和法规实施力度外,还必须尽快引进现代统计方法,实现从经验到经验和方法论相结合的过渡,以… 相似文献

4.

Peter Bühlmann Philipp Rütimann Sara van de Geer Cun-Hui Zhang 《Journal of statistical planning and inference》2013

We consider estimation in a high-dimensional linear model with strongly correlated variables. We propose to cluster the variables first and do subsequent sparse estimation such as the Lasso for cluster-representatives or the group Lasso based on the structure from the clusters. Regarding the first step, we present a novel and bottom-up agglomerative clustering algorithm based on canonical correlations, and we show that it finds an optimal solution and is statistically consistent. We also present some theoretical arguments that canonical correlation based clustering leads to a better-posed compatibility constant for the design matrix which ensures identifiability and an oracle inequality for the group Lasso. Furthermore, we discuss circumstances where cluster-representatives and using the Lasso as subsequent estimator leads to improved results for prediction and detection of variables. We complement the theoretical analysis with various empirical results. 相似文献

5.

Analysis of Deviance for Hypothesis Testing in Generalized Partially Linear Models

Wolfgang Karl Härdle Li-Shan Huang 《商业与经济统计学杂志》2013,31(2):322-333

In this study, we develop nonparametric analysis of deviance tools for generalized partially linear models based on local polynomial fitting. Assuming a canonical link, we propose expressions for both local and global analysis of deviance, which admit an additivity property that reduces to analysis of variance decompositions in the Gaussian case. Chi-square tests based on integrated likelihood functions are proposed to formally test whether the nonparametric term is significant. Simulation results are shown to illustrate the proposed chi-square tests and to compare them with an existing procedure based on penalized splines. The methodology is applied to German Bundesbank Federal Reserve data. 相似文献

6.

Estimation and asymptotic covariance matrix for stochastic volatility models

Maddalena Cavicchioli 《Statistical Methods and Applications》2017,26(3):437-452

In this paper we compute the asymptotic variance-covariance matrix of the method of moments estimators for the canonical Stochastic Volatility model. Our procedure is based on a linearization of the initial process via the log-squared transformation of Breidt and Carriquiry (Modelling and prediction, honoring Seymour Geisel. Springer, Berlin, 1996). Knowledge of the asymptotic variance-covariance matrix of the method of moments estimators offers a concrete possibility for the use of the classical testing procedures. The resulting asymptotic standard errors are then compared with those proposed in the literature applying different parameter estimates. Applications on simulated data support our results. Finally, we present empirical applications on the daily returns of Euro-US dollar and Yen-US dollar exchange rates. 相似文献

7.

Canonical Correlation Analysis Through Linear Modeling

Keunbaik Lee Jae Keun Yoo 《Australian & New Zealand Journal of Statistics》2014,56(1):59-72

In this paper, we introduce linear modeling of canonical correlation analysis, which estimates canonical direction matrices by minimising a quadratic objective function. The linear modeling results in a class of estimators of canonical direction matrices, and an optimal class is derived in the sense described herein. The optimal class guarantees several of the following desirable advantages: first, its estimates of canonical direction matrices are asymptotically efficient; second, its test statistic for determining the number of canonical covariates always has a chi‐squared distribution asymptotically; third, it is straight forward to construct tests for variable selection. The standard canonical correlation analysis and other existing methods turn out to be suboptimal members of the class. Finally, we study the role of canonical variates as a means of dimension reduction for predictors and responses in multivariate regression. Numerical studies and data analysis are presented. 相似文献

8.

CANONICAL VARIATE ANALYSIS WITH SPATIALLY-CORRELATED DATA

N.A. Campbell H.T. Kiiveri 《Australian & New Zealand Journal of Statistics》1993,35(3):333-344

A general canonical variate model is derived when the observations are spatially correlated. For spatial covariance structures resulting from dependence of a pixel on its nearest neighbours, the solution reduces to an analysis of neighbour-corrected values. The usual analysis, in which spatial correlation is ignored, gives similar canonical vectors but over-estimates the canonical roots. A formula for approximating the reduction in the canonical roots to adjust for the spatial correlation is given. 相似文献

9.

A generalization of random vectors and a correlation theory for them - a trial (with discussions) 2

Hans-Peter Höschel 《Statistics》2013,47(3):263-297

There are defined generalized generalized random vectors. This notion contains usual random vectors, some classical and generalized stochastic processes and certain generalizations of them. There is developped a theory of correlation on basis of a theory of linear prediction (approximation) for such rendom vectors which includes especially canonical correlations. 相似文献

10.

Canonical correlation analysis for the vector AR(1) model with ARCH innovations

Ruey S. Tsay Shiqing Ling 《Journal of statistical planning and inference》2008

This paper extends the results of canonical correlation analysis of Anderson [2002. Canonical correlation analysis and reduced-rank regression in autoregressive models. Ann. Statist. 30, 1134–1154] to a vector AR(1) process with a vector ARCH(1) innovations. We obtain the limiting distributions of the sample matrices, the canonical correlations and the canonical vectors of the process. The extension is important because many time series in economics and finance exhibit conditional heteroscedasticity. We also use simulation to demonstrate the effects of ARCH innovations on the canonical correlation analysis in finite sample. Both the limiting distributions and simulation results show that overlooking the ARCH effects in canonical correlation analysis can easily lead to erroneous inference. 相似文献

11.

多源高维数据的多分类纵向整合分析及应用

吴梦云等《统计研究》2021,38(8):132-145

多分类数据分析在实证研究中具有重要意义。然而,由于高维数、小样本及低信噪比等原因,现有的多分类方法仍面临信息量不足而导致的效果不佳问题。为此,学者们通过收集更多信息源数据以更全面地刻画实际问题。不同于收集相同自变量的不同源样本,目前较为流行的多源数据收集了相同样本的不同源自变量,它们的独立性和相关性为统计建模带来了新的挑战。本文提出基于典型变量回归的多分类纵向整合分析方法,其中利用惩罚技术实现变量选择,并独特地考虑不同源数据间的关联结构,提出高效的ADMM算法进行模型优化。数值模拟结果表明,该方法在变量选择和分类预测上均具有优越性。基于我国上证50的多源股票数据,利用该方法对2019年股票日收益率的影响因素进行了实证探究。研究表明,本文提出的多分类整合分析在筛选出具有解释意义变量的同时具有更好的预测效果。相似文献

12.

Study of redundancy of vector variables in canonical correlations

C. G. Khatri 《统计学通讯:理论与方法》2013,42(4):1425-1440

Fujikoshi (1982) obtained the necessary and sufficient conditions for the increased number of variables in the two sets of vectors not affecting the original nonzero canonical correlations and used these to obtain the likelihood ratio test procedure. He assumed a nonsingular covariance matrix due to random variables. Here, we study the same problem when the covariance matrix is singular and establish some further results. In this study, we note that the unit canonical correlations have to be separated in some of the situations. These results are valid for complex random vector variables and in some situations, the test for redundancy is given for complex random variables. 相似文献

13.

Bayesian predictions for compound weibull model

S.K. Ashour D.R. Rashwan 《统计学通讯:理论与方法》2013,42(16):1613-1624

This paper is concerned with a full Bayesian analysis for some prediction problems of the compound model if the underlying distributions are Weibull with unequal shape parameters and the sample is type I censored. A numerical example and computer facilities will be used to illustrate the procedure. 相似文献

14.

Understanding Canonical Correlation through the General Linear Model and Principal Components

Keith E. Muller 《The American statistician》2013,67(4):342-354

Canonical correlation has been little used and little understood, even by otherwise sophisticated analysts. An alternative approach to canonical correlation, based on a general linear multivariate model, is presented. Properties of principal component analysis are used to help explain the method. Standard computational methods for full rank canonical correlation, techniques for canonical correlation on component scores, and canonical correlation with less than full rank are discussed. They are seen to be essentially equivalent when the model equation for canonical correlation on component scores is presented. The two approaches to less than full rank situations are equivalent in some senses, but quite different in usefulness, depending on the application. An example dataset is analyzed in detail to help demonstrate the conclusions. 相似文献

15.

Prediction intervals for generalized-order statistics with random sample size

《Journal of Statistical Computation and Simulation》2012,82(9):1725-1741

The problem of predicting future generalized-order statistics, by assuming the future sample size is a random variable, is discussed. A general expression for the coverage probability of the prediction intervals is derived. Since k-records and progressively type-II censored-order statistics are contained in the model of generalized-order statistics, the corresponding results for them can be deduced as special cases. When the future sample size has degenerate, binomial, Poisson and geometric distributions, numerical computations are given. The procedure for finding an optimal prediction interval is presented for each case. Finally, we apply our results to a real data set in life testing given in Lee and Wang [Statistical methods for survival data analysis. Hoboken, NJ: John Wiley and Sons; 2003, p. 58, Table 3.4] for illustrative the proposed procedure in this paper. 相似文献

16.

Canonical Correlation Analysis Using Small Number of Samples

Hye-Seung Lee 《统计学通讯:模拟与计算》2013,42(5):973-985

Canonical correlation assesses the relationship between two groups of variables. Although it has been a useful tool in a wide variety of research areas, it is not well known that weaker canonical correlations require larger sample sizes to be correctly inferred. In this article, we investigate small sample bias in canonical correlation analysis and apply the jackknife bias correction to the estimation of canonical correlations. We use bootstrap samples to obtain a better confidence interval for the jackknife canonical correlation estimator. 相似文献

17.

Weighted-Average Least Squares Prediction

Jan R. Magnus Wendun Wang Xinyu Zhang 《Econometric Reviews》2016,35(6):1040-1074

Prediction under model uncertainty is an important and difficult issue. Traditional prediction methods (such as pretesting) are based on model selection followed by prediction in the selected model, but the reported prediction and the reported prediction variance ignore the uncertainty from the selection procedure. This article proposes a weighted-average least squares (WALS) prediction procedure that is not conditional on the selected model. Taking both model and error uncertainty into account, we also propose an appropriate estimate of the variance of the WALS predictor. Correlations among the random errors are explicitly allowed. Compared to other prediction averaging methods, the WALS predictor has important advantages both theoretically and computationally. Simulation studies show that the WALS predictor generally produces lower mean squared prediction errors than its competitors, and that the proposed estimator for the prediction variance performs particularly well when model uncertainty increases. 相似文献

18.

Generalized additive models with unknown link function including variable selection

Gerhard Tutz Sebastian Petry 《Journal of applied statistics》2016,43(15):2866-2885

The generalized additive model is a well established and strong tool that allows modelling smooth effects of predictors on the response. However, if the link function, which is typically chosen as the canonical link, is misspecified, estimates can be biased. A procedure is proposed that simultaneously estimates the form of the link function and the unknown form of the predictor functions including selection of predictors. The procedure is based on boosting methodology, which obtains estimates by using a sequence of weak learners. It strongly dominates fitting procedures that are unable to modify a given link function if the true link function deviates from the fixed function. The performance of the procedure is shown in simulation studies and illustrated by real world examples. 相似文献

19.

THE REDUCED-RANK GROWTH CURVE MODEL FOR DISCRIMINANT ANALYSIS OF LONGITUDINAL DATA

Jeffrey M. Albert Anant M. Kshirsagar 《Australian & New Zealand Journal of Statistics》1993,35(3):345-357

This paper presents a method of discriminant analysis especially suited to longitudinal data. The approach is in the spirit of canonical variate analysis (CVA) and is similarly intended to reduce the dimensionality of multivariate data while retaining information about group differences. A drawback of CVA is that it does not take advantage of special structures that may be anticipated in certain types of data. For longitudinal data, it is often appropriate to specify a growth curve structure (as given, for example, in the model of Potthoff & Roy, 1964). The present paper focuses on this growth curve structure, utilizing it in a model-based approach to discriminant analysis. For this purpose the paper presents an extension of the reduced-rank regression model, referred to as the reduced-rank growth curve (RRGC) model. It estimates discriminant functions via maximum likelihood and gives a procedure for determining dimensionality. This methodology is exploratory only, and is illustrated by a well-known dataset from Grizzle & Allen (1969). 相似文献

20.

Parallel analysis approach for determining dimensionality in canonical correlation analysis

《Journal of Statistical Computation and Simulation》2012,82(17):3419-3431

ABSTRACT

Canonical correlations are maximized correlation coefficients indicating the relationships between pairs of canonical variates that are linear combinations of the two sets of original variables. The number of non-zero canonical correlations in a population is called its dimensionality. Parallel analysis (PA) is an empirical method for determining the number of principal components or factors that should be retained in factor analysis. An example is given to illustrate for adapting proposed procedures based on PA and bootstrap modified PA to the context of canonical correlation analysis (CCA). The performances of the proposed procedures are evaluated in a simulation study by their comparison with traditional sequential test procedures with respect to the under-, correct- and over-determination of dimensionality in CCA. 相似文献