共查询到20条相似文献,搜索用时 15 毫秒
1.
AbstractA method is proposed for the estimation of missing data in analysis of covariance models. This is based on obtaining an estimate of the missing observation that minimizes the error sum of squares. Specific derivation of this estimate is carried out for the one-factor analysis of covariance, and numerical examples are given to show the nature of the estimates produced. Parameter estimates of the imputed data are then compared with those of the incomplete data. 相似文献
2.
一、引言 风险投资所进行的是高风险高收益的投资 ,这种投资的特点与一般的证券投资不同 ,从而决定了其投资组合有别于一般基金的投资组合。( 1)风险投资所作的投资往往是开创性的 ,可能并无大量的历史数据可查 ,即便有也可能较少 ,不满足数理统计关于样本容量的要求 ;( 2 )风险投资的投资注重长期资本收益 ,因此其风险不宜只沿袭传统的系统风险和非系统风险的划分方法 ,其收益与证券市场整体收益的相关性可能并不显著 ;( 3)风险投资的投资收益分布并不一定符合正态分布 ,因此用方差或标准差来度量风险 ,可能并不切合实际 ,也就失去相应… 相似文献
3.
The problem of estimating a covariance matrix is considered in this paper. Using the so-called partial Iwasawa coordinates of the covariance matrix, a new improved estimator dominating the James-Stein estimator is proposed. The results of a simulation study verifies that the new estimator provides a substantial improvement in risk under Stein's loss. 相似文献
4.
Jong-Hyeon Jeong 《Journal of the Royal Statistical Society. Series A, (Statistics in Society)》2006,169(2):289-303
Summary. Competing risks situations can be encountered in many research areas such as medicine, social science and engineering. The main stream of analyses of those competing risks data has been nonparametric or semiparametric in the statistical literature. We propose a new parametric family to parameterize the cumulative incidence function completely. The new distribution is sufficiently flexible to fit various shapes of hazard patterns in survival data and increases the efficiency of the cumulative incidence estimates over the distribution-free approaches. A simple two-sample parametric test statistic is also proposed to compare the cumulative incidence functions between two groups at a given time point. The new parametric approach is illustrated by using breast cancer data sets from the National Surgical Adjuvant Breast and Bowel Project. 相似文献
5.
6.
Efficient estimation of the regression coefficients in longitudinal data analysis requires a correct specification of the covariance structure. If misspecification occurs, it may lead to inefficient or biased estimators of parameters in the mean. One of the most commonly used methods for handling the covariance matrix is based on simultaneous modeling of the Cholesky decomposition. Therefore, in this paper, we reparameterize covariance structures in longitudinal data analysis through the modified Cholesky decomposition of itself. Based on this modified Cholesky decomposition, the within-subject covariance matrix is decomposed into a unit lower triangular matrix involving moving average coefficients and a diagonal matrix involving innovation variances, which are modeled as linear functions of covariates. Then, we propose a fully Bayesian inference for joint mean and covariance models based on this decomposition. A computational efficient Markov chain Monte Carlo method which combines the Gibbs sampler and Metropolis–Hastings algorithm is implemented to simultaneously obtain the Bayesian estimates of unknown parameters, as well as their standard deviation estimates. Finally, several simulation studies and a real example are presented to illustrate the proposed methodology. 相似文献
7.
8.
Jin Hyun Nam 《统计学通讯:模拟与计算》2017,46(3):1796-1807
Among many classification methods, linear discriminant analysis (LDA) is a favored tool due to its simplicity, robustness, and predictive accuracy but when the number of genes is larger than the number of observations, it cannot be applied directly because the within-class covariance matrix is singular. Also, diagonal LDA (DLDA) is a simpler model compared to LDA and has better performance in some cases. However, in reality, DLDA requires a strong assumption based on mutual independence. In this article, we propose the modified LDA (MLDA). MLDA is based on independence, but uses the information that has an effect on classification performance with the dependence structure. We suggest two approaches. One is the case of using gene rank. The other involves no use of gene rank. We found that MLDA has better performance than LDA, DLDA, or K-nearest neighborhood and is comparable with support vector machines in real data analysis and the simulation study. 相似文献
9.
In practical survey sampling, missing data are unavoidable due to nonresponse, rejected observations by editing, disclosure control, or outlier suppression. We propose a calibrated imputation approach so that valid point and variance estimates of the population (or domain) totals can be computed by the secondary users using simple complete‐sample formulae. This is especially helpful for variance estimation, which generally require additional information and tools that are unavailable to the secondary users. Our approach is natural for continuous variables, where the estimation may be either based on reweighting or imputation, including possibly their outlier‐robust extensions. We also propose a multivariate procedure to accommodate the estimation of the covariance matrix between estimated population totals, which facilitates variance estimation of the ratios or differences among the estimated totals. We illustrate the proposed approach using simulation data in supplementary materials that are available online. 相似文献
10.
In this paper, we propose a multivariate t regression model with its mean and scale covariance modeled jointly for the analysis of longitudinal data. A modified Cholesky decomposition is adopted to factorize the dependence structure in terms of unconstrained autoregressive and scale innovation parameters. We present three distinct representations of the log-likelihood function of the model and study the associated properties. A computationally efficient Fisher scoring algorithm is developed for carrying out maximum likelihood estimation. The technique for the prediction of future responses in this context is also investigated. The implementation of the proposed methodology is illustrated through two real-life examples and extensive simulation studies. 相似文献
11.
《Journal of Statistical Computation and Simulation》2012,82(11):1449-1460
Analysis of covariance in designed experiments has a long history dating back to the middle of the twentieth century. Given the popularity of Bayesian approaches to statistical modelling and inference, it is somewhat surprising that there is so little literature on the application of Bayesian methods in this context. This paper proposes methods based on a recent formulation of the problem in terms of a multivariate variance components model which allows for a conjugate Bayesian analysis of balanced randomized block experiments with concomitant information. The analysis is complicated by a linear constraint involving two covariance matrices. Two solutions are proposed and implemented using Markov chain Monte Carlo methods. 相似文献
12.
Influence functions are derived for the parameters in covariance structure analysis, where the parameters are estimated by minimizing a discrepancy function between the assumed covariance matrix and the sample covariance matrix. The case of confirmatory factor analysis is studied precisely with a numerical example. Comparing with a general procedure called one-step estimation, the proposed procedure has two advantages:1) computing cost is cheaper, 2) the property that arbitrary influence can be decomposed into a fi-nite number of components discussed by Tanaka and Castano-Tostado(1990) can be used for efficient computing and the characterization of a covariance structure model from the sensitivity perspective. A numerical comparison is made among the confirmatory factor analysis and some procedures of ex-ploratory factor analysis by using the decomposition mentioned above. 相似文献
13.
Estimation of the lifetime distribution of industrial components and systems yields very important information for manufacturers and consumers. However, obtaining reliability data is time consuming and costly. In this context, degradation tests are a useful alternative approach to lifetime and accelerated life tests in reliability studies. The approximate method is one of the most used techniques for degradation data analysis. It is very simple to understand and easy to implement numerically in any statistical software package. This paper uses time series techniques in order to propose a modified approximate method (MAM). The MAM improves the standard one in two aspects: (1) it uses previous observations in the degradation path as a Markov process for future prediction and (2) it is not necessary to specify a parametric form for the degradation path. Characteristics of interest such as mean or median time to failure and percentiles, among others, are obtained by using the modified method. A simulation study is performed in order to show the improved properties of the modified method over the standard one. Both methods are also used to estimate the failure time distribution of the fatigue-crack-growth data set. 相似文献
14.
This paper proposes a new factor rotation for the context of functional principal components analysis. This rotation seeks to re-express a functional subspace in terms of directions of decreasing smoothness as represented by a generalized smoothing metric. The rotation can be implemented simply and we show on two examples that this rotation can improve the interpretability of the leading components. 相似文献
15.
Heiberger M. Richard 《统计学通讯:模拟与计算》2013,42(2-3):107-140
Given multivariate normal data and a certain spherically invariant prior distribution on the covariance matrix, it is desired to estimate the moments of the posterior marginal distributions of some scalar functions of the covariance matrix by importance sampling. To this end a family of distributions is defined on the group of orthogonal matrices and a procedure is proposed for selecting one of these distributions for use as a weighting distribution in the importance sampling process. In an example estimates are calculated for the posterior mean and variance of each element in the covariance matrix expressed in the original coordinates, for the posterior mean of each element in the correlation matrix expressed in the original coordinates, and for the posterior mean of each element in the covariance matrix expressed in the coordinates of the principal variables. 相似文献
16.
Rolando Biscay Lirio Dunia Giniebra Camejo Jean-Michel Loubes Lilian Muñiz Alvarez 《Statistical Methods and Applications》2014,23(2):149-174
In this paper, we propose a data-driven model selection approach for the nonparametric estimation of covariance functions under very general moments assumptions on the stochastic process. Observing i.i.d replications of the process at fixed observation points, we select the best estimator among a set of candidates using a penalized least squares estimation procedure with a fully data-driven penalty function, extending the work in Bigot et al. (Electron J Stat 4:822–855, 2010). We then provide a practical application of this estimate for a Kriging interpolation procedure to forecast rainfall data. 相似文献
17.
Hassan S. Bakouch Meitner Cadena Christophe Chesneau 《Journal of applied statistics》2021,48(16):3002
In this paper, we develop a new general class of skew distributions with flexibility properties on the tails. Moreover, such class can provide heavy and light tails. Some of its mathematical properties are studied, including the quantile function, the moments, the moment generating function and the mean of deviations. New skew distributions are derived and used to construct new models capturing asymmetry inherent to data. The estimation of the class parameters is investigated by the method of maximum likelihood and the performance of the estimators is assessed by a simulation study. Applications of the proposed distribution are explored for two climate data sets. The first data set concerns the annual heat wave index and the second data set involves temperature and precipitation measures from the meteorological station located at Schiphol, Netherlands. Data fitting results show that our models perform better than the competitors. 相似文献
18.
《Journal of statistical planning and inference》2006,136(2):447-466
Most applications in spatial statistics involve modeling of complex spatial–temporal dependency structures, and many of the problems of space and time modeling can be overcome by using separable processes. This subclass of spatial–temporal processes has several advantages, including rapid fitting and simple extensions of many techniques developed and successfully used in time series and classical geostatistics. In particular, a major advantage of these processes is that the covariance matrix for a realization can be expressed as the Kronecker product of two smaller matrices that arise separately from the temporal and purely spatial processes, and hence its determinant and inverse are easily determinable. However, these separable models are not always realistic, and there are no formal tests for separability of general spatial–temporal processes. We present here a formal method to test for separability. Our approach can be also used to test for lack of stationarity of the process. The beauty of our approach is that by using spectral methods the mechanics of the test can be reduced to a simple two-factor analysis of variance (ANOVA) procedure. The approach we propose is based on only one realization of the spatial–temporal process.We apply the statistical methods proposed here to test for separability and stationarity of spatial–temporal ozone fields using data provided by the US Environmental Protection Agency (EPA). 相似文献
19.
《Journal of Statistical Computation and Simulation》2012,82(16):3158-3174
ABSTRACTIn many real-world applications, the traditional theory of analysis of covariance (ANCOVA) leads to inadequate and unreliable results because of violation of the response variable observations from the essential Gaussian assumption that may be due to the heterogeneity of population, the presence of outlier or both of them. In this paper, we develop a Gaussian mixture ANCOVA model for modelling heterogeneous populations with a finite number of subpopulation. We provide the maximum likelihood estimates of the model parameters via an EM algorithm. We also drive the adjusted effects estimators for treatments and covariates. The Fisher information matrix of the model and asymptotic confidence intervals for the parameter are also discussed. We performed a simulation study to assess the performance of the proposed model. A real-world example is also worked out to explained the methodology. 相似文献