首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 897 毫秒
1.
Summary Nonsymmetric correspondence analysis is a model meant for the analysis of the dependence in a two-way continengy table, and is an alternative to correspondence analysis. Correspondence analysis is based on the decomposition of Pearson's Ф2-index Nonsymmetric correspondence analysis is based on the decomposition of Goodman-Kruskal's τ-index for predicatablity. In this paper, we approach nonsymmetric correspondence analysis as a statistical model based on a probability distribution. We provide algorithms for the maximum likelihood and the least-squares estimation with linear constraints upon model parameters. The nonsymmetric correspondence analysis model has many properties that can be useful for prediction analysis in contingency tables. Predictability measures are introduced to identify the categories of the response variable that can be best predicted, as well as the categories of the explanatory variable having the highest predictability power. We describe the interpretation of model parameters in two examples. In the end, we discuss the relations of nonsymmetric correspondence analysis with other reduced-rank models.  相似文献   

2.
In the classical principal component analysis (PCA), the empirical influence function for the sensitivity coefficient ρ is used to detect influential observations on the subspace spanned by the dominants principal components. In this article, we derive the influence function of ρ in the case where the reweighted minimum covariance determinant (MCD1) is used as estimator of multivariate location and scatter. Our aim is to confirm the reliability in terms of robustness of the MCD1 via the approach based on the influence function of the sensitivity coefficient.  相似文献   

3.
In robust statistics, the influence function was developed as an important measure of sensitivity of estimators to large values. As a measure of income inequality, the quintile share ratio was introduced and not much is known about the theoretical properties of its nonparametric estimator. One such property is its sensitivity to outliers. In this article, we derive the influence function of the quintile share ratio. As is to be expected from its definition, the influence function is unbounded. A nonparametric estimator for the quintile share ratio is defined and its sensitivity to outliers is investigated in a small simulation study.  相似文献   

4.
The combination of log-linear models and correspondence analysis have long been used to decompose contingency tables and aid in their interpretation. Until now, this approach has not been applied to the education Statewide Longitudinal Data System (SLDS), which contains administrative school data at the student level. While some research has been conducted using the SLDS, its primary use is for state education administrative reporting. This article uses the combination of log-linear models and correspondence analysis to gain insight into high school dropouts in two discrete regions in Kentucky, Appalachia and non-Appalachia, defined by the American Community Survey. The individual student records from the SLDS were categorized into one of the two regions and a log-linear model was used to identify the interactions between the demographic characteristics and the dropout categories, push-out and pull-out. Correspondence analysis was then used to visualize the interactions with the expanded push-out categories, boredom, course selection, expulsion, failing grade, teacher conflict, and pull-out categories, employment, family problems, illness, marriage, and pregnancy to provide insights into the regional differences. In this article, we demonstrate that correspondence analysis can extend the insights gained from SDLS data and provide new perspectives on dropouts. Supplementary materials for this article are available online.  相似文献   

5.
Taguchi's statistic has long been known to be a more appropriate measure of association for ordinal variables than the Pearson chi-squared statistic. Therefore, there is some advantage in using Taguchi's statistic for performing correspondence analysis when a two-way contingency table consists of one ordinal categorical variable. This article will explore the development of correspondence analysis using a decomposition of Taguchi's statistic.  相似文献   

6.
ABSTRACT

In influence analysis several problems arise in the field of Principal Components when applying different sample versions. Among these are the difficulty of determining a certain correspondence between the eigenvalues before and after the deletion of observations, the choice of the sign of the eigenvectors and the computational problem derived from the resolution of a great number of eigenproblems. In this article, such problems are discussed from the joint influence point of view and a solution is proposed by using approximations. Furthermore, the influence on a new parameter of interest is introduced: the proportion of variance explained by a set of principal components.  相似文献   

7.
Sensitivity analysis is to study the influence of a small change in the input data on the output of the analysis. Han and Huh (1995) developed a quantification method for the ranked data. However, the question of stability in the analysis of ranked data has not been considered. Here, we propose a method of sensitivity analysis for ranked data. Our aim is to evaluate perturbations by using a graphical approach suggested by Han and Huh (1995). It extends the results obtained by Tanaka (1984) and Huh (1989) for the sensitivity analysis in Hayashi’s third method of quantification and those by Huh and Park (1990) for the principal component reduction of the case influence derivatives in regression. A numerical example is provided to explain how to conduct sensitivity analysis based on the proposed approach.  相似文献   

8.
Correspondence analysis (CA) and nonsymmetric correspondence analysis are based on generalized singular value decomposition, and, in general, they are not equivalent. Taxicab correspondence analysis (TCA) is a \(\hbox {L}_{1}\) variant of CA, and it is based on the generalized taxicab singular value decomposition (GTSVD). Our aim is to study the taxicab variant of nonsymmetric correspondence analysis. We find that for diagonal metric matrices GTSVDs of a given data set are equivalent; from which we deduce the equivalence of TCA and taxicab nonsymmetric correspondence analysis. We also attempt to show that TCA stays as close as possible to the original correspondence matrix without calculating a dissimilarity (or similarity) measure between rows or columns. Further, we discuss some new geometric and distance aspects of TCA.  相似文献   

9.
Influence diagnostics in the tobit censored response model   总被引:1,自引:0,他引:1  
In this article, we develop influence diagnostic tools for the tobit model. Specifically, we discuss global influence methods based on the Cook distance and residuals with envelopes, and total and conformal local influence techniques. In order to analyze the sensitivity of the maximum likelihood estimators of the parameters of the model to small perturbations on the assumptions of the model and/or data, we consider several perturbation schemes, such as case-weight and response perturbations. Finally, we illustrate the developed methodology by means of a real data set.  相似文献   

10.
ABSTRACT

Hazard rate functions are often used in modeling of lifetime data. The Exponential Power Series (EPS) family has a monotone hazard rate function. In this article, the influence of input factors such as time and parameters on the variability of hazard rate function is assessed by local and global sensitivity analysis. Two different indices based on local and global sensitivity indices are presented. The simulation results for two datasets show that the hazard rate functions of the EPS family are sensitive to input parameters. The results also show that the hazard rate function of the EPS family is more sensitive to the exponential distribution than power series distributions.  相似文献   

11.
This article investigates case-deletion influence analysis via Cook’s distance and local influence analysis via conformal normal curvature for partially linear models with response missing at random. Local influence approach is developed to assess the sensitivity of parameter and nonparametric estimators to various perturbations such as case-weight, response variable, explanatory variable, and parameter perturbations on the basis of semiparametric estimating equations, which are constructed using the inverse probability weighted approach, rather than likelihood function. Residual and generalized leverage are also defined. Simulation studies and a dataset taken from the AIDS Clinical Trials are used to illustrate the proposed methods.  相似文献   

12.
For many questionnaires and surveys in the marketing, business, and health disciplines, items often involve ordinal scales (such as the Likert scale and rating scale) that are associated in sometimes complex ways. Techniques such as classical correspondence analysis provide a simple graphical means of describing the nature of the association. However, the procedure does not allow the researcher to specify how one item may be associated with another, nor does the analysis allow for the ordinal structure of the scales to be reflected. This article presents a graphical approach that can help the researcher to study in depth the complex association of the items and reflect the structure of the items. We will demonstrate the applicability of this approach using data collected from a study that involves identifying major factors that influence the level of patient satisfaction in a Neapolitan hospital.  相似文献   

13.
In this article, 18 multiple comparison tests are compared according to powers and type I error measures under some violations of analysis of variance assumptions with a Monte Carlo simulation study. Considered violations of assumptions are heterogeneity in subgroup variances and dependency between subgroups. Various numbers of subgroups and subgroup sizes are considered simultaneously with the violations of assumptions. Simulation results are analysed by using visual inspection, graphical representations, decision-tree and correspondence analyses. Wide inferences are drawn on the behaviour of considered tests with respect to measures used. Some general suggestions are given on which tests should be used or avoided under violations of assumptions.  相似文献   

14.
对应分析统计检验体系探讨   总被引:3,自引:0,他引:3  
对应分析因其结果的易读性,近些年得到了越来越广泛的应用。为了更好地应用对应分析,提出建立对应分析统计检验体系,包括对应分析适用性的统计检验以及对应分析效果的检验,同时还提出应用对应分析时应注意的其它问题。  相似文献   

15.
Andrews plots (Biometrics 28 (1972) 125-136), as a tool to graphically interpret multivariate data, have recently gained considerable recognition. In this article, we first review the previous literature and then suggest a modification to the traditional Andrews plots. Finally, we illustrate a few new applications of these plots in robust design studies and in correspondence analysis, using real data.  相似文献   

16.
The fact of estimating how a model output is influenced by the variations of inputs has become an important problematic in reliability and sensitivity analysis. This article is interested in estimating sensitivity indices useful to quantify the contribution of inputs to the variance of model output. A multivariate mixed kernel estimator is investigated since, until now, discrete and continuous inputs have been separately considered in kernel estimation of sensitivity indices. To illustrate the differences between the influence of mixed, discrete, and continuous inputs, analytical expressions of Sobol sensitivity indices are expressed in these three cases for the Ishigami test function. Besides, the performance of the mixed kernel estimator is illustrated through simulations in which the Bayesian procedure is applied for bandwidth parameter choice. An application is also realized on a real example. Finally, the use of an appropriate kernel estimator according to the type of inputs is found to be influential on the accuracy of sensitivity indices estimates.  相似文献   

17.
Crossover designs are commonly used in bioequivalence studies. However, the results can be affected by some outlying observations, which may lead to the wrong decision on bioequivalence. Therefore, it is essential to investigate the influence of aberrant observations. Chow and Tse in 1990 discussed this issue by considering the methods based on the likelihood distance and estimates distance. Perturbation theory provides a useful tool for the sensitivity analysis on statistical models. Hence, in this paper, we develop the influence functions via the perturbation scheme proposed by Hampel as an alternative approach on the influence analysis for a crossover design experiment. Moreover, the comparisons between the proposed approach and the method proposed by Chow and Tse are investigated. Two real data examples are provided to illustrate the results of these approaches. Our proposed influence functions show excellent performance on the identification of outlier/influential observations and are suitable for use with small sample size crossover designs commonly used in bioequivalence studies. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

18.
In this article, we study the behavior of the coefficient of variation (CV) of a random variable that follows a symmetric distribution in the real line. Specifically, we estimate this coefficient using the maximum-likelihood (ML) method. In addition, we provide asymptotic inference for this parameter, which allows us to contrast hypothesis and construct confidence intervals. Furthermore, we produce influence diagnostics to evaluate the sensitivity of the ML estimate of this coefficient when atypical data are present. Moreover, we illustrate the obtained results by using financial real data. Finally, we carry out a simulation study to detect the potential influence of atypical observations on the ML estimator of the CV of a symmetric distribution. The illustration and simulation demonstrate the robustness of the ML estimation of this coefficient.  相似文献   

19.
This article reviews symmetrical global sensitivity analysis based on the analysis of variance of high-dimensional model representation. To overcome the computational difficulties and explore the use of symmetrical design of experiment (SDOE), two methods are presented. If the form of the objective function f is known, we use SDOE to estimate the symmetrical global sensitivity indices instead of Monte Carlo or quasi-Monte Carlo simulation. Otherwise, we use the observed values of the experiment to do symmetrical global sensitivity analysis. These methods are easy to implement and can reduce the computational cost. An example is given by symmetrical design of experiment.  相似文献   

20.
The application of subset correspondence analysis is a relatively new technique to deal with the analysis of categorical data with missingness. A simulation study is used to test the effects of Little and Rubin's missingness mechanisms, as well as missingness up to 50% on subset correspondence analysis. Missingness was simulated across 18 different scenarios and each scenario was repeated 10 times, with outcomes averaged across the 10 simulations. In this application, it was found that while missingness in excess of 30% has some effect on certain outcomes, there is no evidence to suggest that the missingness mechanism significantly affects results.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号