
1.

A probabilistic model for nonsymmetric correspondence analysis and prediction in contingency tables

Roberta Siciliano Ab Mooijart Peter G. M. van der Heijden 《Statistical Methods and Applications》1993,2(1):85-106

Summary Nonsymmetric correspondence analysis is a model meant for the analysis of the dependence in a two-way contingency table, and is an alternative to correspondence analysis. Correspondence analysis is based on the decomposition of Pearson's Φ²-index. Nonsymmetric correspondence analysis is based on the decomposition of Goodman-Kruskal's τ-index for predictability. In this paper, we approach nonsymmetric correspondence analysis as a statistical model based on a probability distribution. We provide algorithms for the maximum likelihood and the least-squares estimation with linear constraints upon model parameters. The nonsymmetric correspondence analysis model has many properties that can be useful for prediction analysis in contingency tables. Predictability measures are introduced to identify the categories of the response variable that can be best predicted, as well as the categories of the explanatory variable having the highest predictability power. We describe the interpretation of model parameters in two examples. In the end, we discuss the relations of nonsymmetric correspondence analysis with other reduced-rank models.
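
As a rough illustration of the τ-index mentioned in this summary, the sketch below computes Goodman-Kruskal's τ for a two-way table. It is not the authors' estimation algorithm. The function name `goodman_kruskal_tau` is hypothetical, and the sketch assumes that rows hold the explanatory variable, columns hold the response, and no row is empty.

```python
import numpy as np

def goodman_kruskal_tau(table):
    """Goodman-Kruskal tau for predicting the column (response) category
    from the row (explanatory) category. Assumes no empty rows."""
    p = np.asarray(table, dtype=float)
    p = p / p.sum()                 # joint proportions p_ij
    row = p.sum(axis=1)             # explanatory marginals p_i+
    col = p.sum(axis=0)             # response marginals p_+j
    # Gain in predictability from knowing the row, relative to the
    # heterogeneity of the response marginal distribution.
    numerator = (p ** 2 / row[:, None]).sum() - (col ** 2).sum()
    denominator = 1.0 - (col ** 2).sum()
    return numerator / denominator

# Independent table: knowing the row does not help, so tau is 0.
tau_independent = goodman_kruskal_tau(np.outer([0.3, 0.7], [0.4, 0.6]))
# Diagonal table: the row determines the column, so tau is 1.
tau_perfect = goodman_kruskal_tau([[5, 0], [0, 5]])
```

The two toy tables mark the ends of the scale: τ is 0 under independence and 1 when the explanatory category fully determines the response.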

2.

Sensitivity Coefficient in Principal Component Analysis: Robust Case

Malika Cheikh 《Communications in Statistics - Simulation and Computation》2013,42(8):1622-1630

In the classical principal component analysis (PCA), the empirical influence function for the sensitivity coefficient ρ is used to detect influential observations on the subspace spanned by the dominant principal components. In this article, we derive the influence function of ρ in the case where the reweighted minimum covariance determinant (MCD¹) is used as an estimator of multivariate location and scatter. Our aim is to confirm the reliability in terms of robustness of the MCD¹ via the approach based on the influence function of the sensitivity coefficient.

3.

On the Influence Function of the Quintile Share Ratio

Tchilabalo Abozou Kpanzou 《Communications in Statistics - Simulation and Computation》2015,44(10):2492-2499

In robust statistics, the influence function was developed as an important measure of sensitivity of estimators to large values. As a measure of income inequality, the quintile share ratio was introduced and not much is known about the theoretical properties of its nonparametric estimator. One such property is its sensitivity to outliers. In this article, we derive the influence function of the quintile share ratio. As is to be expected from its definition, the influence function is unbounded. A nonparametric estimator for the quintile share ratio is defined and its sensitivity to outliers is investigated in a small simulation study.
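
To make the sensitivity to outliers concrete, here is a minimal sketch of one common plug-in form of the quintile share ratio: total income above the 80th percentile divided by total income at or below the 20th percentile. It is not necessarily the estimator defined in the article. The function name `quintile_share_ratio` and the toy incomes are assumptions for illustration, and the quantiles use NumPy's default linear interpolation.

```python
import numpy as np

def quintile_share_ratio(incomes):
    """Plug-in quintile share ratio: income held by the top 20%
    divided by income held by the bottom 20%."""
    x = np.asarray(incomes, dtype=float)
    q20, q80 = np.quantile(x, [0.2, 0.8])   # sample quintile cut-offs
    bottom = x[x <= q20].sum()              # total income of the poorest fifth
    top = x[x > q80].sum()                  # total income of the richest fifth
    return top / bottom

# Toy incomes 1..10, then the same sample with one extreme value.
incomes = np.arange(1, 11)
base = quintile_share_ratio(incomes)

contaminated = incomes.astype(float)
contaminated[-1] = 1000.0                   # a single gross outlier
shifted = quintile_share_ratio(contaminated)
```

In this toy sample, a single extreme observation in the upper tail moves the ratio from 19/3 to 1009/3, which is the kind of behaviour an unbounded influence function implies.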

4.

A New Lens on High School Dropout: Use of Correspondence Analysis and the Statewide Longitudinal Data System

Kathryn Schaefer Ziemer Bianica Pires Vicki Lancaster Sallie Keller Mark Orr Stephanie Shipp 《The American statistician》2018,72(2):191-198

The combination of log-linear models and correspondence analysis has long been used to decompose contingency tables and aid in their interpretation. Until now, this approach has not been applied to the education Statewide Longitudinal Data System (SLDS), which contains administrative school data at the student level. While some research has been conducted using the SLDS, its primary use is for state education administrative reporting. This article uses the combination of log-linear models and correspondence analysis to gain insight into high school dropouts in two discrete regions in Kentucky, Appalachia and non-Appalachia, defined by the American Community Survey. The individual student records from the SLDS were categorized into one of the two regions and a log-linear model was used to identify the interactions between the demographic characteristics and the dropout categories, push-out and pull-out. Correspondence analysis was then used to visualize the interactions with the expanded push-out categories (boredom, course selection, expulsion, failing grade, teacher conflict) and pull-out categories (employment, family problems, illness, marriage, and pregnancy) to provide insights into the regional differences. In this article, we demonstrate that correspondence analysis can extend the insights gained from SLDS data and provide new perspectives on dropouts. Supplementary materials for this article are available online.

5.

Correspondence Analysis of Cumulative Frequencies Using a Decomposition of Taguchi's Statistic

Eric J. Beh Luigi D'Ambra Biagio Simonetti 《Communications in Statistics - Theory and Methods》2013,42(9):1620-1632

Taguchi's statistic has long been known to be a more appropriate measure of association for ordinal variables than the Pearson chi-squared statistic. Therefore, there is some advantage in using Taguchi's statistic for performing correspondence analysis when a two-way contingency table consists of one ordinal categorical variable. This article will explore the development of correspondence analysis using a decomposition of Taguchi's statistic.

6.

Influence Analysis in Principal Component Analysis Through Power-Series Expansions

A. Enguix-González J. M. Muñoz-Pichardo J. L. Moreno-Rebollo R. Pino-Mejías 《Communications in Statistics - Theory and Methods》2013,42(9-10):2025-2046

ABSTRACT

In influence analysis several problems arise in the field of Principal Components when applying different sample versions. Among these are the difficulty of determining a certain correspondence between the eigenvalues before and after the deletion of observations, the choice of the sign of the eigenvectors and the computational problem derived from the resolution of a great number of eigenproblems. In this article, such problems are discussed from the joint influence point of view and a solution is proposed by using approximations. Furthermore, the influence on a new parameter of interest is introduced: the proportion of variance explained by a set of principal components.

7.

Sensitivity analysis for ranked data

《Journal of the Korean Statistical Society》2014,43(1):1-9

Sensitivity analysis studies the influence of a small change in the input data on the output of the analysis. Han and Huh (1995) developed a quantification method for ranked data. However, the question of stability in the analysis of ranked data has not been considered. Here, we propose a method of sensitivity analysis for ranked data. Our aim is to evaluate perturbations by using a graphical approach suggested by Han and Huh (1995). It extends the results obtained by Tanaka (1984) and Huh (1989) for the sensitivity analysis in Hayashi’s third method of quantification and those by Huh and Park (1990) for the principal component reduction of the case influence derivatives in regression. A numerical example is provided to explain how to conduct sensitivity analysis based on the proposed approach.

8.

Some new aspects of taxicab correspondence analysis

Vartan Choulakian Biagio Simonetti Thu Pham Gia 《Statistical Methods and Applications》2014,23(3):401-416

Correspondence analysis (CA) and nonsymmetric correspondence analysis are based on generalized singular value decomposition, and, in general, they are not equivalent. Taxicab correspondence analysis (TCA) is an \(L_{1}\) variant of CA, and it is based on the generalized taxicab singular value decomposition (GTSVD). Our aim is to study the taxicab variant of nonsymmetric correspondence analysis. We find that for diagonal metric matrices GTSVDs of a given data set are equivalent, from which we deduce the equivalence of TCA and taxicab nonsymmetric correspondence analysis. We also attempt to show that TCA stays as close as possible to the original correspondence matrix without calculating a dissimilarity (or similarity) measure between rows or columns. Further, we discuss some new geometric and distance aspects of TCA.

9.

Influence diagnostics in the tobit censored response model (total citations: 1; self-citations: 0; citations by others: 1)

Michelli Barros Manuel Galea Manuel González Víctor Leiva 《Statistical Methods and Applications》2010,19(3):379-397

In this article, we develop influence diagnostic tools for the tobit model. Specifically, we discuss global influence methods based on the Cook distance and residuals with envelopes, and total and conformal local influence techniques. In order to analyze the sensitivity of the maximum likelihood estimators of the parameters of the model to small perturbations on the assumptions of the model and/or data, we consider several perturbation schemes, such as case-weight and response perturbations. Finally, we illustrate the developed methodology by means of a real data set.

10.

Sensitivity analysis of reliability functions of the exponential power series lifetime distribution

Mohammad Salehi Vaysi Saralees Nadarajah 《Communications in Statistics - Simulation and Computation》2013,42(10):2938-2952

ABSTRACT

Hazard rate functions are often used in modeling of lifetime data. The Exponential Power Series (EPS) family has a monotone hazard rate function. In this article, the influence of input factors such as time and parameters on the variability of hazard rate function is assessed by local and global sensitivity analysis. Two different indices based on local and global sensitivity indices are presented. The simulation results for two datasets show that the hazard rate functions of the EPS family are sensitive to input parameters. The results also show that the hazard rate function of the EPS family is more sensitive to the exponential distribution than power series distributions.

11.

Sensitivity analysis of partially linear models with response missing at random

Ai-Xia Fan 《Communications in Statistics - Simulation and Computation》2017,46(7):5323-5339

This article investigates case-deletion influence analysis via Cook’s distance and local influence analysis via conformal normal curvature for partially linear models with response missing at random. A local influence approach is developed to assess the sensitivity of parameter and nonparametric estimators to various perturbations such as case-weight, response variable, explanatory variable, and parameter perturbations on the basis of semiparametric estimating equations, which are constructed using the inverse probability weighted approach, rather than the likelihood function. Residual and generalized leverage are also defined. Simulation studies and a dataset taken from the AIDS Clinical Trials are used to illustrate the proposed methods.

12.

The analysis of dependence for three ways contingency tables with ordinal variables: A case study of patient satisfaction data

Biagio Simonetti Eric J. Beh Luigi D'Ambra 《Journal of applied statistics》2010,37(1):91-103

For many questionnaires and surveys in the marketing, business, and health disciplines, items often involve ordinal scales (such as the Likert scale and rating scale) that are associated in sometimes complex ways. Techniques such as classical correspondence analysis provide a simple graphical means of describing the nature of the association. However, the procedure does not allow the researcher to specify how one item may be associated with another, nor does the analysis allow for the ordinal structure of the scales to be reflected. This article presents a graphical approach that can help the researcher to study in depth the complex association of the items and reflect the structure of the items. We will demonstrate the applicability of this approach using data collected from a study that involves identifying major factors that influence the level of patient satisfaction in a Neapolitan hospital.

13.

Performance of some multiple comparison tests under heteroscedasticity and dependency

《Journal of Statistical Computation and Simulation》2012,82(10):1083-1100

In this article, 18 multiple comparison tests are compared according to powers and type I error measures under some violations of analysis of variance assumptions with a Monte Carlo simulation study. Considered violations of assumptions are heterogeneity in subgroup variances and dependency between subgroups. Various numbers of subgroups and subgroup sizes are considered simultaneously with the violations of assumptions. Simulation results are analysed by using visual inspection, graphical representations, decision-tree and correspondence analyses. Wide inferences are drawn on the behaviour of considered tests with respect to measures used. Some general suggestions are given on which tests should be used or avoided under violations of assumptions.

14.

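A Discussion of a Statistical Testing System for Correspondence Analysis (total citations: 3; self-citations: 0; citations by others: 3)

Fu Deyin Wang Jing 《Statistics & Information Forum》2010,25(3):3-6

Because its results are easy to read, correspondence analysis has been applied more and more widely in recent years. To support better use of correspondence analysis, this article proposes establishing a system of statistical tests for it, comprising statistical tests of the applicability of correspondence analysis and tests of its effectiveness, and also raises other issues to watch for when applying correspondence analysis.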

15.

Andrews plots for multivariate data: some new suggestions and applications

Ravindra Khattree Dayanand N. Naik 《Journal of statistical planning and inference》2002,100(2):411-425

Andrews plots (Biometrics 28 (1972) 125-136), as a tool to graphically interpret multivariate data, have recently gained considerable recognition. In this article, we first review the previous literature and then suggest a modification to the traditional Andrews plots. Finally, we illustrate a few new applications of these plots in robust design studies and in correspondence analysis, using real data.

16.

A multivariate non-parametric kernel estimator for global sensitivity analysis

Lamia Djerroud Tristan Senga Kiessé Smail Adjabi 《Communications in Statistics - Simulation and Computation》2018,47(6):1606-1622

Estimating how a model output is influenced by variations in its inputs has become an important problem in reliability and sensitivity analysis. This article is concerned with estimating sensitivity indices that quantify the contribution of inputs to the variance of the model output. A multivariate mixed kernel estimator is investigated since, until now, discrete and continuous inputs have been considered separately in kernel estimation of sensitivity indices. To illustrate the differences between the influence of mixed, discrete, and continuous inputs, analytical expressions of Sobol sensitivity indices are given in these three cases for the Ishigami test function. In addition, the performance of the mixed kernel estimator is illustrated through simulations in which a Bayesian procedure is applied for bandwidth parameter choice. An application to a real example is also presented. Finally, the use of an appropriate kernel estimator according to the type of inputs is found to influence the accuracy of the sensitivity index estimates.
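
For orientation, the sketch below evaluates the well-known closed-form Sobol indices of the Ishigami function f(x) = sin(x1) + a·sin²(x2) + b·x3⁴·sin(x1). It covers only the standard continuous case, with the three inputs assumed independent and uniform on [-π, π]; the mixed and discrete cases treated in the article are not reproduced. The function name `ishigami_sobol_indices` and the parameter values a = 7, b = 0.1 are common illustrative choices, not values taken from the article.

```python
import math

def ishigami_sobol_indices(a=7.0, b=0.1):
    """Closed-form Sobol indices of the Ishigami function with
    independent inputs uniform on [-pi, pi] (continuous case only)."""
    pi4 = math.pi ** 4
    pi8 = math.pi ** 8
    # Total variance of the output.
    total_var = a ** 2 / 8 + b * pi4 / 5 + b ** 2 * pi8 / 18 + 0.5
    # Partial variances from the ANOVA decomposition.
    v1 = 0.5 * (1 + b * pi4 / 5) ** 2      # main effect of x1
    v2 = a ** 2 / 8                        # main effect of x2
    v3 = 0.0                               # x3 has no main effect
    v13 = b ** 2 * pi8 * (1 / 18 - 1 / 50) # interaction of x1 and x3
    return {
        "S1": v1 / total_var,
        "S2": v2 / total_var,
        "S3": v3 / total_var,
        "S13": v13 / total_var,
    }

indices = ishigami_sobol_indices()
```

Closed-form values like these are what a kernel or Monte Carlo estimator of the indices would typically be checked against. Note that x3 contributes to the output variance only through its interaction with x1.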

17.

Influence analysis on crossover design experiment in bioequivalence studies

Yufen Huang Bo‐Shiang Ke 《Pharmaceutical statistics》2014,13(2):110-118

Crossover designs are commonly used in bioequivalence studies. However, the results can be affected by some outlying observations, which may lead to the wrong decision on bioequivalence. Therefore, it is essential to investigate the influence of aberrant observations. Chow and Tse in 1990 discussed this issue by considering the methods based on the likelihood distance and estimates distance. Perturbation theory provides a useful tool for the sensitivity analysis on statistical models. Hence, in this paper, we develop the influence functions via the perturbation scheme proposed by Hampel as an alternative approach on the influence analysis for a crossover design experiment. Moreover, the comparisons between the proposed approach and the method proposed by Chow and Tse are investigated. Two real data examples are provided to illustrate the results of these approaches. Our proposed influence functions show excellent performance on the identification of outlier/influential observations and are suitable for use with small sample size crossover designs commonly used in bioequivalence studies. Copyright © 2013 John Wiley & Sons, Ltd.

18.

Influence diagnostics on the coefficient of variation of elliptically contoured distributions

Marco Riquelme Manuel Galea Antonio Sanhueza 《Journal of applied statistics》2011,38(3):513-532

In this article, we study the behavior of the coefficient of variation (CV) of a random variable that follows a symmetric distribution in the real line. Specifically, we estimate this coefficient using the maximum-likelihood (ML) method. In addition, we provide asymptotic inference for this parameter, which allows us to test hypotheses and construct confidence intervals. Furthermore, we produce influence diagnostics to evaluate the sensitivity of the ML estimate of this coefficient when atypical data are present. Moreover, we illustrate the obtained results by using real financial data. Finally, we carry out a simulation study to detect the potential influence of atypical observations on the ML estimator of the CV of a symmetric distribution. The illustration and simulation demonstrate the robustness of the ML estimation of this coefficient.

19.

Symmetrical Design of Experiment in Global Sensitivity Analysis Based on ANOVA High-dimensional Model Representation

Chun Luo Yingshan Zhang 《Communications in Statistics - Simulation and Computation》2016,45(1):48-72

This article reviews symmetrical global sensitivity analysis based on the analysis of variance of high-dimensional model representation. To overcome the computational difficulties and explore the use of symmetrical design of experiment (SDOE), two methods are presented. If the form of the objective function f is known, we use SDOE to estimate the symmetrical global sensitivity indices instead of Monte Carlo or quasi-Monte Carlo simulation. Otherwise, we use the observed values of the experiment to do symmetrical global sensitivity analysis. These methods are easy to implement and can reduce the computational cost. An example is given by symmetrical design of experiment.

20.

The effect of the mechanism and amount of missingness on subset correspondence analysis

Gillian M. Hendry Temesgen Zewotir Rajen N. Naidoo Delia North 《Communications in Statistics - Simulation and Computation》2017,46(9):7100-7115

The application of subset correspondence analysis is a relatively new technique to deal with the analysis of categorical data with missingness. A simulation study is used to test the effects of Little and Rubin's missingness mechanisms, as well as missingness up to 50% on subset correspondence analysis. Missingness was simulated across 18 different scenarios and each scenario was repeated 10 times, with outcomes averaged across the 10 simulations. In this application, it was found that while missingness in excess of 30% has some effect on certain outcomes, there is no evidence to suggest that the missingness mechanism significantly affects results.