Robust Detection of Multiple Outliers in Grouped Multivariate Data |
| |
Authors: | Chrys Caroni Nedret Billor |
| |
Institution: |
a Department of Mathematics, National Technical University of Athens, Greece
b Department of Mathematics and Statistics, Auburn University, USA |
| |
Abstract: | Many methods have been developed for detecting multiple outliers in a single multivariate sample, but very few for the case where there may be groups in the data. We propose a method of simultaneously determining groups (as in cluster analysis) and detecting outliers, which are points that are distant from every group. Our method is an adaptation of the BACON algorithm proposed by Billor, Hadi and Velleman for the robust detection of multiple outliers in a single group of multivariate data. There are two versions of our method, depending on whether or not the groups can be assumed to have equal covariance matrices. The effectiveness of the method is illustrated by its application to two real data sets and further shown by a simulation study for different sample sizes and dimensions for 2 and 3 groups, with and without planted outliers in the data. When the number of groups is not known in advance, the algorithm could be used as a robust method of cluster analysis, by running it for various numbers of groups and choosing the best solution. |
| |
Keywords: | Multivariate data outliers robust methods BACON cluster analysis |
本文献已被 InformaWorld 等数据库收录! |
|