Better alternatives to current methods of scaling and weighting data for cluster analysis |
| |
Authors: | R. Gnanadesikan J.R. Kettenring Srinivas Maloor |
| |
Affiliation: | 1. Department of Statistics, Rutgers University, Piscataway, NJ 08854, USA;2. Charles A. Dana Research Institute for Scientists Emeriti (RISE), Drew University, Madison, NJ 07940, USA;3. Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA |
| |
Abstract: | Scaling of multivariate data prior to cluster analysis is important as a preprocessing step. Currently there are methods for doing this. This paper proposes some alternatives, which are particularly directed at helping reveal cluster structures in data. These methods are applied to simulated and real data sets and their performances are compared to some currently used methods. The results indicate that, in many situations, the new methods are much better than the most popular method, called autoscaling. In the most challenging clustering example considered, their performances, while poor, are no worse than all the currently used methods. |
| |
Keywords: | Clustering Variable weighting Variable scaling Discriminant analysis |
本文献已被 ScienceDirect 等数据库收录! |
|