Divisive clustering of high dimensional data streams |
| |
Authors: | David P Hofmeyr Nicos G Pavlidis Idris A Eckley |
| |
Institution: | 1.Department of Mathematics and Statistics,Lancaster University,Lancaster,UK;2.Department of Management Science,Lancaster University,Lancaster,UK |
| |
Abstract: | Clustering streaming data is gaining importance as automatic data acquisition technologies are deployed in diverse applications. We propose a fully incremental projected divisive clustering method for high-dimensional data streams that is motivated by high density clustering. The method is capable of identifying clusters in arbitrary subspaces, estimating the number of clusters, and detecting changes in the data distribution which necessitate a revision of the model. The empirical evaluation of the proposed method on numerous real and simulated datasets shows that it is scalable in dimension and number of clusters, is robust to noisy and irrelevant features, and is capable of handling a variety of types of non-stationarity. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|