Incremental modelling for compositional data streams |
| |
Authors: | Yuan Wei Huiwen Wang Gilbert Saporta |
| |
Affiliation: | 1. School of Economics and Management, Beihang University, Beijing, China;2. Beijing Key Laboratory of Emergence Support Simulation Technologies for City Operations, Beijing, China;3. Applied Statistics, Conservatoire National des Arts et Métiers, Paris, France |
| |
Abstract: | ABSTRACTIncremental modelling of data streams is of great practical importance, as shown by its applications in advertising and financial data analysis. We propose two incremental covariance matrix decomposition methods for a compositional data type. The first method, exact incremental covariance decomposition of compositional data (C-EICD), gives an exact decomposition result. The second method, covariance-free incremental covariance decomposition of compositional data (C-CICD), is an approximate algorithm that can efficiently compute high-dimensional cases. Based on these two methods, many frequently used compositional statistical models can be incrementally calculated. We take multiple linear regression and principle component analysis as examples to illustrate the utility of the proposed methods via extensive simulation studies. |
| |
Keywords: | Compositional data Covariance matrix Eigen decomposition Data stream |
|
|