Modified linear discriminant analysis using block covariance matrix in high-dimensional data |
| |
Authors: | Jin Hyun Nam |
| |
Affiliation: | Department of Statistics, Sungkyunkwan University, Seoul, Republic of Korea |
| |
Abstract: | Among many classification methods, linear discriminant analysis (LDA) is a favored tool due to its simplicity, robustness, and predictive accuracy but when the number of genes is larger than the number of observations, it cannot be applied directly because the within-class covariance matrix is singular. Also, diagonal LDA (DLDA) is a simpler model compared to LDA and has better performance in some cases. However, in reality, DLDA requires a strong assumption based on mutual independence. In this article, we propose the modified LDA (MLDA). MLDA is based on independence, but uses the information that has an effect on classification performance with the dependence structure. We suggest two approaches. One is the case of using gene rank. The other involves no use of gene rank. We found that MLDA has better performance than LDA, DLDA, or K-nearest neighborhood and is comparable with support vector machines in real data analysis and the simulation study. |
| |
Keywords: | Block covariance matrix Classification High-dimension Modified LDA |
|
|