Mixtures of regression models with incomplete and noisy data |
| |
Authors: | Byoung Cheol Jung, Sooyoung Cheon |
| |
Affiliation: | 1. Department of Statistics, University of Seoul, Seoul, Republic of Korea; 2. Department of Applied Statistics, Korea University, Republic of Korea |
| |
Abstract: | Estimation of mixtures of regression models usually assumes normally distributed components, but maximum likelihood estimation under normal components is sensitive to noise, outliers, and high-leverage points. Missing values are also unavoidable in many applications, and parameter estimates can be biased if they are not handled properly. In this article, we propose mixtures of regression models for contaminated, incomplete, heterogeneous data. The proposed models provide robust estimates of regression coefficients that vary across latent subgroups, even in the presence of missing values. The methodology is illustrated through simulation studies and a real data analysis. |
| |
Keywords: | EM algorithm; Maximum likelihood; Missing values; Mixtures of regression models; Outliers |
|
|