Multivariate forests with missing mixed outcomes |
| |
Authors: | Abdessamad Dine, François Bellavance |
| |
Affiliation: | 1. Department of Management, Ecole Supérieure de Technologie, Université Hassan Premier, Berrechid, Morocco; 2. Department of Decision Sciences, HEC Montréal, Montréal, Québec, Canada |
| |
Abstract: | In this article, we propose a multivariate random forest method for multiple responses of mixed types when some responses are missing. Imputation is performed for each bootstrap sample used to build the individual trees that form the forest. The individual trees are built with a weighted splitting rule that allows imputed observations to be downweighted. A simulation study shows the benefits of this approach over complete-case analysis both when responses are missing completely at random (MCAR) and when they are missing at random (MAR). In particular, the gain in prediction accuracy of the proposed method is larger in the MAR case and increases with the proportion of missing responses. |
| |
Keywords: | General location model; GLOM; Mixed responses; Multiple imputation; Multivariate tree; Random forest |
|
|