Model-based clustering of multivariate skew data with circular components and missing values |
| |
Authors: | Francesco Lagona, Marco Picone |
| |
Affiliation: | 1. DIPES, University Roma Tre, Via Gabriello Chiabrera 199, 00145 Rome, Italy; 2. Department of Economics, University Roma Tre, Italy |
| |
Abstract: | Motivated by classification issues that arise in marine studies, we propose a latent-class mixture model for the unsupervised classification of incomplete quadrivariate data with two linear and two circular components. The model integrates bivariate circular densities and bivariate skew normal densities to capture the association between toroidal clusters of bivariate circular observations and planar clusters of bivariate linear observations. Maximum-likelihood estimation of the model is facilitated by an expectation-maximization (EM) algorithm that treats unknown class membership and missing values as different sources of incomplete information. The model is applied to hourly observations of wind speed and direction and of wave height and direction to identify a number of sea regimes, which represent specific distributional shapes that the data take under latent environmental conditions. |
| |
Keywords: | circular data; EM algorithm; latent classes; missing values; skew normal; unsupervised classification; von Mises; wave; wind |
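The kind of density described in the abstract can be sketched as follows. This is an illustrative form only: it assumes that the circular pair and the linear pair are conditionally independent given the latent class, which the abstract does not state, and the symbols $K$, $\pi_k$, $f_c$, $f_{SN}$, $\boldsymbol{\beta}_k$ and $\boldsymbol{\gamma}_k$ are notation chosen here, not necessarily the authors'.

```latex
% x = (x_1, x_2): circular components (wind and wave direction), on the torus
% y = (y_1, y_2): linear components (wind speed and wave height), on the plane
f(\mathbf{x}, \mathbf{y})
  = \sum_{k=1}^{K} \pi_k \,
    f_c(\mathbf{x}; \boldsymbol{\beta}_k) \,
    f_{SN}(\mathbf{y}; \boldsymbol{\gamma}_k),
\qquad \pi_k \ge 0, \quad \sum_{k=1}^{K} \pi_k = 1 .

% Posterior class-membership probability for a fully observed unit i,
% as it would be computed in the E-step of an EM algorithm:
\hat{\pi}_{ik}
  = \frac{\pi_k \, f_c(\mathbf{x}_i; \boldsymbol{\beta}_k) \,
          f_{SN}(\mathbf{y}_i; \boldsymbol{\gamma}_k)}
         {\sum_{j=1}^{K} \pi_j \, f_c(\mathbf{x}_i; \boldsymbol{\beta}_j) \,
          f_{SN}(\mathbf{y}_i; \boldsymbol{\gamma}_j)} .
```

Under this sketch, each class $k$ would correspond to one sea regime, with $f_c$ a bivariate circular density and $f_{SN}$ a bivariate skew normal density. The handling of missing values, which the abstract treats as a second source of incomplete information, is not shown here.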