Sparse canonical correlation analysis between an alcohol biomarker and self-reported alcohol consumption |
| |
Authors: | Shanjun Helian; Robert L. Cook |
| |
Affiliation: | 1. Department of Biostatistics, University of Florida, Gainesville, FL, USA; 2. Department of Epidemiology, University of Florida, Gainesville, FL, USA |
| |
Abstract: | In investigating the correlation between an alcohol biomarker and self-report, we developed a method to estimate the canonical correlation between two high-dimensional random vectors with a small sample size. In reviewing the relevant literature, we found that our method is somewhat similar to an existing method, but that the existing method has been criticized as lacking theoretical grounding in comparison with an alternative approach. We provide theoretical and empirical grounding for our method, and we customize it for our application to produce a novel method, which selects linear combinations that are step functions with a sparse number of steps. |
| |
Keywords: | L1 penalty; Partial canonical correlation; Regularized canonical correlation analysis; Repeated measures |