Using Lasso RCCA for cross-language information retrieval |
| |
Authors: | Emil Polajnar |
| |
Affiliation: | 1. Faculty of Social Sciences, University of Ljubljana, Ljubljana, Slovenia; emil.polajnar@fdv.uni-lj.si |
| |
Abstract: | Restricted canonical correlation analysis and the lasso shrinkage method were combined to perform canonical correlation analysis with non-negativity restrictions on datasets where the sample size is much smaller than the number of variables. The method was implemented as an alternating least-squares algorithm and applied to cross-language information retrieval on a dataset of aligned documents in eight languages. A set of experiments was run to evaluate the method and compare it with other methods in the field. |
| |
Keywords: | Alternating least-squares; Cross-language information retrieval; Lasso shrinkage; Restricted canonical correlation |
|
|