首页 | 本学科首页   官方微博 | 高级检索  
     检索      


On Combining Wavelets Expansion and Sparse Linear Models for Regression on Metabolomic Data and Biomarker Selection
Authors:Nathalie Villa-Vialaneix  Noslen Hernández  Alain Paris  Céline Domange  Nathalie Priymenko  Philippe Besse
Institution:1. SAMM, Université Paris 1, Paris, France;2. IUT, Dpt STID, Université de Perpignan Via Domitia, Perpignan, France;3. Advanced Technologies Application Center, CENATAV, Havana, Cuba;4. INRA, Unité Mét@risk, AgroParisTech, Paris, France;5. AgroParisTech, UMR 0791 Modélisation Systémique Appliquée aux Ruminants, Paris, France;6. INRA, UMR 0791 Modélisation Systémique Appliquée aux Ruminants, Paris, France;7. ENVT, INRA, UMR 1089, Université de Toulouse, Toulouse, France;8. Institut de Mathématiques de Toulouse, UMR 5219, Université de Toulouse, Toulouse, France
Abstract:Wavelet thresholding of spectra has to be handled with care when the spectra are the predictors of a regression problem. Indeed, a blind thresholding of the signal followed by a regression method often leads to deteriorated predictions. The scope of this article is to show that sparse regression methods, applied in the wavelet domain, perform an automatic thresholding: the most relevant wavelet coefficients are selected to optimize the prediction of a given target of interest. This approach can be seen as a joint thresholding designed for a predictive purpose. The method is illustrated on a real world problem where metabolomic data are linked to poison ingestion. This example proves the usefulness of wavelet expansion and the good behavior of sparse and regularized methods. A comparison study is performed between the two-steps approach (wavelet thresholding and regression) and the one-step approach (selection of wavelet coefficients with a sparse regression). The comparison includes two types of wavelet bases, various thresholding methods, and various regression methods and is evaluated by calculating prediction performances. Information about the location of the most important features on the spectra was also obtained and used to identify the most relevant metabolites involved in the mice poisoning.
Keywords:Elasticnet  Metabolomics  Sparse regression  Wavelet
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号