A comparison of classification models to identify the Fragile X Syndrome
Authors: Rafael Pino-Mejías, Mercedes Carrasco-Mairena, Antonio Pascual-Acosta, María-Dolores Cubiles-De-La-Vega, Joaquín Muñoz-García
Institution: (a) Statistics Department, University of Sevilla, Spain; (b) Andalusian Prospective Center, Spain; (c) Researcher of the FXS Association, Andalusia, Spain
Abstract: The main models of machine learning are briefly reviewed and considered for building a classifier to identify the Fragile X Syndrome (FXS). We have analyzed 172 patients potentially affected by FXS in Andalusia (Spain); by means of a DNA test, each member of the data set is known to belong to one of two classes: affected or not affected. Two predictor sets are considered: the whole set, formed by 40 variables, and a reduced set with only nine predictors significantly associated with the response. Four alternative base classification models have been investigated: logistic regression, classification trees, multilayer perceptron and support vector machines. For both predictor sets, the best accuracy, considering both the mean and the standard deviation of the test error rate, is achieved by the support vector machines, confirming the increasing importance of this learning algorithm. Three ensemble methods (bagging, random forests and boosting) were also considered, among which the bagged versions of support vector machines stand out, especially when they are constructed with the reduced set of predictor variables. The analysis of the sensitivity, the specificity and the area under the ROC curve agrees with the main conclusions drawn from the accuracy results. All of these models can be fitted by free R programs.
Keywords: fragile X syndrome; support vector machines; multilayer perceptron; classification trees; logistic regression; ensemble methods; R system
This article is indexed in InformaWorld and other databases.
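The abstract compares classifiers by the mean and standard deviation of the test error rate, and by sensitivity, specificity and the area under the ROC curve. As a rough illustration of how these quantities are computed for a two-class problem, here is a minimal sketch in plain Python. It is not the authors' code (the study used R), and every function name and every number below is a hypothetical toy value, not data or results from the paper. Class 1 stands for "affected" and class 0 for "not affected".

```python
from statistics import mean, stdev


def error_rate(y_true, y_pred):
    """Fraction of cases whose predicted class differs from the true class."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity), treating class 1 as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(y_true, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation:
    the fraction of (positive, negative) pairs in which the positive case
    receives the higher score, counting ties as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def summarize_error_rates(rates):
    """Mean and sample standard deviation of test error rates over repeated splits."""
    return mean(rates), stdev(rates)


# Hypothetical toy test set: true classes and classifier scores (made-up values).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 decision threshold

sens, spec = sensitivity_specificity(y_true, y_pred)
auc = roc_auc(y_true, scores)
err = error_rate(y_true, y_pred)

# Hypothetical test error rates from three repeated train/test splits.
avg_err, sd_err = summarize_error_rates([0.25, 0.20, 0.30])
```

Reporting the standard deviation alongside the mean, as the abstract does, distinguishes a model that is accurate on average from one that is also stable across different test partitions.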