首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Facilitating the Calculation of the Efficient Score Using Symbolic Computing
Authors:Alexander B Sibley  Zhiguo Li  Yu Jiang  Yi-Ju Li  Cliburn Chan  Andrew Allen
Institution:1. Duke Cancer Institute, Duke University Medical Center, Durham, NC;2. Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC
Abstract:The score statistic continues to be a fundamental tool for statistical inference. In the analysis of data from high-throughput genomic assays, inference on the basis of the score usually enjoys greater stability, considerably higher computational efficiency, and lends itself more readily to the use of resampling methods than the asymptotically equivalent Wald or likelihood ratio tests. The score function often depends on a set of unknown nuisance parameters which have to be replaced by estimators, but can be improved by calculating the efficient score, which accounts for the variability induced by estimating these parameters. Manual derivation of the efficient score is tedious and error-prone, so we illustrate using computer algebra to facilitate this derivation. We demonstrate this process within the context of a standard example from genetic association analyses, though the techniques shown here could be applied to any derivation, and have a place in the toolbox of any modern statistician. We further show how the resulting symbolic expressions can be readily ported to compiled languages, to develop fast numerical algorithms for high-throughput genomic analysis. We conclude by considering extensions of this approach. The code featured in this report is available online as part of the supplementary material.
Keywords:Computer algebra  Genome-wide association study  Mathematical statistics  Nuisance parameters  Python  Trio data
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号