Facilitating the Calculation of the Efficient Score Using Symbolic Computing期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Facilitating the Calculation of the Efficient Score Using Symbolic Computing

Authors:	Alexander B Sibley Zhiguo Li Yu Jiang Yi-Ju Li Cliburn Chan Andrew Allen

Institution:	1. Duke Cancer Institute, Duke University Medical Center, Durham, NC;2. Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC

Abstract:	The score statistic continues to be a fundamental tool for statistical inference. In the analysis of data from high-throughput genomic assays, inference on the basis of the score usually enjoys greater stability, considerably higher computational efficiency, and lends itself more readily to the use of resampling methods than the asymptotically equivalent Wald or likelihood ratio tests. The score function often depends on a set of unknown nuisance parameters which have to be replaced by estimators, but can be improved by calculating the efficient score, which accounts for the variability induced by estimating these parameters. Manual derivation of the efficient score is tedious and error-prone, so we illustrate using computer algebra to facilitate this derivation. We demonstrate this process within the context of a standard example from genetic association analyses, though the techniques shown here could be applied to any derivation, and have a place in the toolbox of any modern statistician. We further show how the resulting symbolic expressions can be readily ported to compiled languages, to develop fast numerical algorithms for high-throughput genomic analysis. We conclude by considering extensions of this approach. The code featured in this report is available online as part of the supplementary material.

Keywords:	Computer algebra Genome-wide association study Mathematical statistics Nuisance parameters Python Trio data

设为首页 | 免责声明 | 关于勤云 | 加入收藏