Statistical models for DNA copy number variation detection using read‐depth data from next generation sequencing experiments |
| |
Authors: | Tieming Ji; Jie Chen |
| |
Affiliation: | 1. Department of Statistics, University of Missouri at Columbia, Columbia, MO, USA; 2. Department of Biostatistics and Epidemiology, Medical College of Georgia, Augusta University, Augusta, GA, USA |
| |
Abstract: | In this ‘Big Data’ era, statisticians inevitably encounter data generated from various disciplines. In particular, advances in biotechnology have enabled scientists to produce enormous datasets from biological experiments. Over the last two decades, genomic studies have generated large volumes of high‐throughput microarray data. More recently, next generation sequencing (NGS) technology has come to play an important role in the study of genomic features, producing vast amounts of NGS data. One frequent application of NGS technology is the study of DNA copy number variants (CNVs), in which researchers use the resulting read count data to detect CNVs accurately. Computational and statistical approaches to detecting CNVs from NGS data are, however, very limited at present. In this review paper, we focus on read‐depth analysis in CNV detection and give a brief summary of the statistical methods currently used to search for CNVs in NGS data. In addition, based on the review, we discuss the challenges we face and future research directions. The ultimate goal of this review paper is to give a timely exposition of the surveyed statistical methods to researchers in related fields. |
| |
Keywords: | CNVs; next‐generation sequencing reads; change point model; read‐depth analysis |