Overcoming bias and systematic errors in next generation sequencing data

Margaret A Taub, Hector Corrada Bravo, Rafael A Irizarry
2010 Genome Medicine  
High-throughput sequencing involves the parallel sequen cing of millions of DNA fragments simultaneously. Abstract Considerable time and e ort has been spent in developing analysis and quality assessment methods to allow the use of microarrays in a clinical setting. As is the case for microarrays and other high-throughput technologies, data from new high-throughput sequencing technologies are subject to technological and biological biases and systematic errors that can impact downstream
more » ... . Only when these issues can be readily identi ed and reliably adjusted for will clinical applications of these new technologies be feasible. Although much work remains to be done in this area, we describe consistently observed biases that should be taken into account when analyzing highthroughput sequencing data. In this article, we review current knowledge about these biases, discuss their impact on analysis results, and propose solutions.
doi:10.1186/gm208 pmid:21144010 pmcid:PMC3025429 fatcat:unvc2hplnnes7ej2hv3aao55bq