A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Iterative error correction of long sequencing reads maximizes accuracy and improves contig assembly
2016
Briefings in Bioinformatics
Next-generation sequencers such as Illumina can now produce reads up to 300 bp with high throughput, which is attractive for genome assembly. A first step in genome assembly is to computationally correct sequencing errors. However, correcting all errors in these longer reads is challenging. Here, we show that reads with remaining errors after correction often overlap repeats, where short erroneous k-mers occur in other copies of the repeat. We developed an iterative error correction pipeline
doi:10.1093/bib/bbw003
pmid:26868358
pmcid:PMC5221426
fatcat:dbzyh5iodzdjtn23vzavlc6iue