De Novo NGS Data Compression
[chapter]
2017
Algorithms for Next-Generation Sequencing Data
High-throughput sequencing machines decipher billions of nucleotides from DNA molecules at unprecedented speed. This mass of data is stored in large text files structured as lists of short DNA fragments. The fragments represent random, overlapping regions of one or several genomes. The overlapping fragments generate substantial redundancy that can be advantageously exploited to compress next-generation sequencing (NGS) data. This is the main motivation for developing dedicated compression techniques for this
doi:10.1007/978-3-319-59826-0_4
fatcat:pjctkfpul5cqvejdr5mtymfjm4