A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2018; you can also visit the original URL.
The file type is application/pdf
.
Performance Analysis of a Parallel, Multi-node Pipeline for DNA Sequencing
[chapter]
2016
Lecture Notes in Computer Science
Post-sequencing DNA analysis typically consists of read mapping followed by variant calling and is very time-consuming, even on a multi-core machine. Recently, we proposed Halvade, a parallel, multi-node implementation of a DNA sequencing pipeline according to the GATK Best Practices recommendations. The MapReduce programming model is used to distribute the workload among different workers. In this paper, we study the impact of different hardware configurations on the performance of Halvade.
doi:10.1007/978-3-319-32152-3_22
fatcat:sq2tl5dlwrg35mcxy5fjzcir24