Referee report. For: QUARTIC: QUick pArallel algoRithms for high-Throughput sequencIng data proCessing [version 3; peer review: 2 approved]

Ramon Amela Milian, Salvador Capella-Gutierrez
2020
Life science has entered the so-called 'big data era', in which biologists, clinicians and bioinformaticians are overwhelmed with high-throughput sequencing data. While these data offer new insights into genome structure, their ever-growing size raises major challenges for their use in daily clinical practice, care and diagnosis. Therefore, we implemented software to reduce the time to delivery for the alignment and sorting of high-throughput sequencing data. Our solution is implemented using the Message Passing Interface and is intended for high-performance computing architectures. The software scales linearly with respect to the size of the data and ensures total reproducibility with respect to the traditional tools. For example, a 300X whole genome can be aligned and sorted in less than 9 hours with 128 cores. The software offers significant speed-up using multi-core and multi-node parallelization.
doi:10.5256/f1000research.30033.r72713 fatcat:wg3oosk36fhprhlfaga55bmkgq