A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
ArrowSAM: In-Memory Genomics Data Processing Using Apache Arrow
2020
2020 3rd International Conference on Computer Applications & Information Security (ICCAIS)
The rapidly growing size of genomics data bases, driven by advances in sequencing technologies, demands fast and cost-effective processing. However, processing this data creates many challenges, particularly in selecting appropriate algorithms and computing platforms. Computing systems need data closer to the processor for fast processing. Traditionally, due to cost, volatility and other physical constraints of DRAM, it was not feasible to place large amounts of working data sets in memory.
doi:10.1109/iccais48893.2020.9096725
fatcat:3hebcoquyzdm3hlbvf5ayyvo7i