A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
FASTAFS: file system virtualisation of random access compressed FASTA files
[article]
2020
bioRxiv
pre-print
The FASTA file format used to store polymeric sequence data has become a bioinformatics file standard used for decades. The relatively large files require additional files beyond the scope of the original format, to identify sequences and provide random access. Currently, multiple compressors have been developed to archive FASTA files back and forth, but these lack direct access to targeted content or metadata of the archive. Moreover, these solutions are not directly backwards compatible to
doi:10.1101/2020.11.11.377689
fatcat:4es44ocbanhhnko3nt4ll4o4ya