Higher Compression from Burrows-Wheeler Transform for DNA Sequence

Rexline S., Aju Richard, Trujilla Lobo
2017 International Journal of Computer Applications  
Large amount of space is required to store biological sequences in DNA database like GenBank sequence database. The data storage for biological sequences has become very essential in today's current situation. Standard compression algorithms are not competent enough to compress biological sequences. In recent times, special algorithms have been introduced specifically for the purpose of compressing the biological sequences like DNA and protein sequences. In this paper, the Burrows-Wheeler
more » ... orm (BWT) based approaches are explored to compress the biological sequences. In comparison with the existing general purpose compression algorithms, the proposed BWT based method compresses these types of sequences better and at the same time the cost of Burrows-Wheeler Transform is almost insignificant. General Terms Algorithms Keywords DNA sequence compression, Burrows-Wheeler Transform, BWT and genome.
doi:10.5120/ijca2017915261 fatcat:n7bl3a5e2ffyjkoedhv2ch36qq