A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2005; you can also visit the original URL.
The file type is application/pdf
.
Compressed pattern matching in DNA sequences
Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004.
We propose derivative Boyer-Moore (d-BM), a new compressed pattern matching algorithm in DNA sequences. This algorithm is based on the Boyer-Moore method, which is one of the most popular string matching algorithms. In this approach, we compress both DNA sequences and patterns by using two bits to represent each A, T, C, G character. Experiments indicate that this compressed pattern matching algorithm searches long DNA patterns (length > 50) more than 10 times faster than the exact match
doi:10.1109/csb.2004.1332418
dblp:conf/csb/ChenLR04
fatcat:747uhput2zfnvaiog3hapfxu7a