Good spaced seeds for homology search

Kwok Pui Choi, Fanfan Zeng, Louxin Zhang
Proceedings. Fourth IEEE Symposium on Bioinformatics and Bioengineering  
Motivation: Filtration is an important technique used to speed up local alignment as exemplified in the BLAST programs. Recently, Ma, Tromp and Li (2002) discovered that better filtering can be achieved by spacing out the matching positions according to a certain pattern, instead of contiguous positions to trigger a local alignment in their PatternHunter program. Such a match pattern is called a spaced seed. Results: Our numerical computation shows that the ranks of spaced seeds (based on
more » ... ivity) change with the sequences similarity. Since homologous sequences may have diverse similarity, we assess the sensitivity of spaced seeds over a range of similarity levels and present a list of good spaced seeds for facilitating homology search in DNA genomic sequences. We validate that the listed spaced seeds are indeed more sensitive using three arbitrarily chosen pairs of DNA genomic sequences.
doi:10.1109/bibe.2004.1317368 dblp:conf/bibe/ChoiZZ04 fatcat:achdiwprbngddjwdleueq74odq