Rapid Homology Search with Two-Stage Extension and Daughter Seeds [chapter]

Miklós Csűrös, Bin Ma
2005 Lecture Notes in Computer Science  
Using a seed to rapidly "hit" possible homologies for further examination is a common practice to speed up homology search in molecular sequences. It has been shown that a collection of higher-weight seeds has better sensitivity than a single lower-weight seed at the same speed. However, huge memory requirements diminish the advantages of high-weight seeds. This paper describes a two-stage extension method, which simulates high-weight seeds with modest memory requirements. The paper also ... s the use of so-called daughter seeds, which is an extension of the previously studied vector seed idea. Daughter seeds, especially when combined with the two-stage extension, provide the flexibility to maximize the independence between the seeds, which is a well-known criterion for maximizing sensitivity. Some other practical techniques to reduce memory usage are also discussed in the paper.
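As a rough illustration of what it means for a seed to "hit" a candidate homology, the sketch below checks a spaced seed against two sequences compared at the same offset. It is not the paper's two-stage extension or daughter-seed method. The function names `seed_hits` and `seed_weight`, the 0/1 string encoding of the seed, and the ungapped same-offset comparison are all assumptions made for this example.

```python
def seed_weight(seed: str) -> int:
    """Weight of a seed = number of positions that must match ('1' characters)."""
    return seed.count("1")


def seed_hits(seed: str, a: str, b: str) -> list[int]:
    """Return every offset i at which the seed hits.

    Assumption for this sketch: a and b are compared position by position
    (ungapped, same offset in both). A hit at i means a[i+k] == b[i+k]
    for every k where seed[k] == '1'; '0' positions are "don't care".
    """
    care = [k for k, c in enumerate(seed) if c == "1"]
    span = len(seed)
    n = min(len(a), len(b))
    return [
        i
        for i in range(n - span + 1)
        if all(a[i + k] == b[i + k] for k in care)
    ]


if __name__ == "__main__":
    a = "ACGTACGT"
    b = "ACCTACGA"
    print(seed_hits("1101", a, b))  # spaced seed of weight 3
    print(seed_hits("111", a, b))   # contiguous seed of weight 3
```

In a real search tool the hits would be found through a hash index rather than by scanning every offset. That index is where the memory cost of high-weight seeds mentioned in the abstract arises.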
doi:10.1007/11533719_13 fatcat:szlgvb5kpzgopktmxuw6qmk27e