A GRAMMAR BASED METHODOLOGY FOR STRUCTURAL MOTIF FINDING IN ncRNA DATABASE SEARCH

Daniel Quest, William Tapprich, Hesham Ali
2007 Computational Systems Bioinformatics  
In recent years, sequence database searching has been conducted through local alignment heuristics, patternmatching, and comparison of short statistically significant patterns. While these approaches have unlocked many clues as to sequence relationships, they are limited in that they do not provide context-sensitive searching capabilities (e.g. considering pseudoknots, protein binding positions, and complementary base pairs). Stochastic grammars (hidden Markov models HMMs and stochastic
more » ... free grammars SCFG) do allow for flexibility in terms of local context, but the context comes at the cost of increased computational complexity. In this paper we introduce a new grammar based method for searching for RNA motifs that exist within a conserved RNA structure. Our method constrains computational complexity by using a chain of topology elements. Through the use of a case study we present the algorithmic approach and benchmark our approach against traditional methods.
doi:10.1142/9781860948732_0024 fatcat:hofksr4bavcqfjagm3xeu2qkiy