A novel index which precisely derives protein coding regions from cross-species genome alignments
release_fko4iyvm75dwtchxwp5x6d33uy
by
Hideki Noguchi,
Tetsushi Yada,
Yoshiyuki Sakaki
2002 Volume 13, p183-91
Abstract
We introduce here a novel index which precisely derives protein coding regions from cross-species genome alignments. The index is deeply related to frame recovery observed in coding sequence alignments, that is, if insertions or deletions of nucleotides causes frame shifts in coding regions, other in-dels which recover the reading frames will be often observed in the vicinity. In contrast, such frame recoveries are not observed in other conserved regions. We prepared two gene models: a model which finds gene by using sequence similarity and intrinsic gene measures (basic model), and the other model which finds gene by using frame recovery index in addition to sequence similarity and intrinsic gene measures (frame recovery model). We evaluated the prediction accuracies of the two models, and our benchmark test revealed that frame recovery model significantly improved the prediction accuracy in comparison with basic model.
In text/plain
format
Archived Files and Locations
application/pdf
319.6 kB
file_lc3pzo6x3bfgpnaiernglhslri
|
www.jsbi.org (web) web.archive.org (webarchive) |
14571387
access all versions, variants, and formats of this works (eg, pre-prints)