Automatic Gene Recognition without Using Training Data

Kiyoshi Asai, Yutaka Ueno, Katunobu Itou, Tetsushi Yada
1997 Genome Informatics Series  
In this paper, we propose a new approach to gene recognition that uses no training data for the recognizer. We start from a simple model that uses only the knowledge of start codons and stop codons, and then repeat two steps: the recognizer parses the DNA sequences, and its parameters are retrained on the result of that recognition. We applied this parse-and-train approach to the complete genome sequence of a cyanobacterium and achieved the ... same recognition rate as when the whole sequence is used as training data. These results open the possibility of using automatic gene annotation systems in the early stages of sequencing projects.
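The loop of recognizing and then retraining described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' recognizer: the abstract does not specify its model, so a simple codon-frequency log-odds score against a uniform background is swapped in here. The sketch also assumes ATG as the only start codon, scans the forward strand only, and uses hypothetical names throughout (`find_orfs`, `train`, `score`, `parse_and_train`).

```python
import math
from collections import Counter

START = {"ATG"}                 # assumption: the abstract only says "start codons"
STOPS = {"TAA", "TAG", "TGA"}   # standard stop codons

def find_orfs(seq, min_codons=3):
    """Initial model: every start...stop open reading frame (forward strand only)."""
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon in START:
                start = i
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))  # end index is past the stop codon
                start = None
    return orfs

def codons(seq, start, end):
    """Codons of an ORF, excluding the final stop codon."""
    return [seq[i:i + 3] for i in range(start, end - 3, 3)]

def train(seq, genes):
    """Re-estimate codon log-probabilities from the currently predicted genes."""
    counts = Counter(c for s, e in genes for c in codons(seq, s, e))
    total = sum(counts.values()) + 64  # add-one smoothing over the 64 codons
    return lambda c: math.log((counts[c] + 1) / total)

def score(seq, orf, logp):
    """Log-odds of an ORF under the trained model versus a uniform background."""
    return sum(logp(c) - math.log(1 / 64) for c in codons(seq, *orf))

def parse_and_train(seq, iterations=5):
    """Alternate recognition and retraining, using no labelled training data."""
    candidates = find_orfs(seq)
    genes = candidates  # start by treating every ORF as a gene
    for _ in range(iterations):
        logp = train(seq, genes)                                      # train step
        new_genes = [o for o in candidates if score(seq, o, logp) > 0]  # parse step
        if not new_genes or new_genes == genes:
            break  # converged, or nothing passed the threshold
        genes = new_genes
    return genes
```

Calling `parse_and_train(genome)` on a DNA string returns a list of `(start, end)` index pairs. The only prior knowledge the sketch uses is the start and stop codon sets, which mirrors the starting point the abstract describes. Everything else is re-estimated from the sketch's own predictions.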
doi:10.11234/gi1990.8.15 fatcat:lcuauczyejf4jdt4uejk5olvzu