Advancing the State of the Art in Computational Gene Prediction [chapter]

William H. Majoros, Uwe Ohler
Knowledge Discovery and Emergent Complexity in Bioinformatics  
Current methods for computationally predicting the locations and intron-exon structures of protein-coding genes in eukaryotic DNA are largely based on probabilistic, state-based generative models such as hidden Markov models and their various extensions. Unfortunately, little attention has been paid to the optimality of these models for the gene-parsing problem. Furthermore, as the prevalence of alternative splicing in human genes becomes more apparent, the "one gene, one parse" discipline
more » ... sed by virtually all current gene-finding systems becomes less attractive from a biomedical perspective. Because our ability to accurately identify all the isoforms of each gene in the genome is of direct importance to biomedicine, our ability to improve gene-finding accuracy both for human and non-human DNA clearly has a potential to significantly impact human health. In this paper we review current methods and suggest a number of possible directions for further research that may alleviate some of these problems and ultimately lead to better and more useful gene predictions.
doi:10.1007/978-3-540-71037-0_6 dblp:conf/kdecb/MajorosO06 fatcat:tud6gtajzjbddlbebm5llf2npm