Computational design of genes encoding completely overlapping protein domains: Influence of genetic code and taxonomic rank
[article]
2020
bioRxiv
pre-print
Overlapping genes (OLGs) with long protein-coding overlapping sequences are often excluded by genome annotation programs, with the exception of virus genomes. A recent study used a novel algorithm to construct OLGs from arbitrary protein domain pairs and concluded that virus genes are best suited for creating OLGs, a result which fitted with common assumptions. However, improving sequence evaluation using Hidden Markov Models shows that the previous result is an artifact originating from
doi:10.1101/2020.09.25.312959
fatcat:yji6vno2sffutjtuwt2z6qrtju