GENOME ANALYSIS OF GENBANK KNOWN RABBIT (Oryctolagus cuniculus) GENES
World Rabbit Science
After downloading all known rabbit genes, key words specific to complete coding sequences were used to filter out partial coding sequences. As a result, 160 full-length, nuclear, protein-coding and functionally annotated genes were extracted from GenBank database. These genes were subjected to synonymous codon and amino acid usage analysis. The results showed a clear base composition bias in the genes analyzed. The effective number of codons used (Nc values) ranged between 30.07 and 59.98 with
... .07 and 59.98 with a mean of 51.31 with a standard deviation (SD) of 5.71. The frequency of G + C at the synonymous third position of codons (GC3s) varied between 0.3 and 0.96 with a mean of 0.55 and a SD of 0.14, clearly indicating marked variation and heterogeneity in codon usage patterns among the different genes. The distribution of relative synonymous codon usage (RSCU) values calculated for all the genes indicated that codons with a G or C in the third position are widely employed, and although RCSU values for A/T or G/C ending codons are almost equal, G/C appears to play a dominant role. This pattern of dominance was confirmed by the distribution of amino acid occurrence. Leucine (CUN) and serine (UCN) were the most frequent amino acids, followed by arginine (CGN), proline (CCN), glycine (GGN) and alanine (GCN), in relatively equal proportions. The heterogeneity observed in the analyzed genes was then further probed by multivariate statistical analyses. A major trend in codon usage that correlated with gene expression values was revealed. These findings suggest that translational efficiency exerts a stronger influence on codon usage preference than compositional bias in the sampled rabbit genes.