Insights into Bacterial Genome Composition through Variable Target GC Content Profiling

Scott Mann, Jinyan Li, Yi-Ping Phoebe Chen
2010 Journal of Computational Biology  
This study presents a new computational method for guanine (G) and cytosine (C), or GC, content profiling based on the idea of multiple resolution sampling (MRS). The benefit of our new approach over existing techniques follows from its ability to locate significant regions without prior knowledge of the sequence, nor the features being sought. The use of MRS has provided novel insights into bacterial genome composition. Key findings include those that are related to the core composition of
more » ... composition of bacterial genomes, to the identification of large genomic islands (in Enterobacterial genomes), and to the identification of surface protein determinants in human pathogenic organisms (e.g., Staphylococcus genomes). We observed that bacterial surface binding proteins maintain abnormal GC content, potentially pointing to a viral origin. This study has demonstrated that GC content holds a high informational worth and hints at many underlying evolutionary processes. For online Supplementary Material, see www.liebertonline.com.
doi:10.1089/cmb.2009.0058 pmid:20078399 fatcat:j4mezgpdbna6pkde5m3mb6s324