Decomposition of DNA Sequence Complexity

Pedro Bernaola-Galván, José L. Oliver, Ramón Román-Roldán
1999 Physical Review Letters  
Profiles of sequence compositional complexity provide a view of the spatial heterogeneity of symbolic sequences at different levels of detail. Sequence compositional complexity profiles are here decomposed into partial profiles using the branching property of the Shannon entropy. This decomposition shows the complexity contributed by each individual symbol or group of symbols. In particular, we apply this method to the mapping rules (symbol groupings) commonly used in DNA sequence analysis. We find that strong-weak bindings are remarkably homogeneously distributed as compared to purine-pyrimidine, and that A and T are the most heterogeneously distributed bases.
doi:10.1103/physrevlett.83.3336 fatcat:nn2ikxnl3zbw5djvsrahgw3jbe
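
The branching property of the Shannon entropy mentioned in the abstract can be illustrated on a single base composition. The sketch below is not the authors' method and does not reproduce their complexity profiles; it only shows that the entropy of the four-letter composition equals the entropy of a two-group mapping (here strong-weak, S = {C, G}, W = {A, T}) plus the within-group entropies weighted by the group frequencies. The function names, the example sequence, and the assumption that every symbol in the sequence belongs to exactly one group are all illustrative.

```python
import math
from collections import Counter


def shannon_entropy(probs):
    """Shannon entropy in bits; zero-probability terms contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def branching_decomposition(seq, groups):
    """Split the entropy of seq into a between-group and within-group part.

    groups maps a hypothetical group label to the symbols it contains,
    e.g. {"S": "CG", "W": "AT"}. Assumes every symbol of seq appears in
    exactly one group.
    """
    counts = Counter(seq)
    n = len(seq)
    total = shannon_entropy([c / n for c in counts.values()])

    group_probs = {}
    within = {}
    for label, symbols in groups.items():
        g = sum(counts[s] for s in symbols)
        group_probs[label] = g / n
        # Entropy of the symbols inside the group, weighted by group frequency.
        h = shannon_entropy([counts[s] / g for s in symbols]) if g > 0 else 0.0
        within[label] = (g / n) * h

    between = shannon_entropy(group_probs.values())
    return total, between, within


# Illustrative sequence only, not data from the paper.
total, between, within = branching_decomposition(
    "AAAACCGT", {"S": "CG", "W": "AT"}
)
```

Here `total` equals `between + sum(within.values())` up to floating-point error, which is the identity that lets a total quantity be attributed to a grouping and to the symbols inside each group.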