Segmenting the Human Genome into Isochores

Paolo Cozzi, Luciano Milanesi, Giorgio Bernardi
2015 Evolutionary Bioinformatics  
The human genome is a mosaic of isochores, which are long (>200 kb) DNA sequences that are fairly homogeneous in base composition and can be assigned to five families comprising 33%-59% of GC composition. Although the compartmentalized organization of the mammalian genome has been investigated for more than 40 years, no satisfactory automatic procedure for segmenting the genome into isochores is available so far. We present a critical discussion of the currently available methods and a new
more » ... ach called isoSegmenter which allows segmenting the genome into isochores in a fast and completely automatic manner. This approach relies on two types of experimentally defined parameters, the compositional boundaries of isochore families and an optimal window size of 100 kb. The approach represents an improvement over the existing methods, is ideally suited for investigating long-range features of sequenced and assembled genomes, and is publicly available at https://github.com/bunop/isoSegmenter.
doi:10.4137/ebo.s27693 pmid:26640363 pmcid:PMC4662427 fatcat:z55l477z4jdgdmo6lozqlfvqka