113 Hits in 0.82 sec

Pervasive selection biases inferences of the species tree [article]

Rui Borges, Bastien Boussau, Gergely Szollosi, Carolin Kosiol
2020 bioRxiv   pre-print
Despite the importance of natural selection in species' evolutionary history, phylogenetic methods that take into account population-level processes ignore selection. Assuming neutrality is often based on the idea that selection occurs at a minority of loci in the genome and is unlikely to significantly compromise phylogenetic inferences. However, selection might behave more pervasively, as it the case of nearly neutral evolving mutations. Genome-wide processes like GC-bias and some of the
more » ... tion segregating at the coding regions are known to evolve in the nearly neutral range. As we are now using genome-wide data to estimate species tree, it is just natural to ask whether weak, but pervasive, selection is likely to blur species tree inferences. Here, we employed a polymorphism-aware phylogenetic model, specially tailored for measuring signatures of nucleotide usage biases, to test the impact of nearly neutrally in the substitution process. Analyses with simulated data indicate that while the inferred relationships among species are not significantly compromised, the genetic distances are systematically underestimated, with the deeper nodes suffering more than the younger ones. Such biases have implications for molecular dating. We found signatures of GC-bias considerably affecting the estimated divergence times (up to 21%) of worldwide fruit fly populations. Our findings call for the need to account for nearly neutral forces (or any other form of pervasive selection) when quantifying divergence or dating species evolution.
doi:10.1101/2020.07.30.228965 fatcat:qua3bxnygvdc5aap77fea33wem

The inference of gene trees with species trees [article]

Gergely J. Szöllosi, Eric Tannier, Vincent Daubin, Bastien Boussau
2013 arXiv   pre-print
On one hand, Boussau et al. programming, but without considering dated gene trees. This allowed them to avoid using a model of rate evolution.  ...  As such searches can be computationally intensive, SPRs are sometimes bounded [Boussau et al., 20 2013] , or rearrangements are directed [Szöllősi et al., 2012] .  ... 
arXiv:1311.0651v1 fatcat:nwhpj4yuojcwdinu5jcxjhsiie

Accurate detection of convergent amino-acid evolution with PCOC [article]

Carine Rey, Laurent Gueguen, Marie Semon, Bastien Boussau
2018 bioRxiv   pre-print
We used bppml (Dutheil and Boussau, 2008) to measure the likelihood of each model.  ...  associated to each branch, with rate heterogeneity across sites according to a Gamma distribution discretized in 4 classes (Yang, 1994) with the α parameter set to 1.0, using bppseqgen (Dutheil and Boussau  ... 
doi:10.1101/247296 fatcat:xl4szq4kmralzkczbbx4bn63ma

Relative time constraints improve molecular dating [article]

Gergely J Szollosi, Sebastian Hoehna, Tom A Williams, Dominik Schrempf, Vincent Daubin, Bastien Boussau
2020 biorxiv/medrxiv   pre-print
Scripts are available at  ...  The script is available at  ...  Data availability Scripts and data used to run the simulation analyses are available at Data for the empirical data analysis has been deposited at: https  ... 
doi:10.1101/2020.10.17.343889 fatcat:xlsebfnnqnhxtff6x6pg4lgapm

Probabilistic Graphical Model Representation in Phylogenetics [article]

Sebastian Höhna, Tracy A. Heath, Bastien Boussau, Michael J. Landis, Fredrik Ronquist, John P. Huelsenbeck
2013 arXiv   pre-print
using site-specific rate-multipliers , branch-specific rate-multipliers ), branch-specific substitution rate matrices (Yang and Roberts 1995; Galtier and Gouy 1998) and site-specific tree topologies (Boussau  ...  Hence, belief propagation can be used in various other examples such as a mixture over the rates of positive selection (Yang and Nielsen 2002; Huelsenbeck and Dyer 2004) , mixture over tree topologies (Boussau  ... 
arXiv:1312.2386v1 fatcat:7retp2em7fg33b37zlruxrb4ua

Bayesian investigation of SARS-CoV-2-related mortality in France [article]

Louis Duchemin, Philippe Veber, Bastien Boussau
2020 biorxiv/medrxiv   pre-print
fr/boussau/corona_french_regions ; an archive of the code state at the time of publication m is the Peer Community In Mathematical and Computational Biology 4 of 28 .  ... 
doi:10.1101/2020.06.09.20126862 fatcat:c6fko4v4wbellbukig3hspfvvq

Drift and Genome Complexity Revisited

Kenneth D. Whitney, Bastien Boussau, Eric J. Baack, Theodore Garland, Nancy A. Moran
2011 PLoS Genetics  
doi:10.1371/journal.pgen.1002092 pmid:21695239 pmcid:PMC3111538 fatcat:ekm4dijiuzcnhn2ftdnwpe7pue

Relative time constraints improve molecular dating

Gergely J Szöllősi, Sebastian Höhna, Tom A Williams, Dominik Schrempf, Vincent Daubin, Bastien Boussau
2021 Systematic Biology  
Scripts are available at 12  ...  with 102 taxa, calibrated nodes and node order constraints. ).Python code using the ete3 library (Huerta-Cepas, Serra, and Bork 2016) and RevBayes code to simulate sequences and  ... 
doi:10.1093/sysbio/syab084 pmid:34668564 fatcat:vaqukambwrcavlntg3mpg6b6si

Detecting convergent adaptive amino acid evolution: Supplementary material [article]

Carine Rey, Vincent Lanore, Philippe Veber, Laurent Gueguen, Nicolas Lartillot, Marie Semon, Bastien Boussau
2019 bioRxiv   pre-print
In evolutionary genomics, researchers have taken an interest in identifying in the genomes substitutions that subtend convergent phenotypic adaptations. This is a difficult question to address, because genomes contain billions of sites, many of which have substituted in the lineages where the adaptations took place, and yet are not linked to them. Those extra substitutions may be linked to other adaptations, may be neutral, or may be linked to mutational biases. Furthermore, one can think of
more » ... ious ways of defining substitutions of interest, and various methods that match those definitions have been used, resulting in different sets of candidate substitutions. In this manuscript we first clarify how adaptation to convergent phenotypic evolution can manifest itself in coding sequences. Second, we review methods that have been proposed to detect convergent adaptive evolution in coding sequences and expose the assumptions that underlie them. Finally, we examine their power on simulations of convergent changes, including in the presence of a confounding factor.
doi:10.1101/513010 fatcat:4j7fusbytzbzpi4kdfnayi6p6u

Efficient Exploration of the Space of Reconciled Gene Trees [article]

Gergely J. Szöllősi and Wojciech Rosikiewicz and Bastien Boussau and Eric Tannier and Vincent Daubin
2013 arXiv   pre-print
and Daubin, 2010; Boussau et al., 2012) .  ...  Joint inference of species and gene trees offers a path toward surmounting this obstacle (Boussau and Daubin, 2010; Boussau et al., 2012; .  ... 
arXiv:1306.2167v1 fatcat:a6gqpyr2urhjvn6nirevk3k7rm

Bayesian investigation of SARS-CoV-2-related mortality in France

Louis Duchemin, Philippe Veber, Bastien Boussau
2022 Peer Community Journal  
The code used for the experiments is available at french_regions Results We first investigate whether model 1 can capture the major trends of the epidemic in the  ... 
doi:10.24072/pcjournal.84 fatcat:x6ydhlbalnbezgcc5ikvz2yki4

CAARS: comparative assembly and annotation of RNA-Seq data

Carine Rey, Philippe Veber, Bastien Boussau, Marie Sémon, Janet Kelso
2018 Bioinformatics  
Quality of the trees can be improved by using reconstruction methods that use the information provided by the species tree (Boussau et al., 2013; Ullah et al., 2015) .  ...  Then, accurate gene trees are inferred using a phylogenetic pipeline that uses the information coming from both the alignments and the species tree (Boussau et al., 2013) and identifies events of gene  ... 
doi:10.1093/bioinformatics/bty903 pmid:30452539 pmcid:PMC6596894 fatcat:6zgu2kwjpzf7jncddtotwlcwy4

Gene transfers, like fossils, can date the Tree of Life [article]

Adrian A. Davin, Eric Tannier, Tom A. Williams, Bastien Boussau, Vincent Daubin, Gergely J. Szollosi
2017 bioRxiv   pre-print
Biodiversity has always been predominantly microbial and the scarcity of fossils from bacteria, archaea and microbial eukaryotes has prevented a comprehensive dating of the tree of life. Here we show that patterns of lateral gene transfer deduced from the analysis of modern genomes encode a novel and abundant source of information about the temporal coexistence of lineages throughout the history of life. We use new phylogenetic methods to reconstruct the history of thousands of gene families
more » ... demonstrate that dates implied by gene transfers are consistent with estimates from relaxed molecular clocks in Bacteria, Archaea and Eukaryotes. An inspection of discrepancies between transfers and clocks and a comparison with mammal fossils show that gene transfer in microbes is potentially as informative for dating the tree of life as the geological record in macroorganisms.
doi:10.1101/193813 fatcat:tpofgtu6erf5ppmit7qw4urche

The Inference of Gene Trees with Species Trees

Gergely J. Szöllősi, Eric Tannier, Vincent Daubin, Bastien Boussau
2014 Systematic Biology  
Figure reprinted with permission from Boussau et al. (2013) .  ...  As such searches can be computationally intensive, SPRs are sometimes bounded (Boussau et al. 2013) , or rearrangements are directed .  ... 
doi:10.1093/sysbio/syu048 pmid:25070970 pmcid:PMC4265139 fatcat:lsfkomvpfzdkff6gec46y34d6a

Parallel adaptations to high temperatures in the Archaean eon

Bastien Boussau, Samuel Blanquart, Anamaria Necsulea, Nicolas Lartillot, Manolo Gouy
2008 Nature  
Because fossils from the time when cellular life originated and diversified are scant and difficult to interpret 1 , alternative means to investigate the ecology of the Last Universal Common Ancestor (LUCA) and of the ancestors of the three domains of life are of great scientific value. It was recently recognized that footprints of the effect of temperature on ancestral organisms could be uncovered in extant genomes 2-4 . Accordingly, analyses of resurrected proteins predicted that the
more » ... ancestor was thermophilic and that Bacteria subsequently adapted to lower temperatures 3,4 . Since the archaeal ancestor is also thought to have been thermophilic 5 , LUCA was parsimoniously inferred as thermophilic too. However, an analysis of ribosomal RNAs supported the hypothesis of a non-hyperthermophilic LUCA 2 . Here we show that both rRNA and protein sequences analysed with advanced, realistic models of molecular evolution 6,7 provide independent support for two phases in the history of environmental temperature changes over the tree of life: in the first period, thermotolerance increased from a mesophilic LUCA to thermophilic ancestors of Bacteria and of Archaea-Eukaryota; in the second period, it decreased. Therefore, the two lineages descending from LUCA and leading to the ancestors of Bacteria and Archaea-Eukaryota convergently adapted to high temperatures, maybe in response to a climate change of the early Earth 1,8,9 , and/or aided by the transition from an RNA genome in LUCA to organisms with more thermostable DNA genomes 10,11 . This analysis unifies apparently contradictory results 2-4 into a coherent depiction of the evolution of an ecological trait over the entire tree of life. Was LUCA a hyperthermophilic, i.e., with optimal growth temperature (OGT) ≥ 80 °C, thermophilic (80 °C ≥ OGT ≥ 50 °C), or mesophilic (OGT ≤ 50 °C) organism? Several investigations of this question have relied on correlations between the species OGT and the composition of their macromolecular sequences. In extant prokaryotic species, the G+C content of ribosomal RNA (rRNA) stems (i.e., double-stranded parts) has been shown to correlate with OGT 12 . Exploiting this correlation, Galtier et al. obtained support for a non-hyperthermophilic LUCA 2 . In contrast, studies based on correlations between the composition of LUCA's proteins and OGT concluded in favour of a hyperthermophilic LUCA 13,14 and of hyperthermophilic ancestors for both Archaea and Bacteria. The discrepancy between these results could come from some unexplained incongruence between rRNA and proteins, or, as we shall see, from differences between employed evolutionary models. All three investigations 2,13,14 based their conclusions on comparisons of reconstructed ancestral sequence compositions with extant ones. Accurate modelling of the evolution of compositions is therefore crucial for such approaches. On one side, two of these studies 13,14 relied on homogeneous models of evolution which make the simplifying hypothesis that substitutions occur with constant probabilities along time and across all lineages. Had genomes and proteins evolved according to a homogeneous model, they would all share the same base and amino acid compositions. Clearly, rRNA 12 and protein sequences 15 do not. On the other side, Galtier et al. 2 relied on a branch-heterogeneous model of RNA sequence evolution. Branch-heterogeneous models are computationally more challenging but more realistic as they allow replacement or substitution probabilities to vary between lineages, and thus explicitly account
doi:10.1038/nature07393 pmid:19037246 fatcat:f3pzhvme5fewrbkc6gn6cn53fa
« Previous Showing results 1 — 15 out of 113 results