SIA: Selection Inference Using the Ancestral Recombination Graph
release_2c7g4hkkjnas5ae4yjrs76nzzy
by
Hussein A Hejase,
Ziyi Mo,
Leonardo Campagna,
Adam Siepel
2021
Abstract
Detecting signals of selection from genomic data is a central problem in population genetics. Coupling the rich information in the ancestral recombination graph (ARG) with a powerful and scalable deep learning framework, we developed a novel method to detect and quantify positive selection: Selection Inference using the Ancestral recombination graph (SIA). Built on a Long Short-Term Memory (LSTM) architecture, a particular type of a Recurrent Neural Network (RNN), SIA can be trained to explicitly infer a full range of selection coefficients, as well as the allele frequency trajectory and time of selection onset. We benchmarked SIA extensively on simulations under a European human demographic model, and found that it performs as well or better as some of the best available methods, including state-of-the-art machine-learning and ARG-based methods. In addition, we used SIA to estimate selection coefficients at several loci associated with human phenotypes of interest. SIA detected novel signals of selection particular to the European (CEU) population at the <jats:italic>MC1R</jats:italic> and <jats:italic>ABCC11</jats:italic> loci. In addition, it recapitulated signals of selection at the <jats:italic>LCT</jats:italic> locus and several pigmentation-related genes. Finally, we reanalyzed polymorphism data of a collection of recently radiated southern capuchino seedeater taxa in the genus <jats:italic>Sporophila</jats:italic> to quantify the strength of selection and improved the power of our previous methods to detect partial soft sweeps. Overall, SIA uses deep learning to leverage the ARG and thereby provides new insight into how selective sweeps shape genomic diversity.
In application/xml+jats
format
Archived Files and Locations
application/pdf
1.3 MB
file_d7i72lwzdbfdnowhz3kz5ru65a
|
www.biorxiv.org (repository) web.archive.org (webarchive) |
application/pdf
1.3 MB
file_dljrumrqonfovnbr2tb357jqsq
|
www.biorxiv.org (repository) web.archive.org (webarchive) |
post
Stage
unknown
Date 2021-06-23
access all versions, variants, and formats of this works (eg, pre-prints)
Crossref Metadata (via API)
Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar