Unknown

Dataset Information

0

Predicting the Landscape of Recombination Using Deep Learning.


ABSTRACT: Accurately inferring the genome-wide landscape of recombination rates in natural populations is a central aim in genomics, as patterns of linkage influence everything from genetic mapping to understanding evolutionary history. Here, we describe recombination landscape estimation using recurrent neural networks (ReLERNN), a deep learning method for estimating a genome-wide recombination map that is accurate even with small numbers of pooled or individually sequenced genomes. Rather than use summaries of linkage disequilibrium as its input, ReLERNN takes columns from a genotype alignment, which are then modeled as a sequence across the genome using a recurrent neural network. We demonstrate that ReLERNN improves accuracy and reduces bias relative to existing methods and maintains high accuracy in the face of demographic model misspecification, missing genotype calls, and genome inaccessibility. We apply ReLERNN to natural populations of African Drosophila melanogaster and show that genome-wide recombination landscapes, although largely correlated among populations, exhibit important population-specific differences. Lastly, we connect the inferred patterns of recombination with the frequencies of major inversions segregating in natural Drosophila populations.

SUBMITTER: Adrion JR 

PROVIDER: S-EPMC7253213 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting the Landscape of Recombination Using Deep Learning.

Adrion Jeffrey R JR   Galloway Jared G JG   Kern Andrew D AD  

Molecular biology and evolution 20200601 6


Accurately inferring the genome-wide landscape of recombination rates in natural populations is a central aim in genomics, as patterns of linkage influence everything from genetic mapping to understanding evolutionary history. Here, we describe recombination landscape estimation using recurrent neural networks (ReLERNN), a deep learning method for estimating a genome-wide recombination map that is accurate even with small numbers of pooled or individually sequenced genomes. Rather than use summa  ...[more]

Similar Datasets

| S-EPMC8002881 | biostudies-literature
| S-EPMC6121625 | biostudies-literature
| S-EPMC8016297 | biostudies-literature
| S-EPMC8071129 | biostudies-literature
2020-07-08 | GSE137436 | GEO
| S-EPMC6022534 | biostudies-literature
| S-EPMC8032721 | biostudies-literature
| S-EPMC8119673 | biostudies-literature
2020-07-08 | GSE137435 | GEO
| S-EPMC7372265 | biostudies-literature