
Dataset Information


A Probabilistic Model for Indel Evolution: Differentiating Insertions from Deletions.


ABSTRACT: Insertions and deletions (indels) are common molecular evolutionary events. However, probabilistic models for indel evolution are under-developed due to their computational complexity. Here, we introduce several improvements to indel modeling: 1) While previous models for indel evolution assumed that the rates and length distributions of insertions and deletions are equal, here we propose a richer model that explicitly distinguishes between the two; 2) we introduce numerous summary statistics that allow approximate Bayesian computation-based parameter estimation; 3) we develop a method to correct for biases introduced by alignment programs, when inferring indel parameters from empirical data sets; and 4) using a model-selection scheme, we test whether the richer model better fits biological data compared with the simpler model. Our analyses suggest that both our inference scheme and the model-selection procedure achieve high accuracy on simulated data. We further demonstrate that our proposed richer model better fits a large number of empirical data sets and that, for the majority of these data sets, the deletion rate is higher than the insertion rate.
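The idea of estimating separate insertion and deletion rates with approximate Bayesian computation (ABC) can be illustrated with a toy rejection sampler. This is a minimal sketch, not the authors' implementation: the Poisson event simulator, the two count-based summary statistics, the uniform prior, and every function and parameter name below are assumptions made for illustration only.

```python
import math
import random


def sample_poisson(lam, rng):
    """Draw a Poisson variate with Knuth's method (fine for the small means used here)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def simulate_summary(ins_rate, del_rate, seq_len, rng):
    """Toy simulator (assumption): event counts are Poisson with mean rate * seq_len.

    Returns the summary statistics (insertion count, deletion count).
    """
    return (sample_poisson(ins_rate * seq_len, rng),
            sample_poisson(del_rate * seq_len, rng))


def abc_rejection(observed, seq_len, n_draws, keep_fraction, rng, max_rate=0.1):
    """Keep the prior draws whose simulated summaries are closest to the observed ones."""
    scored = []
    for _ in range(n_draws):
        # Independent uniform priors on the two rates (assumption).
        ins_rate = rng.uniform(0.0, max_rate)
        del_rate = rng.uniform(0.0, max_rate)
        simulated = simulate_summary(ins_rate, del_rate, seq_len, rng)
        scored.append((math.dist(simulated, observed), ins_rate, del_rate))
    scored.sort(key=lambda item: item[0])
    n_keep = max(1, round(n_draws * keep_fraction))
    return [(ins, dele) for _, ins, dele in scored[:n_keep]]


def posterior_mean(samples):
    """Average the accepted (insertion rate, deletion rate) pairs."""
    n = len(samples)
    return (sum(s[0] for s in samples) / n, sum(s[1] for s in samples) / n)


if __name__ == "__main__":
    rng = random.Random(0)
    observed = (10, 30)  # hypothetical counts, not taken from the study
    accepted = abc_rejection(observed, seq_len=500, n_draws=5000,
                             keep_fraction=0.02, rng=rng)
    print(posterior_mean(accepted))
```

Keeping the closest fraction of draws, rather than fixing a distance threshold, avoids having to choose a tolerance by hand. A realistic analysis would use many more summary statistics computed from alignments, as the abstract describes.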

SUBMITTER: Loewenthal G 

PROVIDER: S-EPMC8662616 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature


Publications

A Probabilistic Model for Indel Evolution: Differentiating Insertions from Deletions.

Gil Loewenthal, Dana Rapoport, Oren Avram, Asher Moshe, Elya Wygoda, Alon Itzkovitch, Omer Israeli, Dana Azouri, Reed A. Cartwright, Itay Mayrose, Tal Pupko

Molecular Biology and Evolution, 2021 Dec 1, issue 12



Similar Datasets

| S-EPMC2527138 | biostudies-literature
| S-EPMC10275563 | biostudies-literature
| S-EPMC3806772 | biostudies-literature
| S-EPMC9900211 | biostudies-literature
| S-EPMC2459192 | biostudies-literature
| S-EPMC11914627 | biostudies-literature
2023-09-28 | GSE244096 | GEO
| S-EPMC9248902 | biostudies-literature
| S-EPMC2940567 | biostudies-literature
| PRJEB36040 | ENA