Dataset Information

A novel approach for choosing summary statistics in approximate Bayesian computation.

ABSTRACT: The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θ_anc = 4 N_e u) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L2-loss performs best. Applying that method to the ibex data, we estimate θ_anc ≈ 1.288 and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10^-4 and 3.5 × 10^-3 per locus per generation. The proportion of males with access to matings is estimated as ω ≈ 0.21, which is in good agreement with recent independent estimates.
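
The idea summarized in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation. It assumes a toy model (draws from Normal(theta, 1) with a uniform prior), four hypothetical candidate statistics, a hand-written componentwise L2 boosting routine, and plain rejection ABC. The selection here is global rather than local, and using the boosted fit as a single summary statistic is a simplifying assumption of this sketch. All function and variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)

def simulate(theta, n_obs=20):
    """Toy model (assumption): n_obs draws from Normal(theta, 1)."""
    return rng.normal(theta, 1.0, size=n_obs)

def candidate_stats(x):
    """Hypothetical candidates: mean, median, variance, and pure noise."""
    return np.array([x.mean(), np.median(x), x.var(ddof=1), rng.normal()])

def l2_boost(S, y, n_steps=100, nu=0.1):
    """Componentwise L2 boosting; columns of S must be centered and scaled."""
    coef = np.zeros(S.shape[1])
    intercept = y.mean()
    resid = y - intercept
    col_ss = (S ** 2).sum(axis=0)
    for _ in range(n_steps):
        # Least-squares slope of the current residuals on each single column.
        slopes = S.T @ resid / col_ss
        # Pick the column that reduces the residual sum of squares the most.
        j = np.argmax(slopes ** 2 * col_ss)
        # Take a small step (learning rate nu) along that column only.
        coef[j] += nu * slopes[j]
        resid -= nu * slopes[j] * S[:, j]
    return intercept, coef

# 1. Simulate parameters from the prior and the matching candidate statistics.
n_sim = 5000
theta_sim = rng.uniform(0.0, 10.0, size=n_sim)
stats_sim = np.array([candidate_stats(simulate(t)) for t in theta_sim])

# 2. Standardize the statistics and boost the parameter on them.
mu, sd = stats_sim.mean(axis=0), stats_sim.std(axis=0)
S = (stats_sim - mu) / sd
intercept, coef = l2_boost(S, theta_sim)

# 3. Pseudo-observed data generated under a known parameter value.
theta_true = 4.0
s_obs = (candidate_stats(simulate(theta_true)) - mu) / sd

# 4. Rejection ABC on the boosted summary: keep the 50 closest simulations.
proj_sim = intercept + S @ coef
proj_obs = intercept + s_obs @ coef
dist = np.abs(proj_sim - proj_obs)
accepted = theta_sim[np.argsort(dist)[:50]]
posterior_mean = accepted.mean()

print("boosting coefficients (mean, median, variance, noise):", coef)
print("approximate posterior mean:", posterior_mean)
```

In this toy setting the informative statistics (mean and median) should receive most of the weight, while the noise statistic should receive little or none. A local variant would refit the boosting step using only simulations close to the observed data.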

SUBMITTER: Aeschbacher S

PROVIDER: S-EPMC3522150 | biostudies-literature | 2012 Nov

REPOSITORIES: biostudies-literature

Publications

A novel approach for choosing summary statistics in approximate Bayesian computation.

Aeschbacher S, Beaumont MA, Futschik A

Genetics. 2012-09-07; issue 3

PMID: 22960215

Similar Datasets

Informative and adaptive distances and summary statistics in sequential approximate Bayesian computation. | S-EPMC10202307 | biostudies-literature
An automatic adaptive method to combine summary statistics in approximate Bayesian computation. | S-EPMC7410215 | biostudies-literature
Personalized pathology test for Cardio-vascular disease: Approximate Bayesian computation with discriminative summary statistics learning. | S-EPMC8939803 | biostudies-literature
An Approximate Bayesian Computation Approach for Modeling Genome Rearrangements. | S-EPMC9692237 | biostudies-literature
Saa2016 - Mammalian methionine cycle - approximate bayesian computation | MODEL1603150000 | biostudies-other
Cophylogeny reconstruction via an approximate Bayesian computation. | S-EPMC4395844 | biostudies-literature
Quantum approximate Bayesian computation for NMR model inference. | S-EPMC7643990 | biostudies-literature
Demographic inference through approximate-Bayesian-computation skyline plots. | S-EPMC5518730 | biostudies-literature
Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation. | S-EPMC2639274 | biostudies-literature
Lack of confidence in approximate Bayesian computation model choice. | S-EPMC3174657 | biostudies-literature