Dataset Information

How Good Are Statistical Models at Approximating Complex Fitness Landscapes?

ABSTRACT: Fitness landscapes determine the course of adaptation by constraining and shaping evolutionary trajectories. Knowledge of the structure of a fitness landscape can thus predict evolutionary outcomes. Empirical fitness landscapes, however, have so far only offered limited insight into real-world questions, as the high dimensionality of sequence spaces makes it impossible to exhaustively measure the fitness of all variants of biologically meaningful sequences. We must therefore revert to statistical descriptions of fitness landscapes that are based on a sparse sample of fitness measurements. It remains unclear, however, how much data are required for such statistical descriptions to be useful. Here, we assess the ability of regression models accounting for single and pairwise mutations to correctly approximate a complex quasi-empirical fitness landscape. We compare approximations based on various sampling regimes of an RNA landscape and find that the sampling regime strongly influences the quality of the regression. On the one hand it is generally impossible to generate sufficient samples to achieve a good approximation of the complete fitness landscape, and on the other hand systematic sampling schemes can only provide a good description of the immediate neighborhood of a sequence of interest. Nevertheless, we obtain a remarkably good and unbiased fit to the local landscape when using sequences from a population that has evolved under strong selection. Thus, current statistical methods can provide a good approximation to the landscape of naturally evolving populations.

SUBMITTER: du Plessis L

PROVIDER: S-EPMC4989103 | biostudies-literature | 2016 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

How Good Are Statistical Models at Approximating Complex Fitness Landscapes?

du Plessis Louis L Leventhal Gabriel E GE Bonhoeffer Sebastian S

Molecular biology and evolution 20160514 9

Fitness landscapes determine the course of adaptation by constraining and shaping evolutionary trajectories. Knowledge of the structure of a fitness landscape can thus predict evolutionary outcomes. Empirical fitness landscapes, however, have so far only offered limited insight into real-world questions, as the high dimensionality of sequence spaces makes it impossible to exhaustively measure the fitness of all variants of biologically meaningful sequences. We must therefore revert to statistica ...[more]

PMID: 27189564

Similar Datasets

Project description:Computational models of the musculoskeletal system are scientific tools used to study human movement, quantify the effects of injury and disease, plan surgical interventions, or control realistic high-dimensional articulated prosthetic limbs. If the models are sufficiently accurate, they may embed complex relationships within the sensorimotor system. These potential benefits are limited by the challenge of implementing fast and accurate musculoskeletal computations. A typical hand muscle spans over 3 degrees of freedom (DOF), wrapping over complex geometrical constraints that change its moment arms and lead to complex posture-dependent variation in torque generation. Here, we report a method to accurately and efficiently calculate musculotendon length and moment arms across all physiological postures of the forearm muscles that actuate the hand and wrist. Then, we use this model to test the hypothesis that the functional similarities of muscle actions are embedded in muscle structure. The posture dependent muscle geometry, moment arms and lengths of modeled muscles were captured using autogenerating polynomials that expanded their optimal selection of terms using information measurements. The iterative process approximated 33 musculotendon actuators, each spanning up to 6 DOFs in an 18 DOF model of the human arm and hand, defined over the full physiological range of motion. Using these polynomials, the entire forearm anatomy could be computed in <10 μs, which is far better than what is required for real-time performance, and with low errors in moment arms (below 5%) and lengths (below 0.4%). Moreover, we demonstrate that the number of elements in these autogenerating polynomials does not increase exponentially with increasing muscle complexity; complexity increases linearly instead. Dimensionality reduction using the polynomial terms alone resulted in clusters comprised of muscles with similar functions, indicating the high accuracy of approximating models. We propose that this novel method of describing musculoskeletal biomechanics might further improve the applications of detailed and scalable models to describe human movement.

Dataset Information

How Good Are Statistical Models at Approximating Complex Fitness Landscapes?

Publications

How Good Are Statistical Models at Approximating Complex Fitness Landscapes?

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets