Dataset Information

Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed.

ABSTRACT: Multiple algorithms are used to predict the impact of missense mutations on protein structure and function using algorithm-generated sequence alignments or manually curated alignments. We compared the accuracy with native alignment of SIFT, Align-GVGD, PolyPhen-2, and Xvar when generating functionality predictions of well-characterized missense mutations (n = 267) within the BRCA1, MSH2, MLH1, and TP53 genes. We also evaluated the impact of the alignment employed on predictions from these algorithms (except Xvar) when supplied the same four alignments including alignments automatically generated by (1) SIFT, (2) Polyphen-2, (3) Uniprot, and (4) a manually curated alignment tuned for Align-GVGD. Alignments differ in sequence composition and evolutionary depth. Data-based receiver operating characteristic curves employing the native alignment for each algorithm result in area under the curve of 78-79% for all four algorithms. Predictions from the PolyPhen-2 algorithm were least dependent on the alignment employed. In contrast, Align-GVGD predicts all variants neutral when provided alignments with a large number of sequences. Of note, algorithms make different predictions of variants even when provided the same alignment and do not necessarily perform best using their own alignment. Thus, researchers should consider optimizing both the algorithm and sequence alignment employed in missense prediction.

SUBMITTER: Hicks S

PROVIDER: S-EPMC4154965 | biostudies-literature | 2011 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed.

Hicks Stephanie S Wheeler David A DA Plon Sharon E SE Kimmel Marek M

Human mutation 20110407 6

Multiple algorithms are used to predict the impact of missense mutations on protein structure and function using algorithm-generated sequence alignments or manually curated alignments. We compared the accuracy with native alignment of SIFT, Align-GVGD, PolyPhen-2, and Xvar when generating functionality predictions of well-characterized missense mutations (n = 267) within the BRCA1, MSH2, MLH1, and TP53 genes. We also evaluated the impact of the alignment employed on predictions from these algori ...[more]

PMID: 21480434

Dataset Information

Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed.

Publications

Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

FOGSAA: Fast Optimal Global Sequence Alignment Algorithm.
| S-EPMC3638164 | biostudies-literature

BALSA: Bayesian algorithm for local sequence alignment.
| S-EPMC101229 | biostudies-literature

SAGA: sequence alignment by genetic algorithm.
| S-EPMC145823 | biostudies-other

RAGA: RNA sequence alignment by genetic algorithm.
| S-EPMC147093 | biostudies-other

Prediction of antimicrobial peptides based on sequence alignment and support vector machine-pairwise algorithm utilizing LZ-complexity.
| S-EPMC4352747 | biostudies-literature

In silico analysis of missense substitutions using sequence-alignment based methods.
| S-EPMC3431198 | biostudies-literature

EpiAlignment: alignment with both DNA sequence and epigenomic data.
| S-EPMC6602515 | biostudies-literature

Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.
| S-EPMC1579236 | biostudies-literature

Pairwise Heuristic Sequence Alignment Algorithm Based on Deep Reinforcement Learning.
| S-EPMC8901008 | biostudies-literature

BitPAl: a bit-parallel, general integer-scoring sequence alignment algorithm.
| S-EPMC4221118 | biostudies-literature