Dataset Information

Identification of functional genetic variation in exome sequence analysis.

ABSTRACT: Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed studies, either population based or functional, for each of the identified variants are not practicable. Therefore algorithms that yield valid predictions of a variant's functional significance are needed. Over the past decade, several programs have been developed to predict the probability that an observed sequence variant will have a deleterious effect on protein function. These algorithms range from empirical programs that classify using known biochemical properties to statistical algorithms trained using a variety of data sources, including sequence conservation data, biochemical properties, and functional data. Using data from the pilot3 study of the 1000 Genomes Project available through Genetic Analysis Workshop 17, we compared the results of four programs (SIFT, PolyPhen, MAPP, and VarioWatch) used to predict the functional relevance of variants in 101 genes. Analysis was conducted without knowledge of the simulation model. Agreement between programs was modest ranging from 59.4% to 71.4% and only 3.5% of variants were classified as deleterious and 10.9% as tolerated across all four programs.

SUBMITTER: Jaffe A

PROVIDER: S-EPMC3287847 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Identification of functional genetic variation in exome sequence analysis.

Jaffe Andrew A Wojcik Genevieve G Chu Audrey A Golozar Asieh A Maroo Ankit A Duggal Priya P Klein Alison P AP

BMC proceedings 20111129

Recent technological advances have allowed us to study individual genomes at a base-pair resolution and have demonstrated that the average exome harbors more than 15,000 genetic variants. However, our ability to understand the biological significance of the identified variants and to connect these observed variants with phenotypes is limited. The first step in this process is to identify genetic variation that is likely to result in changes to protein structure and function, because detailed stu ...[more]

PMID: 22373437

Dataset Information

Identification of functional genetic variation in exome sequence analysis.

Publications

Identification of functional genetic variation in exome sequence analysis.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Functional and genetic analysis of haplotypic sequence variation at the nicastrin genomic locus.
| S-EPMC3683320 | biostudies-literature

Genetic variation in an individual human exome.
| S-EPMC2493042 | biostudies-literature

Copy number variation detection and genotyping from exome sequence data.
| S-EPMC3409265 | biostudies-literature

Exome Sequence Analysis Suggests that Genetic Burden Contributes to Phenotypic Variability and Complex Neuropathy.
| S-EPMC4545408 | biostudies-literature

Identification of copy number variants from exome sequence data.
| S-EPMC4132917 | biostudies-literature

Whole-exome analysis in Tunisian Imazighen and Arabs shows the impact of demography in functional variation.
| S-EPMC8548440 | biostudies-literature

Whole exome sequence analysis of Peters anomaly.
| S-EPMC4395516 | biostudies-literature

Identification of Common and Rare Genetic Variation Associated With Plasma Protein Levels Using Whole-Exome Sequencing and Mass Spectrometry.
| S-EPMC6301071 | biostudies-literature

Whole-exome sequence analysis of anthropometric traits illustrates challenges in identifying effects of rare genetic variants.
| S-EPMC9772568 | biostudies-literature