Dataset Information

A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies.

ABSTRACT: Analysis of de novo mutations (DNMs) from sequencing data of nuclear families has identified risk genes for many complex diseases, including multiple neurodevelopmental and psychiatric disorders. Most of these efforts have focused on mutations in protein-coding sequences. Evidence from genome-wide association studies (GWASs) strongly suggests that variants important to human diseases often lie in non-coding regions. Extending DNM-based approaches to non-coding sequences is challenging, however, because the functional significance of non-coding mutations is difficult to predict. We propose a statistical framework for analyzing DNMs from whole-genome sequencing (WGS) data. This method, TADA-Annotations (TADA-A), is a major advance of the TADA method we developed earlier for DNM analysis in coding regions. TADA-A is able to incorporate many functional annotations such as conservation and enhancer marks, to learn from data which annotations are informative of pathogenic mutations, and to combine both coding and non-coding mutations at the gene level to detect risk genes. It also supports meta-analysis of multiple DNM studies, while adjusting for study-specific technical effects. We applied TADA-A to WGS data of ∼300 autism-affected family trios across five studies and discovered several autism risk genes. The software is freely available for all research uses.

SUBMITTER: Liu Y

PROVIDER: S-EPMC5992125 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies.

Liu Yuwen Y Liang Yanyu Y Cicek A Ercument AE Li Zhongshan Z Li Jinchen J Muhle Rebecca A RA Krenzer Martina M Mei Yue Y Wang Yan Y Knoblauch Nicholas N Morrison Jean J Zhao Siming S Jiang Yi Y Geller Evan E Ionita-Laza Iuliana I Wu Jinyu J Xia Kun K Noonan James P JP Sun Zhong Sheng ZS He Xin X

American journal of human genetics 20180510 6

Analysis of de novo mutations (DNMs) from sequencing data of nuclear families has identified risk genes for many complex diseases, including multiple neurodevelopmental and psychiatric disorders. Most of these efforts have focused on mutations in protein-coding sequences. Evidence from genome-wide association studies (GWASs) strongly suggests that variants important to human diseases often lie in non-coding regions. Extending DNM-based approaches to non-coding sequences is challenging, however, ...[more]

PMID: 29754769

Dataset Information

A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies.

Publications

A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Whole-exome sequencing for finding de novo mutations in sporadic mental retardation.
| S-EPMC3046476 | biostudies-literature

De novo mutations discovered in 8 Mexican American families through whole genome sequencing.
| S-EPMC4143763 | biostudies-literature

De novo mutations revealed by whole-exome sequencing are strongly associated with autism.
| S-EPMC3667984 | biostudies-literature

Whole-genome sequencing reveals de-novo mutations associated with nonsyndromic cleft lip/palate.
| S-EPMC9273634 | biostudies-literature

A case-control collapsing analysis identifies epilepsy genes implicated in trio sequencing studies focused on de novo mutations.
| S-EPMC5724893 | biostudies-literature

A framework for the detection of de novo mutations in family-based sequencing data.
| S-EPMC5255947 | biostudies-literature

Whole exome sequencing identifies de novo mutations in GATA6 associated with congenital diaphragmatic hernia.
| S-EPMC3955383 | biostudies-literature

De novo mutations identified by whole-genome sequencing implicate chromatin modifications in obsessive-compulsive disorder.
| S-EPMC8754407 | biostudies-literature

mTADA is a framework for identifying risk genes from de novo mutations in multiple traits.
| S-EPMC7287090 | biostudies-literature

Whole-Exome Sequencing in Family Trios Reveals De Novo Mutations Associated with Type 1 Diabetes Mellitus.
| S-EPMC10044903 | biostudies-literature