Dataset Information

Genetic prediction in the Genetic Analysis Workshop 18 sequencing data.

ABSTRACT: High-throughput sequencing data can be used to predict phenotypes from genotypes, and this corresponds to establishing a prognostic model. In extended pedigrees the relatedness of subjects provides additional information so that genetic values, fixed or random genetic components, and heritability can be estimated. At the Genetic Analysis Workshop 18, the working group on genetic prediction dealt with both establishing a prognostic model and, in one contribution, comparing standard logistic regression with robust logistic regression in a sample of unrelated affected or unaffected individuals. Results of both logistic regression approaches were similar. All other contributions to this group used extended family data, in general using the quantitative trait blood pressure. The individual contributions varied in several important aspects, such as the estimation of the kinship matrix and the estimation method. Contributors chose various approaches for model validation, including different versions of cross-validation or within-family validation. Within-family validation included model building in the upper generations and validation in later generations. The choice of the statistical model and the computational algorithm had substantial effects on computation time. If decorrelation approaches were applied, the computational burden was substantially reduced. Some software packages estimated negative eigenvalues, although eigenvalues of correlation matrices should be non-negative. Most statistical models and software packages have been developed for experimental crosses and planned breeding programs. With their specialized pedigree structures, they are not sufficiently flexible to accommodate the variability of human pedigrees in general, and improved implementations are required.
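
The remark on decorrelation and non-negative eigenvalues can be illustrated with a minimal sketch. It is not taken from any of the workshop contributions. It assumes a linear mixed model in which the phenotype covariance is s2_g * K + s2_e * I, with K a symmetric relationship matrix (for example twice the kinship matrix). The function name `decorrelate`, the tolerance `tol`, and the toy pedigree values are hypothetical and chosen only for illustration.

```python
import numpy as np


def decorrelate(y, X, K, tol=1e-8):
    """Rotate phenotype y and covariates X by the eigenvectors of K.

    K is assumed to be a symmetric relationship matrix. If
    Cov(y) = s2_g * K + s2_e * I, the rotated phenotype U.T @ y has
    the diagonal covariance s2_g * diag(d) + s2_e * I, so the rotated
    observations can be treated as uncorrelated.
    """
    K = (K + K.T) / 2.0            # remove floating-point asymmetry
    d, U = np.linalg.eigh(K)       # eigenvalues in ascending order
    if d[0] < -tol:
        # A clearly negative eigenvalue means K is not a valid
        # covariance-type matrix; stop instead of hiding the problem.
        raise ValueError(f"K is not positive semidefinite: min eigenvalue {d[0]:.3g}")
    d = np.clip(d, 0.0, None)      # treat tiny negative round-off as zero
    return U.T @ y, U.T @ X, d, U


# Toy relationship matrix: two unrelated parents and their two children.
K = np.array([
    [1.0, 0.0, 0.5, 0.5],
    [0.0, 1.0, 0.5, 0.5],
    [0.5, 0.5, 1.0, 0.5],
    [0.5, 0.5, 0.5, 1.0],
])
y = np.array([120.0, 135.0, 128.0, 131.0])  # made-up blood pressure values
X = np.ones((4, 1))                         # intercept only
y_rot, X_rot, d, U = decorrelate(y, X, K)
```

After the rotation, variance components can be estimated from independent observations with variances s2_g * d[i] + s2_e, which avoids repeatedly inverting the full covariance matrix. Raising an error for clearly negative eigenvalues, rather than clipping them silently, is a design choice made here so that an invalid relationship matrix is noticed.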

SUBMITTER: Ziegler A

PROVIDER: S-EPMC4310867 | biostudies-other | 2014 Sep

REPOSITORIES: biostudies-other

Publications

Genetic prediction in the Genetic Analysis Workshop 18 sequencing data.

Ziegler, Andreas; Bohossian, Nora; Diego, Vincent P.; Yao, Chen

Genetic Epidemiology, 2014 Sep 1

PMID: 25112190

Similar Datasets

Analysis of Genetic Analysis Workshop 18 data with gene-based penalized regression. | S-EPMC4143805 | biostudies-literature

Genetic association analysis using weighted false discovery rate approach on Genetic Analysis Workshop 18 data. | S-EPMC4143671 | biostudies-literature

Genetic association analysis for common variants in the Genetic Analysis Workshop 18 data: a Dirichlet regression approach. | S-EPMC4143809 | biostudies-literature

Accounting for relatedness in family-based association studies: application to Genetic Analysis Workshop 18 data. | S-EPMC4143672 | biostudies-literature

Using a Bayesian latent variable approach to detect pleiotropy in the Genetic Analysis Workshop 18 data. | S-EPMC4143687 | biostudies-literature

Evaluation of gene-based association tests for analyzing rare variants using Genetic Analysis Workshop 18 data. | S-EPMC4143759 | biostudies-literature

Combined linkage and family-based association analysis improves candidate gene detection in Genetic Analysis Workshop 18 simulation data. | S-EPMC4143774 | biostudies-literature

Gaussian graphical models for phenotypes using pedigree data and exploratory analysis using networks with genetic and nongenetic factors based on Genetic Analysis Workshop 18 data. | S-EPMC4143694 | biostudies-literature

Data for Genetic Analysis Workshop 18: human whole genome sequence, blood pressure, and simulated phenotypes in extended pedigrees. | S-EPMC4145406 | biostudies-literature

Entropy-based method for assessing the influence of genetic markers and covariates on hypertension: application to Genetic Analysis Workshop 18 data. | S-EPMC4143731 | biostudies-literature