Unknown

Dataset Information

0

Combining Random Forests and a Signal Detection Method Leads to the Robust Detection of Genotype-Phenotype Associations.


ABSTRACT: Genome wide association studies (GWAS) are a well established methodology to identify genomic variants and genes that are responsible for traits of interest in all branches of the life sciences. Despite the long time this methodology has had to mature the reliable detection of genotype-phenotype associations is still a challenge for many quantitative traits mainly because of the large number of genomic loci with weak individual effects on the trait under investigation. Thus, it can be hypothesized that many genomic variants that have a small, however real, effect remain unnoticed in many GWAS approaches. Here, we propose a two-step procedure to address this problem. In a first step, cubic splines are fitted to the test statistic values and genomic regions with spline-peaks that are higher than expected by chance are considered as quantitative trait loci (QTL). Then the SNPs in these QTLs are prioritized with respect to the strength of their association with the phenotype using a Random Forests approach. As a case study, we apply our procedure to real data sets and find trustworthy numbers of, partially novel, genomic variants and genes involved in various egg quality traits.

SUBMITTER: Ramzan F 

PROVIDER: S-EPMC7465705 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combining Random Forests and a Signal Detection Method Leads to the Robust Detection of Genotype-Phenotype Associations.

Ramzan Faisal F   Gültas Mehmet M   Bertram Hendrik H   Cavero David D   Schmitt Armin Otto AO  

Genes 20200805 8


Genome wide association studies (GWAS) are a well established methodology to identify genomic variants and genes that are responsible for traits of interest in all branches of the life sciences. Despite the long time this methodology has had to mature the reliable detection of genotype-phenotype associations is still a challenge for many quantitative traits mainly because of the large number of genomic loci with weak individual effects on the trait under investigation. Thus, it can be hypothesiz  ...[more]

Similar Datasets

2019-08-27 | GSE125279 | GEO
| S-EPMC2850440 | biostudies-literature
| PRJNA1047225 | ENA
| PRJNA1047206 | ENA
2023-12-01 | GSE229783 | GEO
| S-EPMC3312209 | biostudies-literature
| S-EPMC3463421 | biostudies-literature
| S-EPMC9302835 | biostudies-literature
| S-EPMC6785219 | biostudies-literature
| S-EPMC5048068 | biostudies-literature