Unknown

Dataset Information

0

GOLD standard dataset for Alzheimer genes.


ABSTRACT: Alzheimer disease is a genetically complex multigenic neurodegenerative disorder, resulting from the interaction between multiple genes. Most of the earlier studies reported only few specific genes that have involvement in Alzheimer. However more than hundreds of susceptible genes have been observed, that have significant role in the development and progression of Alzheimer. Among all the existing data resources, Genetic association database is the most popular data source that contains information about genes, their association classes into positive, negative and neutral class and supporting reference. However, it contains lot of false positives and negatives associations. We have taken this data as reference and performed the double fold cross validation to compile the comprehensive list of Alzheimer genes, their association class viz, positive, negative or ambiguous with the disease and reference sentence confirming the association. The data generated will be used as a GOLD standard reference data set for the training of machine learning classifier to predict the classification of published literature not only in Alzheimer but in other diseases as well. In addition, positive associated genes data can also be used for the system level modelling or meta analysis of Alzheimer.

SUBMITTER: Raj S 

PROVIDER: S-EPMC7176823 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

GOLD standard dataset for Alzheimer genes.

Raj Sushrutha S   Vishnoi Anchal A   Srivastava Alok A  

Data in brief 20200401


Alzheimer disease is a genetically complex multigenic neurodegenerative disorder, resulting from the interaction between multiple genes. Most of the earlier studies reported only few specific genes that have involvement in Alzheimer. However more than hundreds of susceptible genes have been observed, that have significant role in the development and progression of Alzheimer. Among all the existing data resources, Genetic association database is the most popular data source that contains informat  ...[more]

Similar Datasets

2015-05-08 | PXD000792 | Pride
| S-EPMC8609765 | biostudies-literature
| S-EPMC9290855 | biostudies-literature
2017-08-10 | PXD003236 | Pride
| S-EPMC10850109 | biostudies-literature
| S-EPMC10315428 | biostudies-literature
| S-EPMC8535810 | biostudies-literature
| S-EPMC7467160 | biostudies-literature
2023-01-16 | GSE222355 | GEO
| S-EPMC4550898 | biostudies-other