Dataset Information

Knowledge database assisted gene marker selection for chronic lymphocytic leukemia.


ABSTRACT:

Objective: To investigate whether previously curated chronic lymphocytic leukemia (CLL) risk genes could be leveraged in gene marker selection for the diagnosis and prediction of CLL.

Methods: A CLL genetic database (CLL_042017) was developed through a comprehensive analysis of CLL-gene relation data, in which 753 CLL target genes were curated. Expression values for these genes were used for case-control classification of four CLL datasets, with a sparse representation-based variable selection (SRVS) approach employed for feature (gene) selection. Results were compared with those obtained using analysis of variance (ANOVA)-based gene selection.

Results: For each of the four datasets, SRVS selected a subset of the 753 CLL target genes that yielded significantly higher classification accuracy (100%, 100%, 93.94%, and 89.39%) than randomly selected genes. SRVS also outperformed ANOVA in terms of classification accuracy.

Conclusion: Gene markers selected from the 753 CLL genes could enable significantly greater accuracy in the prediction of CLL. SRVS provides an effective method for gene marker selection.
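
The abstract names ANOVA-based gene selection as the baseline against which SRVS was compared. As an illustration only, a minimal Python sketch of that kind of baseline, ranking genes by a one-way ANOVA F statistic between cases and controls, might look like the following. The function names, the toy gene names, and the synthetic expression values are all assumptions made for this example; they are not taken from the study, and SRVS itself is not implemented here.

```python
import random
from statistics import mean


def anova_f(cases, controls):
    """One-way ANOVA F statistic for a single gene across two groups."""
    groups = [cases, controls]
    n = sum(len(g) for g in groups)
    grand = mean(cases + controls)
    # Between-group sum of squares (degrees of freedom: 2 groups - 1 = 1).
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    # Within-group sum of squares (degrees of freedom: n - 2).
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / 1) / (ss_within / (n - 2))


def top_k_genes(expr, labels, k):
    """Rank genes by F statistic and return the k highest-scoring names.

    expr   -- dict mapping gene name to a list of expression values,
              one per sample (hypothetical layout for this sketch)
    labels -- list with 1 for a case and 0 for a control, same order
    """
    scores = {}
    for gene, values in expr.items():
        cases = [v for v, y in zip(values, labels) if y == 1]
        controls = [v for v, y in zip(values, labels) if y == 0]
        scores[gene] = anova_f(cases, controls)
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Synthetic toy data (assumption): 20 noise genes, 10 cases, 10 controls.
rng = random.Random(0)
labels = [1] * 10 + [0] * 10
expr = {f"gene_{i}": [rng.gauss(0, 1) for _ in labels] for i in range(20)}
# Two genes are given a strong case/control shift so they should rank first.
expr["gene_3"] = [rng.gauss(5 * y, 1) for y in labels]
expr["gene_7"] = [rng.gauss(-5 * y, 1) for y in labels]

selected = top_k_genes(expr, labels, 2)
print(selected)
```

In a real analysis the selected genes would then be passed to a classifier and evaluated with cross-validation; that step is omitted here because the record does not state which classifier or validation scheme was used.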

SUBMITTER: Xiang X 

PROVIDER: S-EPMC6134680 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

Publications

Knowledge database assisted gene marker selection for chronic lymphocytic leukemia.

Xixi Xiang, Yu-Ping Wang, Hongbao Cao, Xi Zhang

The Journal of International Medical Research, 2018-07-12, issue 8


Similar Datasets

| S-EPMC4831394 | biostudies-other
| S-EPMC2891437 | biostudies-literature
| S-EPMC3791640 | biostudies-literature
| S-EPMC9264813 | biostudies-literature
| S-EPMC3146619 | biostudies-literature
2011-06-19 | E-GEOD-22858 | biostudies-arrayexpress
| S-EPMC4449150 | biostudies-other
| S-EPMC5525469 | biostudies-other