Transcriptomics

Dataset Information

0

Machine-learning model based on glycosyltransferase expression predicts multiple cancer types, subtypes and survival probability


ABSTRACT: Whereas current tools for diagnosing cancer focus on identifying markers in limited types of the disease, such as the PAM50 kit known primarily for subtyping breast cancer, our research reveals that 27 different cancers have a unique glycosyltransferase (GT) fingerprint that can be detected using a single test, with 93.7% accuracy in external validation. Four models were built on the expression patterns of 71 GTs that are universally influential in inducing the formation of cancer cells and giving them the impetus to metastasize. The first differentiates cancer tissue from normal tissue, the second distinguishes cancer types, and the third and fourth models differentiate between subtypes of cancer, in particular breast cancer and glioma. Data from the Cancer Genome Atlas database were used for deriving the models, while independent publicly available databases and newly collected patient samples were used for validation. Running these models against external databases showed just how powerful they are, producing results that were highly comparable to those achieved in internal testing. Furthermore, the breast cancer classifier differentiated between breast cancer subtypes with an average of 81% accuracy in external testing, far surpassing the industry-standard PAM50’s accuracy ratio of 43%. Meanwhile, a separate prognostic model designed to predict the probability of survival among glioma patients achieved high accuracy by zooming in on four GT genes that are strongly related to prognosis. Taken together, the models demonstrate the revolutionary potential of focusing on GT genes to simplify cancer diagnostics.

ORGANISM(S): Homo sapiens

PROVIDER: GSE254461 | GEO | 2024/02/03

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2019-07-02 | GSE122630 | GEO
2013-10-10 | GSE41685 | GEO
2014-08-22 | E-GEOD-55070 | biostudies-arrayexpress
2014-05-30 | E-GEOD-58102 | biostudies-arrayexpress
2021-08-07 | GSE181548 | GEO
2021-08-07 | GSE181466 | GEO
2011-10-22 | E-GEOD-32530 | biostudies-arrayexpress
2011-10-22 | E-GEOD-32531 | biostudies-arrayexpress
2022-02-22 | PXD024322 | Pride
2011-05-10 | E-GEOD-26082 | biostudies-arrayexpress