Unknown

Dataset Information

0

Gene Expression Analysis for Uterine Cervix and Corpus Cancer Characterization.


ABSTRACT: The analysis of gene expression quantification data is a powerful and widely used approach in cancer research. This work provides new insights into the transcriptomic changes that occur in healthy uterine tissue compared to those in cancerous tissues and explores the differences associated with uterine cancer localizations and histological subtypes. To achieve this, RNA-Seq data from the TCGA database were preprocessed and analyzed using the KnowSeq package. Firstly, a kNN model was applied to classify uterine cervix cancer, uterine corpus cancer, and healthy uterine samples. Through variable selection, a three-gene signature was identified (VWCE, CLDN15, ADCYAP1R1), achieving consistent 100% test accuracy across 20 repetitions of a 5-fold cross-validation. A supplementary similar analysis using miRNA-Seq data from the same samples identified an optimal two-gene miRNA-coding signature potentially regulating the three-gene signature previously mentioned, which attained optimal classification performance with an 82% F1-macro score. Subsequently, a kNN model was implemented for the classification of cervical cancer samples into their two main histological subtypes (adenocarcinoma and squamous cell carcinoma). A uni-gene signature (ICA1L) was identified, achieving 100% test accuracy through 20 repetitions of a 5-fold cross-validation and externally validated through the CGCI program. Finally, an examination of six cervical adenosquamous carcinoma (mixed) samples revealed a pattern where the gene expression value in the mixed class aligned closer to the histological subtype with lower expression, prompting a reconsideration of the diagnosis for these mixed samples. In summary, this study provides valuable insights into the molecular mechanisms of uterine cervix and corpus cancers. The newly identified gene signatures demonstrate robust predictive capabilities, guiding future research in cancer diagnosis and treatment methodologies.

SUBMITTER: Almorox L 

PROVIDER: S-EPMC10970626 | biostudies-literature | 2024 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Gene Expression Analysis for Uterine Cervix and Corpus Cancer Characterization.

Almorox Lucía L   Antequera Laura L   Rojas Ignacio I   Herrera Luis Javier LJ   Ortuño Francisco M FM  

Genes 20240228 3


The analysis of gene expression quantification data is a powerful and widely used approach in cancer research. This work provides new insights into the transcriptomic changes that occur in healthy uterine tissue compared to those in cancerous tissues and explores the differences associated with uterine cancer localizations and histological subtypes. To achieve this, RNA-Seq data from the TCGA database were preprocessed and analyzed using the KnowSeq package. Firstly, a kNN model was applied to c  ...[more]

Similar Datasets

| S-EPMC2510814 | biostudies-literature
| S-EPMC9905715 | biostudies-literature
| S-EPMC1475650 | biostudies-literature
| S-EPMC6607970 | biostudies-literature
2012-02-23 | GSE30759 | GEO
| S-EPMC5930686 | biostudies-literature
| S-EPMC6540175 | biostudies-literature
| S-EPMC1154117 | biostudies-other
2012-02-23 | E-GEOD-30759 | biostudies-arrayexpress
| S-EPMC4239392 | biostudies-literature