Dataset Information

Establishing a novel colorectal cancer predictive model based on unique gut microbial single nucleotide variant markers.


ABSTRACT: Current metagenomic species-based colorectal cancer (CRC) microbial biomarkers may confuse diagnosis because the genetic content of different microbial strains, even those belonging to the same species, can differ by 5% to 30%. Here, a total of 7549 non-redundant single nucleotide variants (SNVs) were annotated in 25 species from 3 CRC cohorts (n = 249). Then, 22 microbial SNV markers that contributed to distinguishing subjects with CRC from healthy subjects were identified by the random forest algorithm and used to construct a novel CRC predictive model. The predictive model showed high accuracy in both the training cohort (AUC = 75.35%) and the validation cohorts (AUC = 73.08%-88.02%). We further explored the specificity of these SNV markers in a broader background by performing a meta-analysis across 4 metabolic disease cohorts. Among these SNV markers, 3 SNVs that were enriched in CRC patients and located in the genomes of Eubacterium rectale and Faecalibacterium prausnitzii were CRC specific (AUC = 72.51%-94.07%).
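
NOTE: The AUC values quoted in the abstract are areas under the ROC curve. As a reading aid, the sketch below shows one common way to compute an AUC from per-subject classifier scores: the fraction of (case, control) pairs in which the case receives the higher score, with ties counted as half. The function name `roc_auc` and the toy labels and scores are illustrative assumptions; they are not code or data from the study, and the study's own pipeline may well have computed AUC differently.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative.

    labels: iterable of 0/1 (1 = case, 0 = control)
    scores: iterable of classifier scores, higher = more likely a case
    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (NOT from the study): three cases, three controls.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]

toy_auc = roc_auc(toy_labels, toy_scores)
print(f"AUC = {toy_auc:.2%}")
```

The pairwise loop is quadratic in the number of subjects, which is acceptable at the cohort sizes mentioned above; a rank-based formulation would be the usual choice for larger data.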

SUBMITTER: Ma C 

PROVIDER: S-EPMC7808391 | biostudies-literature | 2021 Jan-Dec

REPOSITORIES: biostudies-literature

Publications

Establishing a novel colorectal cancer predictive model based on unique gut microbial single nucleotide variant markers.

Ma Chenchen, Chen Kaining, Wang Yuanyuan, Cen Chaoping, Zhai Qixiao, Zhang Jiachao

Gut Microbes, 2021-01-01, issue 1


Similar Datasets

| S-EPMC3658526 | biostudies-literature
| S-EPMC10457972 | biostudies-literature
| S-EPMC5628696 | biostudies-literature
| S-EPMC6192032 | biostudies-literature
| S-EPMC4493402 | biostudies-literature
| S-EPMC2631215 | biostudies-other
| S-EPMC10914822 | biostudies-literature
| S-EPMC5998544 | biostudies-literature
| S-EPMC4887298 | biostudies-literature
| S-EPMC5609900 | biostudies-literature