Unknown

Dataset Information

0

Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma.


ABSTRACT:

Background

DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs.

Methods

We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using one discovery and two validation cohorts. To fully capture the robust signals of WGS data from the complete genome, we developed a machine learning-based statistical model that is focused on detection accuracy in early-stage HCC.

Findings

We built the model using a discovery cohort of 209 patients, achieving an overall area under curve (AUC) of 0.893, with 0.874 for early-stage (Barcelona clinical liver cancer [BCLC] stage 0-A) and 0.933 for advanced-stage (BCLC stage B-D). The performance of the model was then assessed in two validation cohorts (76 and 99 patients) that only consisted of patients with stage 0-A HCC. Our model exhibited a robust predictive performance, with an AUC of 0.920 and 0.812 for the two validation cohorts. Further analyses showed the impact of tumor sample heterogeneity in model training on detecting early-stage tumors, and a refined model addressing the heterogeneity in the discovery cohort significantly increased model performance in validation.

Interpretation

We developed an SCNA-based, machine learning-driven model in the non-invasive detection of early-stage HCC in HBV patients and demonstrated its performance through strict independent validations.

SUBMITTER: Tao K 

PROVIDER: S-EPMC7276513 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma.

Tao Kaishan K   Bian Zhenyuan Z   Zhang Qiong Q   Guo Xu X   Yin Chun C   Wang Yang Y   Zhou Kaixiang K   Wan Shaogui S   Shi Meifang M   Bao Dengke D   Yang Chuhu C   Xing Jinliang J  

EBioMedicine 20200605


<h4>Background</h4>DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs.<h4>Methods</h4>We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using  ...[more]

Similar Datasets

| S-EPMC7331695 | biostudies-literature
| S-ECPF-GEOD-37382 | biostudies-other
| S-EPMC5518683 | biostudies-literature
| S-EPMC3827077 | biostudies-literature
| S-EPMC4527302 | biostudies-literature
| S-EPMC5474888 | biostudies-literature
| S-EPMC5528970 | biostudies-other
| S-EPMC2796813 | biostudies-literature
| S-ECPF-GEOD-23056 | biostudies-other
2012-07-27 | GSE37385 | GEO