Machine learning-based detection of immune-mediated diseases from genome-wide cell-free DNA sequencing datasets
Ontology highlight
ABSTRACT: The early detection of tissue and organ damage associated with autoimmune diseases (AID) has been identified as key to improve long-term survival, but non-invasive biomarkers are lacking. Elevated cell-free DNA (cfDNA) levels have been observed in AID and inflammatory bowel disease (IBD), prompting interest to use cfDNA as a potential non-invasive diagnostic and prognostic biomarker. Despite these known disease-related changes in concentration, it remains impossible to identify AID and IBD patients through cfDNA analysis alone. By using unsupervised clustering on large sets of shallow whole-genome sequencing (sWGS) cfDNA data, we uncover AID- and IBD-specific genome-wide patterns in plasma cfDNA in both the obstetric and general AID and IBD populations. Supervised learning of the genome-wide patterns allows AID prediction with 50% sensitivity at 95% specificity. Importantly, the method can identify pregnant women with AID during routine non-invasive prenatal screening. Since AID pregnancies have an increased risk of severe complications, early recognition or detection of new onset AID can redirect pregnancy management and limit potential adverse events. This method opens up new avenues for screening, diagnosis and monitoring of AID and IBD.
INSTRUMENT(S): Illumina HiSeq 4000
ORGANISM(S): Homo sapiens
SUBMITTER: Huiwen Che
PROVIDER: E-MTAB-11607 | biostudies-arrayexpress |
REPOSITORIES: biostudies-arrayexpress
ACCESS DATA