Unknown

Dataset Information

0

A somatic hypermutation-based machine learning model stratifies individuals with Crohn's disease and controls.


ABSTRACT: Crohn's disease (CD) is a chronic relapsing-remitting inflammatory disorder of the gastrointestinal tract that is characterized by altered innate and adaptive immune function. Although massively parallel sequencing studies of the T cell receptor repertoire identified oligoclonal expansion of unique clones, much less is known about the B cell receptor (BCR) repertoire in CD. Here, we present a novel BCR repertoire sequencing data set from ileal biopsies from pediatric patients with CD and controls, and identify CD-specific somatic hypermutation (SHM) patterns, revealed by a machine learning (ML) algorithm trained on BCR repertoire sequences. Moreover, ML classification of a different data set from blood samples of adults with CD versus controls identified that V gene usage, clusters, or mutation frequencies yielded excellent results in classifying the disease (F1 > 90%). In summary, we show that an ML algorithm enables the classification of CD based on unique BCR repertoire features with high accuracy.

SUBMITTER: Safra M 

PROVIDER: S-EPMC9977146 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

A somatic hypermutation-based machine learning model stratifies individuals with Crohn's disease and controls.

Safra Modi M   Werner Lael L   Peres Ayelet A   Polak Pazit P   Salamon Naomi N   Schvimer Michael M   Weiss Batia B   Barshack Iris I   Shouval Dror S DS   Yaari Gur G  

Genome research 20221216 1


Crohn's disease (CD) is a chronic relapsing-remitting inflammatory disorder of the gastrointestinal tract that is characterized by altered innate and adaptive immune function. Although massively parallel sequencing studies of the T cell receptor repertoire identified oligoclonal expansion of unique clones, much less is known about the B cell receptor (BCR) repertoire in CD. Here, we present a novel BCR repertoire sequencing data set from ileal biopsies from pediatric patients with CD and control  ...[more]

Similar Datasets

| S-EPMC8006302 | biostudies-literature
| S-EPMC8589399 | biostudies-literature
| S-EPMC8749460 | biostudies-literature
| S-EPMC9819919 | biostudies-literature
| S-EPMC8700628 | biostudies-literature
| S-EPMC3727322 | biostudies-literature
| S-EPMC10486516 | biostudies-literature
| S-EPMC10076664 | biostudies-literature
| S-EPMC8781386 | biostudies-literature
| S-EPMC6312024 | biostudies-literature