Unknown

Dataset Information

0

Large-scale network analysis reveals the sequence space architecture of antibody repertoires.


ABSTRACT: The architecture of mouse and human antibody repertoires is defined by the sequence similarity networks of the clones that compose them. The major principles that define the architecture of antibody repertoires have remained largely unknown. Here, we establish a high-performance computing platform to construct large-scale networks from comprehensive human and murine antibody repertoire sequencing datasets (>100,000 unique sequences). Leveraging a network-based statistical framework, we identify three fundamental principles of antibody repertoire architecture: reproducibility, robustness and redundancy. Antibody repertoire networks are highly reproducible across individuals despite high antibody sequence dissimilarity. The architecture of antibody repertoires is robust to the removal of up to 50-90% of randomly selected clones, but fragile to the removal of public clones shared among individuals. Finally, repertoire architecture is intrinsically redundant. Our analysis provides guidelines for the large-scale network analysis of immune repertoires and may be used in the future to define disease-associated and synthetic repertoires.

SUBMITTER: Miho E 

PROVIDER: S-EPMC6428871 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large-scale network analysis reveals the sequence space architecture of antibody repertoires.

Miho Enkelejda E   Roškar Rok R   Greiff Victor V   Reddy Sai T ST  

Nature communications 20190321 1


The architecture of mouse and human antibody repertoires is defined by the sequence similarity networks of the clones that compose them. The major principles that define the architecture of antibody repertoires have remained largely unknown. Here, we establish a high-performance computing platform to construct large-scale networks from comprehensive human and murine antibody repertoire sequencing datasets (>100,000 unique sequences). Leveraging a network-based statistical framework, we identify  ...[more]

Similar Datasets

| S-EPMC4868480 | biostudies-literature
| S-EPMC2895651 | biostudies-literature
| S-EPMC6412061 | biostudies-literature
| S-EPMC2373757 | biostudies-literature
| S-EPMC5774755 | biostudies-literature
| S-EPMC7913176 | biostudies-literature
| S-EPMC2944327 | biostudies-literature
| S-EPMC7962209 | biostudies-literature
| S-EPMC7733803 | biostudies-literature
| S-EPMC8030225 | biostudies-literature