
Dataset Information


A Haystack Heuristic for Autoimmune Disease Biomarker Discovery Using Next-Gen Immune Repertoire Sequencing Data.


ABSTRACT: Large-scale DNA sequencing of immunological repertoires offers an opportunity for the discovery of novel biomarkers for autoimmune disease. Available bioinformatics techniques, however, are not adequately suited for elucidating possible biomarker candidates from within large immunosequencing datasets due to unsatisfactory scalability and sensitivity. Here, we present the Haystack Heuristic, an algorithm customized to computationally extract disease-associated motifs from next-generation-sequenced repertoires by contrasting disease and healthy subjects. This technique employs a local-search graph-theory approach to discover novel motifs in patient data. We apply the Haystack Heuristic to nine million B-cell receptor sequences obtained from nearly 100 individuals in order to elucidate a new motif that is significantly associated with multiple sclerosis. Our results demonstrate the effectiveness of the Haystack Heuristic in computing possible biomarker candidates from high-throughput sequencing data and could be generalized to other datasets.

SUBMITTER: Apeltsin L 

PROVIDER: S-EPMC5509648 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature


Publications

A Haystack Heuristic for Autoimmune Disease Biomarker Discovery Using Next-Gen Immune Repertoire Sequencing Data.

Leonard Apeltsin, Shengzhi Wang, H-Christian von Büdingen, Marina Sirota

Scientific Reports, 2017-07-13 (1)



Similar Datasets

| S-EPMC4011907 | biostudies-other
2017-10-04 | PXD006484 | Pride
| S-EPMC5650670 | biostudies-literature
| S-EPMC4225161 | biostudies-literature
| S-EPMC5808239 | biostudies-literature
| S-EPMC10741845 | biostudies-literature
| S-EPMC3630001 | biostudies-literature
| S-EPMC6925417 | biostudies-literature
| S-EPMC4487992 | biostudies-literature
| S-EPMC3933208 | biostudies-other