Dataset Information

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.

ABSTRACT: ImmuneDB is a system for storing and analyzing high-throughput immune receptor sequencing data. Unlike most existing tools, which utilize flat-files, ImmuneDB stores data in a well-structured MySQL database, enabling efficient data queries. It can take raw sequencing data as input and annotate receptor gene usage, infer clonotypes, aggregate results, and run common downstream analyses such as calculating selection pressure and constructing clonal lineages. Alternatively, pre-annotated data can be imported and analyzed data can be exported in a variety of common Adaptive Immune Receptor Repertoire (AIRR) file formats. To validate ImmuneDB, we compare its results to those of another pipeline, MiXCR. We show that the biological conclusions drawn would be similar with either tool, while ImmuneDB provides the additional benefits of integrating other common tools and storing data in a database. ImmuneDB is freely available on GitHub at https://github.com/arosenfeld/immunedb, on PyPi at https://pypi.org/project/ImmuneDB, and a Docker container is provided at https://hub.docker.com/r/arosenfeld/immunedb. Full documentation is available at http://immunedb.com.

SUBMITTER: Rosenfeld AM

PROVIDER: S-EPMC6161679 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.

Rosenfeld Aaron M AM Meng Wenzhao W Luning Prak Eline T ET Hershberg Uri U

Frontiers in immunology 20180921

ImmuneDB is a system for storing and analyzing high-throughput immune receptor sequencing data. Unlike most existing tools, which utilize flat-files, ImmuneDB stores data in a well-structured MySQL database, enabling efficient data queries. It can take raw sequencing data as input and annotate receptor gene usage, infer clonotypes, aggregate results, and run common downstream analyses such as calculating selection pressure and constructing clonal lineages. Alternatively, pre-annotated data can b ...[more]

PMID: 30298069

Dataset Information

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.

Publications

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data.
| S-EPMC5637252 | biostudies-literature

Adaptive Immune Receptor Repertoire Community recommendations for sharing immune-repertoire sequencing data.
| S-EPMC5790180 | biostudies-literature

Guidelines for reproducible analysis of adaptive immune receptor repertoire sequencing data.
| S-EPMC11097599 | biostudies-literature

Anchor Clustering for million-scale immune repertoire sequencing data.
| S-EPMC10809746 | biostudies-literature

ImSpectR - R package to quantify immune repertoire diversity in spectratype and repertoire sequencing data.
| S-EPMC7703782 | biostudies-literature

Ultrasensitive allele inference from immune repertoire sequencing data with MiXCR.
| S-EPMC11694755 | biostudies-literature

Novel Allele Detection Tool Benchmark and Application With Antibody Repertoire Sequencing Dataset.
| S-EPMC8576399 | biostudies-literature

Novel Ensemble Feature Selection Approach and Application in Repertoire Sequencing Data.
| S-EPMC9086194 | biostudies-literature

A novel compression tool for efficient storage of genome resequencing data.
| S-EPMC3074166 | biostudies-literature

IGHV allele similarity clustering improves genotype inference from adaptive immune receptor repertoire sequencing data.
| S-EPMC10484671 | biostudies-literature