Dataset Information

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.


ABSTRACT: ImmuneDB is a system for storing and analyzing high-throughput immune receptor sequencing data. Unlike most existing tools, which use flat files, ImmuneDB stores data in a well-structured MySQL database, enabling efficient data queries. It can take raw sequencing data as input and annotate receptor gene usage, infer clonotypes, aggregate results, and run common downstream analyses such as calculating selection pressure and constructing clonal lineages. Alternatively, pre-annotated data can be imported, and analyzed data can be exported in a variety of common Adaptive Immune Receptor Repertoire (AIRR) file formats. To validate ImmuneDB, we compare its results to those of another pipeline, MiXCR. We show that the biological conclusions drawn would be similar with either tool, while ImmuneDB provides the additional benefits of integrating other common tools and storing data in a database. ImmuneDB is freely available on GitHub at https://github.com/arosenfeld/immunedb, on PyPI at https://pypi.org/project/ImmuneDB, and as a Docker container at https://hub.docker.com/r/arosenfeld/immunedb. Full documentation is available at http://immunedb.com.

SUBMITTER: Rosenfeld AM 

PROVIDER: S-EPMC6161679 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

Publications

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.

Rosenfeld AM, Meng W, Luning Prak ET, Hershberg U

Frontiers in Immunology, 2018-09-21



Similar Datasets

| S-EPMC5637252 | biostudies-literature
| S-EPMC11097599 | biostudies-literature
| S-EPMC7703782 | biostudies-literature
| S-EPMC8576399 | biostudies-literature
| S-EPMC3074166 | biostudies-literature
| S-EPMC7416706 | biostudies-literature
| S-EPMC7213749 | biostudies-literature
| S-EPMC3367215 | biostudies-literature
| S-EPMC5509648 | biostudies-literature
| S-EPMC6297828 | biostudies-literature