Dataset Information

VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data.

ABSTRACT: Pre-processing of high-throughput sequencing data for immune repertoire profiling is essential to insure high quality input for downstream analysis. VDJPipe is a flexible, high-performance tool that can perform multiple pre-processing tasks with just a single pass over the data files.Processing tasks provided by VDJPipe include base composition statistics calculation, read quality statistics calculation, quality filtering, homopolymer filtering, length and nucleotide filtering, paired-read merging, barcode demultiplexing, 5' and 3' PCR primer matching, and duplicate reads collapsing. VDJPipe utilizes a pipeline approach whereby multiple processing steps are performed in a sequential workflow, with the output of each step passed as input to the next step automatically. The workflow is flexible enough to handle the complex barcoding schemes used in many immunosequencing experiments. Because VDJPipe is designed for computational efficiency, we evaluated this by comparing execution times with those of pRESTO, a widely-used pre-processing tool for immune repertoire sequencing data. We found that VDJPipe requires <10% of the run time required by pRESTO.VDJPipe is a high-performance tool that is optimized for pre-processing large immune repertoire sequencing data sets.

SUBMITTER: Christley S

PROVIDER: S-EPMC5637252 | biostudies-literature | 2017 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data.

Christley Scott S Levin Mikhail K MK Toby Inimary T IT Fonner John M JM Monson Nancy L NL Rounds William H WH Rubelt Florian F Scarborough Walter W Scheuermann Richard H RH Cowell Lindsay G LG

BMC bioinformatics 20171011 1

<h4>Background</h4>Pre-processing of high-throughput sequencing data for immune repertoire profiling is essential to insure high quality input for downstream analysis. VDJPipe is a flexible, high-performance tool that can perform multiple pre-processing tasks with just a single pass over the data files.<h4>Results</h4>Processing tasks provided by VDJPipe include base composition statistics calculation, read quality statistics calculation, quality filtering, homopolymer filtering, length and nucl ...[more]

PMID: 29020925

Dataset Information

VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data.

Publications

VDJPipe: a pipelined tool for pre-processing immune repertoire sequencing data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Adaptive Immune Receptor Repertoire Community recommendations for sharing immune-repertoire sequencing data.
| S-EPMC5790180 | biostudies-literature

ImmuneDB, a Novel Tool for the Analysis, Storage, and Dissemination of Immune Repertoire Sequencing Data.
| S-EPMC6161679 | biostudies-literature

Anchor Clustering for million-scale immune repertoire sequencing data.
| S-EPMC10809746 | biostudies-literature

ImSpectR - R package to quantify immune repertoire diversity in spectratype and repertoire sequencing data.
| S-EPMC7703782 | biostudies-literature

Ultrasensitive allele inference from immune repertoire sequencing data with MiXCR.
| S-EPMC11694755 | biostudies-literature

Guidelines for reproducible analysis of adaptive immune receptor repertoire sequencing data.
| S-EPMC11097599 | biostudies-literature

Characterizing pre-transplant and post-transplant kidney rejection risk by B cell immune repertoire sequencing.
| S-EPMC6479061 | biostudies-literature

mzRAPP: a tool for reliability assessment of data pre-processing in non-targeted metabolomics.
| S-EPMC8545297 | biostudies-literature

IGHV allele similarity clustering improves genotype inference from adaptive immune receptor repertoire sequencing data.
| S-EPMC10484671 | biostudies-literature

Inferring the immune response from repertoire sequencing.
| S-EPMC7213749 | biostudies-literature