Dataset Information

pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires.

ABSTRACT:

Driven by dramatic technological improvements, large-scale characterization of lymphocyte receptor repertoires via high-throughput sequencing is now feasible. Although promising, the high germline and somatic diversity, especially of B-cell immunoglobulin repertoires, presents challenges for analysis requiring the development of specialized computational pipelines. We developed the REpertoire Sequencing TOolkit (pRESTO) for processing reads from high-throughput lymphocyte receptor studies. pRESTO processes raw sequences to produce error-corrected, sorted and annotated sequence sets, along with a wealth of metrics at each step. The toolkit supports multiplexed primer pools, single- or paired-end reads and emerging technologies that use single-molecule identifiers. pRESTO has been tested on data generated from Roche and Illumina platforms. It has a built-in capacity to parallelize the work between available processors and is able to efficiently process millions of sequences generated by typical high-throughput projects.

Availability and implementation

pRESTO is freely available for academic use. The software package and detailed tutorials may be downloaded from http://clip.med.yale.edu/presto.

SUBMITTER: Vander Heiden JA

PROVIDER: S-EPMC4071206 | biostudies-literature | 2014 Jul

REPOSITORIES: biostudies-literature

Publications

pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires.

Vander Heiden JA, Yaari G, Uduman M, Stern JN, O'Connor KC, Hafler DA, Vigneault F, Kleinstein SH

Bioinformatics (Oxford, England), 2014-03-10, issue 13

PMID: 24618469

Similar Datasets

A comprehensive profiling of T- and B-lymphocyte receptor repertoires from a Chinese-origin rhesus macaque by high-throughput sequencing. | S-EPMC5559085 | biostudies-literature

High-throughput sequencing of immune repertoires in multiple sclerosis. | S-EPMC4818741 | biostudies-other

Bicyclus anynana RNA-Seq raw sequencing reads | 2015-10-01 | E-MTAB-3887 | biostudies-arrayexpress

Inference of phylogenetic trees directly from raw sequencing reads using Read2Tree. | S-EPMC10791578 | biostudies-literature

Identifying micro-inversions using high-throughput sequencing reads. | S-EPMC4895285 | biostudies-literature

Characteristics of T cell receptor repertoires of patients with acute myocardial infarction through high-throughput sequencing. | S-EPMC6330436 | biostudies-literature

High-throughput Treg cell receptor sequencing reveals differential immune repertoires in rheumatoid arthritis with kidney deficiency. | S-EPMC9899432 | biostudies-literature

Fulcrum: condensing redundant reads from high-throughput sequencing studies. | S-EPMC3348557 | biostudies-literature

FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads. | S-EPMC5451431 | biostudies-literature