Dataset Information

bcSeq: an R package for fast sequence mapping in high-throughput shRNA and CRISPR screens.


ABSTRACT: Summary: CRISPR-Cas9 and shRNA high-throughput sequencing screens have abundant applications for basic and translational research. Methods and tools for the analysis of these screens must properly account for sequencing error, resolve ambiguous mappings among similar sequences in the barcode library in a statistically principled manner, and be computationally efficient. Herein we present bcSeq, an open source R package that implements a fast and parallelized algorithm for mapping high-throughput sequencing reads to a barcode library while tolerating sequencing error. The algorithm uses a Trie data structure for speed and resolves ambiguous mappings by using a statistical sequencing error model based on Phred scores for each read. Availability and implementation: The package source code and an accompanying tutorial are available at http://bioconductor.org/packages/bcSeq/. Supplementary information: Supplementary data are available at Bioinformatics online.
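
The mapping idea described in the abstract (a trie over the barcode library, with ambiguous hits ranked by a Phred-based error model) can be illustrated with a minimal Python sketch. This is not bcSeq's code or API: the function names (`build_trie`, `search`, `best_match`) are hypothetical, and the sketch assumes substitution errors only, equal-length barcodes and reads, independent per-base errors, and a uniform e/3 probability for each wrong base. The package's actual error model may differ.

```python
import math

def build_trie(barcodes):
    """Build a nested-dict trie; a terminal node stores its barcode under '$'."""
    root = {}
    for bc in barcodes:
        node = root
        for base in bc:
            node = node.setdefault(base, {})
        node["$"] = bc
    return root

def phred_to_error(q):
    """Convert a Phred score Q to an error probability 10^(-Q/10).

    Clamped to 0.75 (assumption) so that log(1 - e) stays finite for Q = 0.
    """
    return min(10 ** (-q / 10), 0.75)

def search(trie, read, quals, max_mismatch=1):
    """Return {barcode: likelihood} for barcodes within max_mismatch
    substitutions of the read (illustrative model, not bcSeq's)."""
    hits = {}

    def walk(node, i, mismatches, loglik):
        if i == len(read):
            if "$" in node:
                hits[node["$"]] = math.exp(loglik)
            return
        e = phred_to_error(quals[i])
        for base, child in node.items():
            if base == "$":
                continue
            if base == read[i]:
                # Base agrees with the barcode: probability of a correct call.
                walk(child, i + 1, mismatches, loglik + math.log(1 - e))
            elif mismatches < max_mismatch:
                # Base disagrees: one specific wrong base out of three.
                walk(child, i + 1, mismatches + 1, loglik + math.log(e / 3))

    walk(trie, 0, 0, 0.0)
    return hits

def best_match(trie, read, quals, max_mismatch=1):
    """Return (barcode, normalized weight) for the most likely hit, or None."""
    hits = search(trie, read, quals, max_mismatch)
    if not hits:
        return None
    total = sum(hits.values())
    bc = max(hits, key=hits.get)
    return bc, hits[bc] / total
```

Because the trie shares prefixes across barcodes, the search abandons a branch as soon as the mismatch budget is spent, which is what keeps lookup fast when the library is large. The normalized weight returned by `best_match` is one simple way to express how confidently an ambiguous read is assigned; a mismatch at a low-quality position is penalized less than one at a high-quality position.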

SUBMITTER: Lin J 

PROVIDER: S-EPMC6184561 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

Publications

bcSeq: an R package for fast sequence mapping in high-throughput shRNA and CRISPR screens.

Jiaxing Lin, Jeremy Gresham, Tongrong Wang, So Young Kim, James Alvarez, Jeffrey S Damrauer, Scott Floyd, Joshua Granek, Andrew Allen, Cliburn Chan, Jichun Xie, Kouros Owzar

Bioinformatics (Oxford, England), 2018-10-01, issue 20


Similar Datasets

S-EPMC3852393 | biostudies-literature
S-EPMC8689513 | biostudies-literature
S-EPMC5430825 | biostudies-literature
S-EPMC4836973 | biostudies-literature
S-EPMC7660407 | biostudies-literature
S-EPMC4023662 | biostudies-literature
S-EPMC8137922 | biostudies-literature
S-EPMC6923430 | biostudies-literature
S-EPMC2644677 | biostudies-literature
S-EPMC5449203 | biostudies-literature