Unknown

Dataset Information

0

Indexcov: fast coverage quality control for whole-genome sequencing.


ABSTRACT: The BAM and CRAM formats provide a supplementary linear index that facilitates rapid access to sequence alignments in arbitrary genomic regions. Comparing consecutive entries in a BAM or CRAM index allows one to infer the number of alignment records per genomic region for use as an effective proxy of sequence depth in each genomic region. Based on these properties, we have developed indexcov, an efficient estimator of whole-genome sequencing coverage to rapidly identify samples with aberrant coverage profiles, reveal large-scale chromosomal anomalies, recognize potential batch effects, and infer the sex of a sample. Indexcov is available at https://github.com/brentp/goleft under the MIT license.

SUBMITTER: Pedersen BS 

PROVIDER: S-EPMC5737511 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Indexcov: fast coverage quality control for whole-genome sequencing.

Pedersen Brent S BS   Collins Ryan L RL   Talkowski Michael E ME   Quinlan Aaron R AR  

GigaScience 20171101 11


The BAM and CRAM formats provide a supplementary linear index that facilitates rapid access to sequence alignments in arbitrary genomic regions. Comparing consecutive entries in a BAM or CRAM index allows one to infer the number of alignment records per genomic region for use as an effective proxy of sequence depth in each genomic region. Based on these properties, we have developed indexcov, an efficient estimator of whole-genome sequencing coverage to rapidly identify samples with aberrant cov  ...[more]

Similar Datasets

| S-EPMC4610308 | biostudies-literature
| S-EPMC4931220 | biostudies-literature
| S-EPMC4344394 | biostudies-literature
| S-EPMC4144465 | biostudies-literature
| S-EPMC8394143 | biostudies-literature
| S-EPMC9399180 | biostudies-literature
| S-EPMC5829578 | biostudies-literature
| S-EPMC6834861 | biostudies-literature
| S-EPMC10142246 | biostudies-literature
| S-EPMC7236547 | biostudies-literature