Dataset Information

FASTQuick: rapid and comprehensive quality assessment of raw sequence reads.

ABSTRACT:

Background

Rapid and thorough quality assessment of sequenced genomes on an ultra-high-throughput scale is crucial for successful large-scale genomic studies. Comprehensive quality assessment typically requires full genome alignment, which demands substantial computational resources and turnaround time. Existing tools are either computationally expensive because they perform full alignment, or they lack essential quality metrics because they skip read alignment.

Findings

We developed a set of rapid and accurate methods to produce comprehensive quality metrics directly from a subset of raw sequence reads (from whole-genome or whole-exome sequencing) without full alignment. Our methods offer turnaround times that are orders of magnitude faster than those of existing full alignment-based methods while providing comprehensive and sophisticated quality metrics, including estimates of genetic ancestry and cross-sample contamination.

Conclusions

By performing quality assessment rapidly and comprehensively, our tool will help investigators detect potential issues in ultra-high-throughput sequence reads in real time and at low computational cost during the early stages of analysis, ensuring high-quality downstream results and preventing unexpected loss of time, money, and invaluable specimens.
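
The general idea in the Findings, computing quality metrics from a subset of raw reads rather than from a full alignment, can be illustrated with a minimal sketch. This is not FASTQuick's implementation and does not reproduce its metrics (ancestry and contamination estimates are out of scope here). It assumes standard four-line FASTQ records with Phred+33 quality encoding, and the function names `read_fastq`, `reservoir_sample`, and `summarize` are hypothetical, chosen only for illustration.

```python
import random
from itertools import islice


def read_fastq(lines):
    """Yield (sequence, quality) pairs from FASTQ lines (4 lines per record)."""
    it = iter(lines)
    while True:
        record = list(islice(it, 4))
        if len(record) < 4:  # stop at end of input or on a truncated record
            return
        yield record[1].strip(), record[3].strip()


def reservoir_sample(records, k, seed=0):
    """Keep a uniform random subset of at most k records in a single pass."""
    rng = random.Random(seed)
    sample = []
    for i, rec in enumerate(records):
        if i < k:
            sample.append(rec)
        else:
            j = rng.randint(0, i)  # inclusive bounds
            if j < k:
                sample[j] = rec
    return sample


def summarize(sample):
    """Compute simple per-base metrics, assuming Phred+33 quality strings."""
    n_bases = sum(len(seq) for seq, _ in sample)
    n_quals = sum(len(qual) for _, qual in sample)
    total_q = sum(ord(c) - 33 for _, qual in sample for c in qual)
    gc = sum(seq.upper().count("G") + seq.upper().count("C") for seq, _ in sample)
    return {
        "reads": len(sample),
        "mean_phred": total_q / n_quals if n_quals else 0.0,
        "gc_fraction": gc / n_bases if n_bases else 0.0,
    }
```

With a real file, the lines of an open FASTQ file handle could be passed to `read_fastq` and the resulting generator to `reservoir_sample`, so that memory use depends on the sample size rather than on the total number of reads.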

SUBMITTER: Zhang F

PROVIDER: S-EPMC7844880 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

Publications

FASTQuick: rapid and comprehensive quality assessment of raw sequence reads.

Zhang F, Kang HM

GigaScience 20210101 2

PMID: 33511994

Similar Datasets

Ophiocordyceps sinensis raw sequence reads
2019-05-14 | GSE123085 | GEO

Oryza sativa Raw sequence reads
2015-04-22 | E-MTAB-4312 | biostudies-arrayexpress

Oryza sativa Raw sequence reads
2015-07-31 | E-MTAB-4347 | biostudies-arrayexpress

Populus tricho Raw sequence reads
2015-06-01 | E-MTAB-4364 | biostudies-arrayexpress

Arabidopsis thaliana Raw sequence reads
2015-06-20 | E-MTAB-4396 | biostudies-arrayexpress

Cyprinus carpio 'koi' Raw sequence reads
2019-01-15 | GSE125039 | GEO

Homo sapiens Raw sequence reads-ferroptosis resistance
2022-06-01 | GSE173905 | GEO

Vitis vinifera RNA-Seq raw sequence reads
2015-07-08 | E-MTAB-4390 | biostudies-arrayexpress