Genomics

Dataset Information

GCparagon: Evaluation and correction of GC biases in cell-free DNA at the fragment level


ABSTRACT: We present GCparagon, a two-stage algorithm for computing and correcting GC biases in cell-free DNA (cfDNA) samples. The length of the highly fragmented cfDNA molecules and their number of GC bases are the essential parameters in the calculations. Regions of low mappability, known reference genome assembly errors, and regions surrounding assembly gaps are excluded from the bias computation. GCparagon outputs a bias matrix and, optionally, a tagged BAM file with GC bias balance weights as alignment tags. Parallelization allows a GC bias estimate to be calculated in less than 2 minutes per sample, with between 99.0% and 99.9% of fragments already corrected. We propose that GCparagon can help standardize cfDNA applications and evaluate the impact of GC bias on algorithms used in the analysis of liquid biopsy data.
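
The idea of a fragment-level bias matrix indexed by fragment length and GC base count can be illustrated with a small sketch. This is not GCparagon's implementation: the function names (gc_count, gc_weight_matrix), the min_count parameter, and the toy input are all hypothetical, and the "expected" fragments are assumed to come from some reference-based sampling step that is not shown here. The sketch only shows how per-cell weights of the form expected share / observed share could rebalance over- and under-represented fragments.

```python
from collections import Counter


def gc_count(seq):
    """Count G and C bases in a fragment sequence (case-insensitive)."""
    seq = seq.upper()
    return seq.count("G") + seq.count("C")


def gc_weight_matrix(observed, expected, min_count=1):
    """Return {(length, gc_count): weight} for every observed cell.

    observed, expected: iterables of (fragment_length, gc_count) tuples.
    weight = (expected share of the cell) / (observed share of the cell).
    Cells with too few observed fragments, or with no expected fragments,
    get a neutral weight of 1.0 (assumption made for this sketch).
    """
    obs = Counter(observed)
    exp = Counter(expected)
    n_obs = sum(obs.values())
    n_exp = sum(exp.values())
    weights = {}
    for cell, o in obs.items():
        e = exp.get(cell, 0)
        if o < min_count or e == 0:
            weights[cell] = 1.0  # not enough information to correct
            continue
        weights[cell] = (e / n_exp) / (o / n_obs)
    return weights


# Toy data: fragments of length 4 with either 1 or 3 GC bases.
observed = [(4, 1), (4, 1), (4, 1), (4, 3)]  # GC-poor fragments over-represented
expected = [(4, 1), (4, 1), (4, 3), (4, 3)]  # assumed unbiased distribution
weights = gc_weight_matrix(observed, expected)

# Summing the weights over all observed fragments gives the corrected coverage.
corrected_total = sum(weights[cell] for cell in observed)
```

In this toy case the over-represented GC-poor cell is down-weighted and the under-represented GC-rich cell is up-weighted, while the weighted fragment total stays equal to the observed fragment count. In a real workflow such weights would be attached to each alignment, which is what the tagged BAM output described in the abstract provides.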

PROVIDER: EGAS00001006963 | EGA

REPOSITORIES: EGA

Similar Datasets

| PRJNA419240 | ENA
| EGAD00001010100 | EGA
2014-06-06 | E-GEOD-56644 | biostudies-arrayexpress
2019-03-31 | E-MTAB-7163 | biostudies-arrayexpress
| PRJNA503577 | ENA
| PRJNA380045 | ENA
2019-08-19 | PXD007891 | Pride
2019-05-07 | GSE124974 | GEO
2022-07-21 | GSE208596 | GEO
2022-05-13 | GSE202606 | GEO