Unknown

Dataset Information

0

A Novel Method to Detect Bias in Short Read NGS Data.


ABSTRACT: Detecting sources of bias in transcriptomic data is essential to determine signals of Biological significance. We outline a novel method to detect sequence specific bias in short read Next Generation Sequencing data. This is based on determining intra-exon correlations between specific motifs. This requires a mild assumption that short reads sampled from specific regions from the same exon will be correlated with each other. This has been implemented on Apache Spark and used to analyse two D. melanogaster eye-antennal disc data sets generated at the same laboratory. The wild type data set in drosophila indicates a variation due to motif GC content that is more significant than that found due to exon GC content. The software is available online and could be applied for cross-experiment transcriptome data analysis in eukaryotes.

SUBMITTER: Alnasir J 

PROVIDER: S-EPMC6042817 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Novel Method to Detect Bias in Short Read NGS Data.

Alnasir Jamie J   Shanahan Hugh P HP  

Journal of integrative bioinformatics 20170923 3


Detecting sources of bias in transcriptomic data is essential to determine signals of Biological significance. We outline a novel method to detect sequence specific bias in short read Next Generation Sequencing data. This is based on determining intra-exon correlations between specific motifs. This requires a mild assumption that short reads sampled from specific regions from the same exon will be correlated with each other. This has been implemented on Apache Spark and used to analyse two D. me  ...[more]

Similar Datasets

| S-EPMC6387560 | biostudies-literature
| S-EPMC8694450 | biostudies-literature
| S-EPMC3871669 | biostudies-literature
| S-EPMC2943903 | biostudies-literature
| S-EPMC4246604 | biostudies-literature
| S-EPMC3106329 | biostudies-literature
| S-EPMC5657049 | biostudies-literature
| S-EPMC4824081 | biostudies-other
| S-EPMC3035802 | biostudies-other