Unknown

Dataset Information

0

BiasAway: command-line and web server to generate nucleotide composition-matched DNA background sequences.


ABSTRACT:

Motivation

Accurate motif enrichment analyses depend on the choice of background DNA sequences used, which should ideally match the sequence composition of the foreground sequences. It is important to avoid false positive enrichment due to sequence biases in the genome, such as GC-bias. Therefore, relying on an appropriate set of background sequences is crucial for enrichment analysis.

Results

We developed BiasAway, a command line tool and its dedicated easy-to-use web server to generate synthetic sequences matching any k-mer nucleotide composition or select genomic DNA sequences matching the mononucleotide composition of the foreground sequences through four different models. For genomic sequences, we provide precomputed partitions of genomes from nine species with five different bin sizes to generate appropriate genomic background sequences.

Availability and implementation

BiasAway source code is freely available from Bitbucket (https://bitbucket.org/CBGR/biasaway) and can be easily installed using bioconda or pip. The web server is available at https://biasaway.uio.no and a detailed documentation is available at https://biasaway.readthedocs.io.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Khan A 

PROVIDER: S-EPMC8275979 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC1538864 | biostudies-literature
| S-EPMC5683244 | biostudies-literature
| S-EPMC4542940 | biostudies-literature
| S-EPMC6978790 | biostudies-literature
| S-EPMC7656184 | biostudies-literature
| S-EPMC3380018 | biostudies-literature
| S-EPMC310868 | biostudies-literature
| S-EPMC6875409 | biostudies-literature
| S-EPMC4987903 | biostudies-literature