Dataset Information

BatchQC: interactive software for evaluating sample and batch effects in genomic data.


ABSTRACT: Sequencing and microarray samples are often collected or processed in multiple batches or at different times. This often produces technical biases that can lead to incorrect results in downstream analysis. Several batch adjustment tools exist for '-omics' data, but they do not indicate a priori whether adjustment is needed or how correction should be applied. We present a software pipeline, BatchQC, which addresses these issues using interactive visualizations and statistics that evaluate the impact of batch effects in a genomic dataset. BatchQC can also apply existing adjustment tools and lets users evaluate their benefits interactively. We used the BatchQC pipeline on both simulated and real data to demonstrate the effectiveness of this software toolkit. BatchQC is available through Bioconductor: http://bioconductor.org/packages/BatchQC and GitHub: https://github.com/mani2012/BatchQC. CONTACT: wej@bu.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

SUBMITTER: Manimaran S 

PROVIDER: S-EPMC5167063 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

Publications

BatchQC: interactive software for evaluating sample and batch effects in genomic data.

Manimaran S, Selby HM, Okrah K, Ruberman C, Leek JT, Quackenbush J, Haibe-Kains B, Bravo HC, Johnson WE

Bioinformatics (Oxford, England), 2016-08-18, issue 24


Similar Datasets

| S-EPMC6185451 | biostudies-literature
| S-EPMC7416706 | biostudies-literature
| S-EPMC6107636 | biostudies-literature
| S-EPMC6803429 | biostudies-literature
| S-EPMC6061843 | biostudies-literature
| S-EPMC8485848 | biostudies-literature
| S-EPMC10025448 | biostudies-literature
| S-EPMC7660438 | biostudies-literature
| S-EPMC3810845 | biostudies-other
| S-EPMC7821039 | biostudies-literature