Unknown

Dataset Information

0

Pan-cancer analysis of systematic batch effects on somatic sequence variations.


ABSTRACT: The Cancer Genome Atlas (TCGA) is a comprehensive database that includes multi-layered cancer genome profiles. Large-scale collection of data inevitably generates batch effects introduced by differences in processing at various stages from sample collection to data generation. However, batch effects on the sequence variation and its characteristics have not been studied extensively.We systematically evaluated batch effects on somatic sequence variations in pan-cancer TCGA data, revealing 999 somatic variants that were batch-biased with statistical significance (P?

SUBMITTER: Choi JH 

PROVIDER: S-EPMC5387285 | biostudies-literature | 2017 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pan-cancer analysis of systematic batch effects on somatic sequence variations.

Choi Ji-Hye JH   Hong Seong-Eui SE   Woo Hyun Goo HG  

BMC bioinformatics 20170411 1


<h4>Background</h4>The Cancer Genome Atlas (TCGA) is a comprehensive database that includes multi-layered cancer genome profiles. Large-scale collection of data inevitably generates batch effects introduced by differences in processing at various stages from sample collection to data generation. However, batch effects on the sequence variation and its characteristics have not been studied extensively.<h4>Results</h4>We systematically evaluated batch effects on somatic sequence variations in pan-  ...[more]

Similar Datasets

| S-EPMC5956099 | biostudies-literature
| S-EPMC6686424 | biostudies-literature
| S-EPMC7648123 | biostudies-literature
| S-EPMC5951840 | biostudies-literature
| S-EPMC4671203 | biostudies-literature
| S-EPMC9747925 | biostudies-literature
| S-EPMC10651928 | biostudies-literature
| S-EPMC6964824 | biostudies-literature
| S-EPMC6907172 | biostudies-literature
| S-EPMC3966983 | biostudies-literature