
Dataset Information


TMC-SNPdb: an Indian germline variant database derived from whole exome sequences.


ABSTRACT: Cancer is predominantly a somatic disease. A mutant allele present in a cancer cell genome is considered somatic when it is absent from both the paired normal genome and public SNP databases. However, the current build of dbSNP, the most comprehensive public SNP database, inadequately represents several non-European Caucasian populations, which limits cancer genomic analyses of data from these populations. We present the Tata Memorial Centre-SNP Database (TMC-SNPdb) as the first open source, flexible, upgradable, and freely available SNP database (accessible through dbSNP build 149 and ANNOVAR). It represents 114 309 unique germline variants generated from whole exome data of 62 normal samples derived from cancer patients of Indian origin. The TMC-SNPdb is presented with a companion subtraction tool that can be run from the command line or through an easy-to-use graphical user interface, and that can deplete additional Indian population-specific SNPs over and above the dbSNP and 1000 Genomes databases. Using an institutionally generated whole exome data set of 132 samples of Indian origin, we demonstrate that TMC-SNPdb could deplete 42%, 33% and 28% of the false positive somatic events remaining after dbSNP depletion in tongue, gallbladder, and cervical cancer samples of Indian origin, respectively. Beyond cancer somatic analyses, we anticipate utility of the TMC-SNPdb in several Mendelian germline diseases. In addition to dbSNP build 149 and ANNOVAR, the TMC-SNPdb along with the subtraction tool is available for download in the public domain at the following Database URL: http://www.actrec.gov.in/pi-webpages/AmitDutt/TMCSNP/TMCSNPdp.html.
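The subtraction idea described in the abstract can be illustrated with a minimal sketch. This is not the TMC-SNPdb companion tool or its interface; the `Variant` type, the `deplete_germline` and `percent_depleted` helpers, and all variant records below are hypothetical and exist only to show the logic: a tumor call is kept as a candidate somatic event only if it is absent from the paired normal and from every germline database supplied.

```python
from typing import Iterable, List, NamedTuple


class Variant(NamedTuple):
    """Hypothetical variant key: chromosome, position, reference and alternate allele."""
    chrom: str
    pos: int
    ref: str
    alt: str


def deplete_germline(
    tumor_calls: Iterable[Variant],
    paired_normal: Iterable[Variant],
    *snp_databases: Iterable[Variant],
) -> List[Variant]:
    """Keep tumor calls that are absent from the paired normal and all databases."""
    known_germline = set(paired_normal)
    for database in snp_databases:
        known_germline.update(database)
    return [v for v in tumor_calls if v not in known_germline]


def percent_depleted(before: int, after: int) -> float:
    """Percentage of calls removed by one depletion step."""
    if before == 0:
        return 0.0
    return 100.0 * (before - after) / before


# Toy data, invented for illustration only.
tumor = [
    Variant("chr1", 100, "A", "G"),
    Variant("chr2", 200, "C", "T"),
    Variant("chr3", 300, "G", "A"),
    Variant("chr4", 400, "T", "C"),
]
normal = {Variant("chr1", 100, "A", "G")}
dbsnp_like = {Variant("chr2", 200, "C", "T")}
population_db = {Variant("chr3", 300, "G", "A")}

# Step 1: deplete against the paired normal and a general SNP database.
after_dbsnp = deplete_germline(tumor, normal, dbsnp_like)
# Step 2: deplete the remainder against a population-specific database.
after_population = deplete_germline(after_dbsnp, [], population_db)

extra = percent_depleted(len(after_dbsnp), len(after_population))
print(f"Additional depletion after the population-specific step: {extra:.0f}%")
```

Matching on the exact (chromosome, position, reference, alternate) tuple is a simplifying assumption; a real pipeline would also need to handle variant normalization and genome build differences.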

SUBMITTER: Upadhyay P 

PROVIDER: S-EPMC4940432 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature


Publications

TMC-SNPdb: an Indian germline variant database derived from whole exome sequences.

Upadhyay Pawan, Gardi Nilesh, Desai Sanket, Sahoo Bikram, Singh Ankita, Togar Trupti, Iyer Prajish, Prasad Ratnam, Chandrani Pratik, Gupta Sudeep, Dutt Amit

Database: the journal of biological databases and curation, 2016-07-09



Similar Datasets

2018-06-06 | E-MTAB-4618 | biostudies-arrayexpress
| PRJEB14300 | ENA
| S-EPMC9216475 | biostudies-literature
| PRJEB13801 | ENA
| S-EPMC9777150 | biostudies-literature
| S-EPMC6441288 | biostudies-literature
| S-EPMC10032232 | biostudies-literature
| S-EPMC8836109 | biostudies-literature
| S-EPMC6081235 | biostudies-literature
2022-02-11 | E-MTAB-8801 | biostudies-arrayexpress