Browse
Submit Data
Databases
API
Help

Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

76 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

TMC-SNPdb: An Indian germline variant dataset derived from whole exome sequence

ABSTRACT: Cancer is predominantly a somatic disease. A mutant allele found in cancer cell genome is considered somatic when it is absent in paired normal genome and dbSNP, the most comprehensive public SNP database. However, dbSNP inadequately represents several non-Caucasian populations including that from the Indian subcontinent, posing a limitation in cancer genomic analyses of data from these populations. We present TMC-SNPdb, as the first open source freely accessible (through ANNOVAR), flexible and upgradable SNP database from whole exome data of 62 normal samples derived from cancer patients of Indian origin, representing 114,309 unique germline variants. TMC-SNPdb is presented with a companion subtraction tool that can be executed with command line option or an easy-to-use graphical user interface (GUI) with the ability to deplete additional Indian population specific SNPs over and above that possible with dbSNP and 1000 Genomes databases. Using an institutional generated whole exome data set of 132 samples of Indian origin, we demonstrate that TMC-SNPdb reduced 42%, 33% and 28% false positive somatic events post dbSNP depletion in Indian origin tongue, gallbladder, and cervical cancer samples, respectively. Beyond cancer somatic analyses, we anticipate utility of TMC-SNPdb in several Mendelian germline diseases.

INSTRUMENT(S): Illumina HiSeq 1500, Illumina Genome Analyzer IIx, NextSeq 500, Illumina HiSeq 2000

ORGANISM(S): Homo sapiens

SUBMITTER: Amit Dutt

PROVIDER: E-MTAB-4618 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Json Xml

Similar Datasets

Integrated Genomics Approach to Identify Biologically Relevant Alterations in Fewer Samples.

Project description:This study involves characterization of four head and neck cancer cell lines -- NT8e, OT9, AW13516 and AW8507, established from Indian head and neck cancer patients, using SNP arrays, whole exome and whole transcriptome sequencing.

2015-10-29 | E-MTAB-3958 | biostudies-arrayexpress

Integrated Genomics Approach to Identify Biologically Relevant Alterations in Fewer Samples.

2015-11-27 | E-MTAB-3961 | biostudies-arrayexpress

Whole exome sequencing of Lung Squamous Carcinoma Patients of Indian Origin

Project description:The study involves whole exome sequencing of 20 primary tumors obtained from lung squamous carcinoma patients of Indian origin. With this, we aim to describe the mutational profile of this specific subset of lung cancer patients. This knowledge will further allow us to gain an insight into potentially actionable genomic alterations prevalent in Indian lung squamous carcinoma.

2022-02-11 | E-MTAB-8801 | biostudies-arrayexpress

Integrated Genomics Approach to Identify Biologically Relevant Alterations in Fewer Samples.

2015-11-01 | E-MTAB-3960 | biostudies-arrayexpress

Whole exome sequencing analyzed the off-target effect of gene editing through BE3 system.

Project description:We used the whole exome sequencing to analyze the off-target effect of base editing system in mouse genomic DNA, which was extracted from haploid stem cells, mouse tails of semi-cloned embryos and mutant pups. The purpose of this sequencing is to find whether there exists off-target effect in the genome. By obtaining over 100 million reads of each sample from WES, we mapped the reads to the reference data base (mm10) and calculated the numbers of SNVs and indels. After filtering out naturally-occurring variants in the SNP database (dbSNP) and excluding SNPs also found in the wild-type genome, we next compared the DNA sequences at the remaining SNP sites with the on-target sequence. The results indicated that rare off-targets events happened in tested cell and embryos.

2018-08-06 | GSE115017 | GEO

Affymetrix SNP array data for myelodysplastic syndromes (MDS) and related neoplasms

Project description:In this study, to obtain a complete registry of genetic lesions in MDS and to identify novel therapeutic targets, we performed SNP array analysis and whole exome analysis for novel mutations using high-throughput sequencing technologies. In whole exome analysis, paired CD3-positive T cells were used as a normal control. By comparing sequences in tumors and paired T cells, 268 non-synonymous somatic mutations were confirmed with an overall true positive rate of 53.9 %, including 206 missense, 25 nonsense, and 10 splice site mutations, and 27 frameshift-causing insertions/deletions (indels). The mutations of the known gene targets, however, accounted for only 12.3 % of all detected mutations (N = 33), and the remaining 235 mutations involved previously unreported genes. Combined with the genomic copy number profile obtained by SNP array karyotyping, this array of somatic mutations provided a landscape of myelodysplasia genomes. Copy number analysis of Affymetrix 250K SNP arrays was performed for 29 MDS or related neoplasms and paired 29 germline samples.

2011-09-11 | E-GEOD-31174 | biostudies-arrayexpress

Exomes of human leukemic JMML (Juvenile MyeloMonocytic Leukemia) xenografts in NSG or NSG-S mice (samples enriched in hCD45)

Project description:The study includes 14 patients with confirmed JMML and known somatic mutations (from exome data of paired tumoral and germline DNA). Bone marrow or peripheral blood mononucleated cells were injected in immundeficient mice to recapitulate the leukemia. Whole exome sequencing was performed in xenograft samples to control the persistance of patients' known mutations and look for new mutations acquired in xenograft sample.

2018-03-15 | E-MTAB-6467 | biostudies-arrayexpress

Germline and somatic genetic variants in the p53 pathway interact to affect cancer risk, progression, and drug response

Project description:Insights into oncogenesis derived from cancer susceptibility loci (single nucleotide polymorphisms, SNP) hold the potential to facilitate better cancer management and treatment through precision oncology. However, therapeutic insights have thus far been limited by our current lack of understanding regarding both interactions of these loci with somatic cancer driver mutations and their influence on tumorigenesis. For example, while both germline and somatic genetic variation to the p53 tumor suppressor pathway are known to promote tumorigenesis, little is known about the extent to which such variants cooperate to alter pathway activity. Here we hypothesize that cancer risk-associated germline variants interact with somatic TP53 mutational status to modify cancer risk, progression, and response to therapy. Focusing on a cancer risk SNP (rs78378222) with a well-documented ability to directly influence p53 activity as well as integration of germline datasets relating to cancer susceptibility with tumor data capturing somatically-acquired genetic variation provided supportive evidence for this hypothesis. Integration of germline and somatic genetic data enabled identification of a novel entry point for therapeutic manipulation of p53 activities. A cluster of cancer risk SNPs resulted in increased expression of pro-survival p53 target gene KITLG and attenuation of p53-mediated responses to genotoxic therapies, which were reversed by pharmacological inhibition of the pro-survival c-KIT signal. Together, our results offer evidence of how cancer susceptibility SNPs can interact with cancer driver genes to affect cancer progression and identify novel combinatorial therapies.

2021-02-10 | GSE143561 | GEO

APOBEC mutagenesis and copy number alterations are drivers of proteogenomic tumor evolution and heterogeneity in metastatic thoracic tumors

Project description:Intratumor mutational heterogeneity has been documented in primary non-small cell lung cancer. Here, we elucidate mechanisms of tumor evolution and heterogeneity in metastatic thoracic tumors (lung adenocarcinoma and thymic carcinoma) using whole-exome and transcriptome sequencing, SNP array for copy number alterations (CNA) and mass spectrometry-based quantitative proteomics of metastases obtained by rapid autopsy. APOBEC-mutagenesis, promoted by increased expression of APOBEC3 region transcripts and associated with a high-risk germline APOBEC3 variant, strongly correlated with mutational tumor heterogeneity. TP53 mutation status was associated with APOBEC hypermutator status. Interferon pathways were enriched in tumors with high APOBEC mutagenesis and IFN- induced expression of APOBEC3B in lung adenocarcinoma cells in culture suggesting a role for the immune microenvironment in the generation of mutational heterogeneity. CNA occurring late in tumor evolution correlated with downstream transcriptomic and proteomic heterogeneity, although global proteomic heterogeneity was significantly greater than transcriptomic and CNA heterogeneity. These results illustrate key mechanisms underlying multi-dimensional heterogeneity in metastatic thoracic tumors.

2019-05-07 | PXD012845 | Pride

Homo sapiens

Project description:Whole Exome Sequencing samples of Cancer Cervix from Indian origin

| PRJNA789476 | ENA