Project description:Next-generation sequencing technologies have been and continue to be deployed in clinical laboratories, enabling rapid transformations in genomic medicine. These technologies have reduced the cost of large-scale sequencing by several orders of magnitude, and continuous advances are being made. It is now feasible to analyze an individual's near-complete exome or genome to assist in the diagnosis of a wide array of clinical scenarios. Next-generation sequencing technologies are also facilitating further advances in therapeutic decision making and disease prediction for at-risk patients. However, with rapid advances come additional challenges involving the clinical validation and use of these constantly evolving technologies and platforms in clinical laboratories. To assist clinical laboratories with the validation of next-generation sequencing methods and platforms, the ongoing monitoring of next-generation sequencing testing to ensure quality results, and the interpretation and reporting of variants found using these technologies, the American College of Medical Genetics and Genomics has developed the following professional standards and guidelines.
Project description:We compared whole-exome sequencing (WES) and whole-genome sequencing (WGS) in six unrelated individuals. In the regions targeted by WES capture (81.5% of the consensus coding genome), the mean numbers of single-nucleotide variants (SNVs) and small insertions/deletions (indels) detected per sample were 84,192 and 13,325, respectively, for WES, and 84,968 and 12,702, respectively, for WGS. For both SNVs and indels, the distributions of coverage depth, genotype quality, and minor read ratio were more uniform for WGS than for WES. After filtering, a mean of 74,398 (95.3%) high-quality (HQ) SNVs and 9,033 (70.6%) HQ indels were called by both platforms. A mean of 105 coding HQ SNVs and 32 indels was identified exclusively by WES whereas 692 HQ SNVs and 105 indels were identified exclusively by WGS. We Sanger-sequenced a random selection of these exclusive variants. For SNVs, the proportion of false-positive variants was higher for WES (78%) than for WGS (17%). The estimated mean number of real coding SNVs (656 variants, ∼3% of all coding HQ SNVs) identified by WGS and missed by WES was greater than the number of SNVs identified by WES and missed by WGS (26 variants). For indels, the proportions of false-positive variants were similar for WES (44%) and WGS (46%). Finally, WES was not reliable for the detection of copy-number variations, almost all of which extended beyond the targeted regions. Although currently more expensive, WGS is more powerful than WES for detecting potential disease-causing mutations within WES regions, particularly those due to SNVs.
Project description:Mitochondrial DNA (mtDNA) variant pathogenicity interpretation has special considerations given unique features of the mtDNA genome, including maternal inheritance, variant heteroplasmy, threshold effect, absence of splicing, and contextual effects of haplogroups. Currently, there are insufficient standardized criteria for mtDNA variant assessment, which leads to inconsistencies in clinical variant pathogenicity reporting. An international working group of mtDNA experts was assembled within the Mitochondrial Disease Sequence Data Resource Consortium and obtained Expert Panel status from ClinGen. This group reviewed the 2015 American College of Medical Genetics and Association of Molecular Pathology standards and guidelines that are widely used for clinical interpretation of DNA sequence variants and provided further specifications for additional and specific guidance related to mtDNA variant classification. These Expert Panel consensus specifications allow for consistent consideration of the unique aspects of the mtDNA genome that directly influence variant assessment, including addressing mtDNA genome composition and structure, haplogroups and phylogeny, maternal inheritance, heteroplasmy, and functional analyses unique to mtDNA, as well as specifications for utilization of mtDNA genomic databases and computational algorithms.
Project description:Tooth agenesis is a common craniofacial abnormality in humans and represents failure to develop 1 or more permanent teeth. Tooth agenesis is complex, and variations in about a dozen genes have been reported as contributing to the etiology. Here, we combined whole-exome sequencing, array-based genotyping, and linkage analysis to identify putative pathogenic variants in candidate disease genes for tooth agenesis in 10 multiplex Turkish families. Novel homozygous and heterozygous variants in LRP6, DKK1, LAMA3, and COL17A1 genes, as well as known variants in WNT10A, were identified as likely pathogenic in isolated tooth agenesis. Novel variants in KREMEN1 were identified as likely pathogenic in 2 families with suspected syndromic tooth agenesis. Variants in more than 1 gene were identified segregating with tooth agenesis in 2 families, suggesting oligogenic inheritance. Structural modeling of missense variants suggests deleterious effects to the encoded proteins. Functional analysis of an indel variant (c.3607+3_6del) in LRP6 suggested that the predicted resulting mRNA is subject to nonsense-mediated decay. Our results support a major role for WNT pathways genes in the etiology of tooth agenesis while revealing new candidate genes. Moreover, oligogenic cosegregation was suggestive for complex inheritance and potentially complex gene product interactions during development, contributing to improved understanding of the genetic etiology of familial tooth agenesis.
Project description:Erythrocytosis is characterized by an increase in red cells in peripheral blood. Polycythemia vera, the commonest primary erythrocytosis, results from pathogenic variants in JAK2 in ∼98% of cases. Although some variants have been reported in JAK2-negative polycythemia, the causal genetic variants remain unidentified in ∼80% of cases. To discover genetic variants in unexplained erythrocytosis, we performed whole exome sequencing in 27 patients with JAK2-negative polycythemia after excluding the presence of any mutations in genes previously associated with erythrocytosis (EPOR, VHL, PHD2, EPAS1, HBA, and HBB). We found that the majority of patients (25/27) had variants in genes involved in epigenetic processes, including TET2 and ASXL1 or in genes related to hematopoietic signaling such as MPL and GFIB. Based on computational analysis, we believe that variants identified in 11 patients in this study could be pathogenic although functional studies will be required for confirmation. To our knowledge, this is the largest study reporting novel variants in individuals with unexplained erythrocytosis. Our results suggest that genes involved in epigenetic processes and hematopoietic signaling pathways are likely associated with unexplained erythrocytosis in individuals lacking JAK2 mutations. With very few previous studies targeting JAK2-negative polycythemia patients to identify underlying variants, this study opens a new avenue in evaluating and managing JAK2-negative polycythemia.
Project description:In 2015, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) published updated standards and guidelines for the clinical interpretation of sequence variants with respect to human diseases on the basis of 28 criteria. However, variability between individual interpreters can be extensive because of reasons such as the different understandings of these guidelines and the lack of standard algorithms for implementing them, yet computational tools for semi-automated variant interpretation are not available. To address these problems, we propose a suite of methods for implementing these criteria and have developed a tool called InterVar to help human reviewers interpret the clinical significance of variants. InterVar can take a pre-annotated or VCF file as input and generate automated interpretation on 18 criteria. Furthermore, we have developed a companion web server, wInterVar, to enable user-friendly variant interpretation with an automated interpretation step and a manual adjustment step. These tools are especially useful for addressing severe congenital or very early-onset developmental disorders with high penetrance. Using results from a few published sequencing studies, we demonstrate the utility of InterVar in significantly reducing the time to interpret the clinical significance of sequence variants.
Project description:Germline pathogenic variants in TP53 are associated with Li-Fraumeni syndrome, a cancer predisposition disorder inherited in an autosomal dominant pattern associated with a high risk of malignancy, including early-onset breast cancers, sarcomas, adrenocortical carcinomas, and brain tumors. Intense cancer surveillance for individuals with TP53 germline pathogenic variants is associated with reduced cancer-related mortality. Accurate and consistent classification of germline variants across clinical and research laboratories is important to ensure appropriate cancer surveillance recommendations. Here, we describe the work performed by the Clinical Genome Resource TP53 Variant Curation Expert Panel (ClinGen TP53 VCEP) focused on specifying the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) guidelines for germline variant classification to the TP53 gene. Specifications were developed for 20 ACMG/AMP criteria, while nine were deemed not applicable. The original strength level for the 10 criteria was also adjusted due to current evidence. Use of TP53-specific guidelines and sharing of clinical data among experts and clinical laboratories led to a decrease in variants of uncertain significance from 28% to 12% compared with the original guidelines. The ClinGen TP53 VCEP recommends the use of these TP53-specific ACMG/AMP guidelines as the standard strategy for TP53 germline variant classification.
Project description:We developed a novel software tool, EXCAVATOR, for the detection of copy number variants (CNVs) from whole-exome sequencing data. EXCAVATOR combines a three-step normalization procedure with a novel heterogeneous hidden Markov model algorithm and a calling method that classifies genomic regions into five copy number states. We validate EXCAVATOR on three datasets and compare the results with three other methods. These analyses show that EXCAVATOR outperforms the other methods and is therefore a valuable tool for the investigation of CNVs in largescale projects, as well as in clinical research and diagnostics. EXCAVATOR is freely available at http://sourceforge.net/projects/excavatortool/.
Project description:Low levels of high density lipoprotein-cholesterol (HDL-C) are associated with an elevated risk of arteriosclerotic coronary heart disease. Heritability of HDL-C levels is high. In this research discovery study, we used whole-exome sequencing to identify damaging gene variants that may play significant roles in determining HDL-C levels. We studied 204 individuals with a mean HDL-C level of 27.8 ± 6.4 mg/dl (range: 4-36 mg/dl). Data were analyzed by statistical gene burden testing and by filtering against candidate gene lists. We found 120 occurrences of probably damaging variants (116 heterozygous; four homozygous) among 45 of 104 recognized HDL candidate genes. Those with the highest prevalence of damaging variants were ABCA1 (n = 20), STAB1 (n = 9), OSBPL1A (n = 8), CPS1 (n = 8), CD36 (n = 7), LRP1 (n = 6), ABCA8 (n = 6), GOT2 (n = 5), AMPD3 (n = 5), WWOX (n = 4), and IRS1 (n = 4). Binomial analysis for damaging missense or loss-of-function variants identified the ABCA1 and LDLR genes at genome-wide significance. In conclusion, whole-exome sequencing of individuals with low HDL-C showed the burden of damaging rare variants in the ABCA1 and LDLR genes is particularly high and revealed numerous occurrences in HDL candidate genes, including many genes identified in genome-wide association study reports. Many of these genes are involved in cancer biology, which accords with epidemiologic findings of the association of HDL deficiency with increased risk of cancer, thus presenting a new area of interest in HDL genomics.
Project description:Genomic technologies, such as whole-exome sequencing, are a powerful tool in genetic research. Such testing yields a great deal of incidental medical information, or medical information not related to the primary research target. We describe the management of incidental medical information derived from whole-exome sequencing in the research context. We performed whole-exome sequencing on a monozygotic twin pair in which only 1 child was affected with congenital anomalies and applied an institutional review board-approved algorithm to determine what genetic information would be returned. Whole-exome sequencing identified 79525 genetic variants in the twins. Here, we focus on novel variants. After filtering artifacts and excluding known single nucleotide polymorphisms and variants not predicted to be pathogenic, the twins had 32 novel variants in 32 genes that were felt to be likely to be associated with human disease. Eighteen of these novel variants were associated with recessive disease and 18 were associated with dominantly manifesting conditions (variants in some genes were potentially associated with both recessive and dominant conditions), but only 1 variant ultimately met our institutional review board-approved criteria for return of information to the research participants.