Project description:Genome sequencing has established clinical utility for rare disease diagnosis. While increasing numbers of individuals have undergone elective genome sequencing, a comprehensive study surveying genome-wide disease-associated genes in adults with deep phenotyping has not been reported. Here we report the results of a 3-y precision medicine study with a goal to integrate whole-genome sequencing with deep phenotyping. A cohort of 1,190 adult participants (402 female [33.8%]; mean age, 54 y [range 20 to 89+]; 70.6% European) had whole-genome sequencing, and were deeply phenotyped using metabolomics, advanced imaging, and clinical laboratory tests in addition to family/medical history. Of 1,190 adults, 206 (17.3%) had at least 1 genetic variant with pathogenic (P) or likely pathogenic (LP) assessment that suggests a predisposition of genetic risk. A multidisciplinary clinical team reviewed all reportable findings for the assessment of genotype and phenotype associations, and 137 (11.5%) had genotype and phenotype associations. A high percentage of genotype and phenotype associations (>75%) was observed for dyslipidemia (n = 24), cardiomyopathy, arrhythmia, and other cardiac diseases (n = 42), and diabetes and endocrine diseases (n = 17). A lack of genotype and phenotype associations, a potential burden for patient care, was observed in 69 (5.8%) individuals with P/LP variants. Genomics and metabolomics associations identified 61 (5.1%) heterozygotes with phenotype manifestations affecting serum metabolite levels in amino acid, lipid and cofactor, and vitamin pathways. Our descriptive analysis provides results on the integration of whole-genome sequencing and deep phenotyping for clinical assessments in adults.
Project description:Understanding the role of DNA methylation often requires accurate assessment and comparison of these modifications in a genome-wide fashion. Sequencing-based DNA methylation profiling provides an unprecedented opportunity to map and compare complete DNA CpG methylomes. These include whole genome bisulfite sequencing (WGBS), Reduced-Representation Bisulfite-Sequencing (RRBS), and enrichment-based methods such as MeDIP-seq, MBD-seq, and MRE-seq. An investigator needs a method that is flexible with the quantity of input DNA, provides the appropriate balance among genomic CpG coverage, resolution, quantitative accuracy, and cost, and comes with robust bioinformatics software for analyzing the data. In this chapter, we describe four protocols that combine state-of-the-art experimental strategies with state-of-the-art computational algorithms to achieve this goal. We first introduce two experimental methods that are complementary to each other. MeDIP-seq, or methylation-dependent immunoprecipitation followed by sequencing, uses an anti-methylcytidine antibody to enrich for methylated DNA fragments, and uses massively parallel sequencing to reveal identity of enriched DNA. MRE-seq, or methylation-sensitive restriction enzyme digestion followed by sequencing, relies on a collection of restriction enzymes that recognize CpG containing sequence motifs, but only cut when the CpG is unmethylated. Digested DNA fragments enrich for unmethylated CpGs at their ends, and these CpGs are revealed by massively parallel sequencing. The two computational methods both implement advanced statistical algorithms that integrate MeDIP-seq and MRE-seq data. M&M is a statistical framework to detect differentially methylated regions between two samples. methylCRF is a machine learning framework that predicts CpG methylation levels at single CpG resolution, thus raising the resolution and coverage of MeDIP-seq and MRE-seq to a comparable level of WGBS, but only incurring a cost of less than 5% of WGBS. Together these methods form an effective, robust, and affordable platform for the investigation of genome-wide DNA methylation.
Project description:As the world is struggling to control the novel Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), there is an urgency to develop effective control measures. Essential information is encoded in the virus genome sequence with accurate and complete SARS-CoV-2 sequences essential for tracking the movement and evolution of the virus and for guiding efforts to develop vaccines and antiviral drugs. While there is unprecedented SARS-CoV-2 sequencing efforts globally, approximately 19 to 43 per cent of the genomes generated monthly are gapped, reducing their information content. The current study documents the genome gap frequencies and their positions in the currently available data and provides an alternative primer set and a sequencing scheme to help improve the quality and coverage of the genomes.
Project description:Extensive studies and analyses into the molecular features of severe acute respiratory syndrome related coronavirus 2 (SARS-CoV-2) have enhanced the surveillance and investigation of its clusters and transmission worldwide. The whole genome sequencing (WGS) approach is crucial in identifying the source of infection and transmission routes by monitoring the emergence of variants over time and through communities. Varying SARS-CoV-2 genomics capacity and capability levels have been established in public health laboratories across different Australian states and territories. Therefore, laboratories performing SARS-CoV-2 WGS for public health purposes are recommended to participate in an external proficiency testing program (PTP). This study describes the development of a SARS-CoV-2 WGS PTP. The PTP assessed the performance of laboratories while providing valuable insight into the current state of SARS-CoV-2 genomics in public health across Australia. Part 1 of the PTP contained eight simulated SARS-CoV-2 positive and negative specimens to assess laboratories' wet and dry laboratory capacity. Part 2 involved the analysis of a genomic dataset that consisted of a multi-FASTA file of 70 consensus genomes of SARS-CoV-2. Participating laboratories were required to (1) submit raw data for independent bioinformatics analysis, (2) analyse the data with their processes, and (3) answer relevant questions about the data. The performance of the laboratories was commendable, despite some variation in the reported results due to the different sequencing and bioinformatics approaches used by laboratories. The overall outcome is positive and demonstrates the critical role of the PTP in supporting the implementation and validation of SARS-CoV-2 WGS processes. The data derived from this PTP will contribute to the development of SARS-CoV-2 bioinformatic quality control (QC) and performance benchmarking for accreditation.
Project description:The evolution of SARS-CoV-2 within immunocompromised hosts who fail to clear the virus over many months has been proposed as a route to the development of Variants of Concern (VoCs). We present a case of an immunocompromised male patient with a prolonged SARS-CoV-2 infection. During hospitalisation, 7 weeks after first diagnosis, his condition worsened to require continuous ventilation support. Resolution of symptoms was observed after convalescent plasma therapy. Whole genome sequencing of the virus showed Pango lineage B.1.221. Between the first sample and the second from bronchoalveolar lavage fluid 7 weeks later, we identified eight mutations, including minor variants, which could be used to estimate the chronology of mutations. This suggests an elevated mutation rate, in-host accumulation of mutations and further evidence for sources of VoCs. Prolonged SARS-CoV-2 infections in immunocompromised hosts increase the likelihood of hospital stays and morbidity, and also pose an increased risk to global public health.
Project description:Hotel quarantine for international travelers has been used to prevent coronavirus disease spread into Australia. A quarantine hotel-associated community outbreak was detected in South Australia. Real-time genomic sequencing enabled rapid confirmation tracking the outbreak to a recently returned traveler and linked 2 cases of infection in travelers at the same facility.
Project description:We developed an extraction-free qPCR assay to identify Omicron BA.1 cases and verified lineage by sequencing. We find that BA.2 variants show almost twice the viral load (Ct) compared to both BA.1 as well as Delta variants.
Project description:Following the detection of the first imported case of COVID-19 in the northern sector of Ghana, we molecularly characterized and phylogenetically analysed sequences, including three complete genome sequences, of severe acute respiratory syndrome coronavirus 2 obtained from nine patients in Ghana. We performed high-throughput sequencing on nine samples that were found to have a high concentration of viral RNA. We also assessed the potential impact that long-distance transport of samples to testing centres may have on sequencing results. Here, two samples that were similar in terms of viral RNA concentration but were transported from sites that are over 400 km apart were analyzed. All sequences were compared to previous sequences from Ghana and representative sequences from regions where our patients had previously travelled. Three complete genome sequences and another nearly complete genome sequence with 95.6% coverage were obtained. Sequences with coverage in excess of 80% were found to belong to three lineages, namely A, B.1 and B.2. Our sequences clustered in two different clades, with the majority falling within a clade composed of sequences from sub-Saharan Africa. Less RNA fragmentation was seen in sample KATH23, which was collected 9 km from the testing site, than in sample TTH6, which was collected and transported over a distance of 400 km to the testing site. The clustering of several sequences from sub-Saharan Africa suggests regional circulation of the viruses in the subregion. Importantly, there may be a need to decentralize testing sites and build more capacity across Africa to boost the sequencing output of the subregion.
Project description:Whole genome sequencing provides rapid insight into key information about the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), such as virus typing and key mutation site, and this information is important for precise prevention, control and tracing of coronavirus disease 2019 (COVID-19) outbreak in conjunction with the epidemiological information of the case. Nanopore sequencing is widely used around the world for its short sample-to-result time, simple experimental operation and long sequencing reads. However, because nanopore sequencing is a relatively new sequencing technology, many researchers still have doubts about its accuracy. The combination of the newly launched nanopore sequencing Q20+ kit (LSK112) and flow cell R10.4 is a qualitative improvement over the accuracy of the previous kits. In this study, we firstly used LSK112 kit with flow cell R10.4 to sequence the SARS-CoV-2 whole genome, and summarized the sequencing results of the combination of LSK112 kit and flow cell R10.4 for the 1200bp amplicons of SARS-CoV-2. We found that the proportion of sequences with an accuracy of more than 99% reached 30.1%, and the average sequence accuracy reached 98.34%, while the results of the original combination of LSK109 kit and flow cell R9.4.1 were 0.61% and 96.52%, respectively. The mutation site analysis showed that it was completely consistent with the final consensus sequence of next generation sequencing (NGS). The results showed that the combination of LSK112 kit and flow cell R10.4 allowed rapid whole-genome sequencing of SARS-CoV-2 without the need for verification of NGS.