Project description:The data contained in this experiment correspond to Illumina-based whole genome shotgun sequencing of EN-TEX donor ENCDO793LXB, a 53 year old female. There are two libraries: PCR-based (9-cycle, ENCLB388OQF) and PCR-free (ENCLB358DLU), both sequenced on multiple lanes to a combined depth of ~ 60X in PE125 format. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODE_Data_Use_Policy_for_External_Users_03-07-14.pdf
Project description:The data contained in this experiment correspond to Illumina-based whole genome shotgun sequencing of EN-TEX donor ENCDO271OUW, a 51 year old female. There are two libraries: PCR-based (9-cycle, ENCLB659IQW) and PCR-free (ENCLB898JVB), both sequenced on multiple lanes to a combined depth of ~ 60X in PE125 format. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODE_Data_Use_Policy_for_External_Users_03-07-14.pdf
Project description:Whole-genome sequencing is an important way to understand the genetic information, gene function, biological characteristics, and living mechanisms of organisms. There is no difficulty to have mega-level genomes sequenced at present. However, we encountered a hard-to-sequence genome of Pseudomonas aeruginosa phage PaP1. The shotgun sequencing method failed to dissect this genome. After insisting for 10 years and going over 3 generations of sequencing techniques, we successfully dissected the PaP1 genome with 91,715 bp in length. Single-molecule sequencing revealed that this genome contains lots of modified bases, including 51 N6-methyladenines (m6A) and 152 N4-methylcytosines (m4C). At the same time, further investigations revealed a novel immune mechanism of bacteria, by which the host bacteria can recognize and repel the modified bases containing inserts in large scale, and this led to the failure of the shotgun method in PaP1 genome sequencing. Strategy of resolving this problem is use of non-library dependent sequencing techniques or use of the nfi- mutant of E. coli DH5M-NM-1 as the host bacteria to construct the shotgun library. In conclusion, we unlock the mystery of phage PaP1 genome hard to be sequenced, and discover a new mechanism of bacterial immunity in present study. Methylation profiling of Pseudomonas aeruginosa phage PaP1 using kinetic data generated by single-molecule, real-time (SMRT) sequencing on the PacBio RS.
Project description:The data contained in this experiment correspond to Illumina-based whole genome shotgun sequencing of EN-TEX donor ENCDO451RUA, a 54 year old male. There are two libraries: PCR-based (9-cycle, ENCLB694UOK) and PCR-free (ENCLB257VPS), both sequenced on multiple lanes to a combined depth of ~ 60X in PE125 format. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODE_Data_Use_Policy_for_External_Users_03-07-14.pdf
Project description:The data contained in this experiment correspond to Illumina-based whole genome shotgun sequencing of EN-TEX donor ENCDO845WKR, a 37 year old male. There are two libraries: PCR-based (9-cycle, ENCLB436FSE) and PCR-free (ENCLB480CRB), both sequenced on multiple lanes to a combined depth of ~ 60X in PE125 format. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODE_Data_Use_Policy_for_External_Users_03-07-14.pdf
Project description:The draft genome of L. sativa (lettuce) cv. Tizian was sequenced in two Illumina sequencing runs, mate pair and shotgun. This entry contains the RAW sequencing data.
Project description:FK506 (tacrolimus) is a valuable immunosuppressant produced by several Streptomyces strains. In the genome of the wild type producer Streptomyces tsukubaensis NRRL18488 FK506 biosynthesis is encoded by a gene cluster that spans 83.5 kilobases. A whole transcriptome differential shotgun sequencing of S. tsukubaensis was performed to analyze transcription at two different time points; before and during active FK506 production. In total 8,914 transcription start sites were identified in either condition, which enabled precise determination of the 5'-UTR length of the corresponding transcripts as well as the identification of two consensus sequence motifs in the promoter regions. The transcription start sites of all gene operons within the FK506 cluster were identified, including three examples of leaderless RNA transcripts. These data provide detailed insight into the transcription of the FK506 biosynthetic gene cluster and supports future regulatory studies and genetic manipulations.
Project description:We leveraged massively parallel sequencing approach to comprehensively characterize the spectrum of somatic mutations and genomic rearrangements in two intestinal-type gastric adenocarcinomas from patients with and without active Helicobacter pylori infections. The tumours exhibited distinct patterns of genomic changes with more than 16,000 somatic substitutions on average, focal amplifications and rearrangements in the non-active infected tumour and a 7-fold enrichment of micro-deletions in the infected tumour. Paired-end sequences from large insert libraries revealed the structure and origins of large amplicons, including one involving the oncogene KRAS. The mutational frequencies of the tumours revealed patterns of H. pylori infection and mutagenesis and a unique exome signature, providing insights into mechanisms that define the mutational landscape of gastric cancer. For the tumour with active infection, we also reconstructed the genome of the pathogenic H. pylori strain from the raw sequence reads, demonstrating the power of whole-genome shotgun sequencing for simultaneously characterizing the tumour and its associated carcinogen genome.