Project description:Since the initial publication of the trypanosomatid genomes, curation has been ongoing. Here we apply the technique of ribosome profiling to Trypanosoma brucei, identifying 223 new coding regions by virtue of ribosome occupancy in the corresponding transcripts. A small number of these putative genes correspond to extra copies of previously annotated genes but 85% are novel. The median size of these novels CDSs is small (74 aa) indicating that past annotation work has excelled at detecting large CDSs. Of the unique CDSs discovered here, over half have candidate orthologues in other trypanosomatid genomes, most of which were not yet annotated as genes. Still, approximately one-third of the new CDSs were found only in T. brucei subspecies. When combined with RNA-seq and spliced leader mapping, we were able to definitively revise the start sites for 430 CDSs as compared to the current gene models. Such data also allowed us to use a structured approach to eliminate 701 putative genes as protein-coding. Finally, the data pointed to several regions of the genome that had sequence errors that altered coding region boundaries.
Project description:Phytomonas are a large and diverse sub-group of plant-infecting trypanosomatids that are relatively poorly understood. Little is known of their biology or how they have adapted to life inside plants. This study sequenced the genome of the Cassava (Manihot esculenta) infecting species Phytomonas francai to provide additional genome resources and new insight into the biology of this poorly understood group of organisms.
Project description:Since the initial publication of the trypanosomatid genomes, curation has been ongoing. Here we apply the technique of ribosome profiling to Trypanosoma brucei, identifying 223 new coding regions by virtue of ribosome occupancy in the corresponding transcripts. A small number of these putative genes correspond to extra copies of previously annotated genes but 85% are novel. The median size of these novels CDSs is small (74 aa) indicating that past annotation work has excelled at detecting large CDSs. Of the unique CDSs discovered here, over half have candidate orthologues in other trypanosomatid genomes, most of which were not yet annotated as genes. Still, approximately one-third of the new CDSs were found only in T. brucei subspecies. When combined with RNA-seq and spliced leader mapping, we were able to definitively revise the start sites for 430 CDSs as compared to the current gene models. Such data also allowed us to use a structured approach to eliminate 701 putative genes as protein-coding. Finally, the data pointed to several regions of the genome that had sequence errors that altered coding region boundaries. Ribosome profiling and mRNA libraries were constructed in triplicate from in vitro PCF and in vivo BF lifestages of theT. brucei Treu927 and in vitro T. brucei Lister427, to evaluate role of translational gene regulation
Project description:Abstract: The Kinetoplastida (Euglenozoa) are unicellular flagellates that include the trypanosomatid parasites, most notably Trypanosoma brucei, T.cruzi and Leishmania spp. These organisms cause substantial mortality and morbidity in humans and their livestock worldwide as the causative agents of African sleeping sickness, Chagas disease and leishmaniasis respectively. Draft genome sequences are available for several species of both Trypanosoma and Leishmania. Bodo saltans is a free-living heterotroph found worldwide in freshwater and marine habitats, and it is among the closest bodonid relatives of the trypanosomatids. The purpose of a B. saltans genome sequence is to provide an 'out-group' for comparative genomic analysis of the trypanosomatid parasites. It will provide a model of the ancestral trypanosomatid to distinguish those derived parts of the parasite genomes (i.e., unique trypanosomatid adaptations) from those which are a legacy of the free-living ancestor. To aid annotation of the B.saltans genome sequence, total genomic RNA was extracted on four occasions from the total cellular mass of 160ml of B.saltans cell culture, for the purposes of transcription profiling by high throughput sequencing. Cells were unmodified. B.saltans cells were grown in water at 4oC. Total genomic RNA was extracted from a cell pellet using TRIZOL reagent and ethanol precipitated. Poly A+ mRNA was purified from total RNA using oligo dT dyna bead selection and libraries were created using the Illumina RNA-seq protocol. The samples were sequenced on an Illumina HiSeq 2000. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Project description:Here, we report the genome sequence of the cassava (Manihot esculenta) latex parasite Phytomonas françai P. françai infection is linked with the yield-loss disease "chochamento de raizes" (empty roots) in the Unha variety of cassava, a disease characterized by poor root development and chlorosis of the leaves.
Project description:We performed rigorous ribosome profiling in genome-edited cell lines with human MTOR point mutations and Torin1 treated cell lines. By obtaining about 10 million bases of sequence cells, we generated genome-wide translatome and transcriptome data of genome-edited MTOR(G4448A)-p.C1483Y and Torin1 treated cell lines
Project description:Elizabethkingia meningoseptica EM1 was isolated from a whole-blood sample from a female patient. The draft genome sequence of Em1 contains 4,038,467 bp, with a G+C content of 36.37%. A preliminary genome analysis showed that Em1 contains genes conferring resistance to β-lactams. The bacterium has hemolysin genes and a set of genes involved in heme uptake and heme utilization, showing its potential to cause bloodstream infections. A clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein (CRISPR/Cas) system was identified. Average nucleotide identity (ANI) analysis assigned the bacterium to the species E. meningoseptica (ANI, >95%). The annotated genome sequence provides the genetic basis for revealing its role as a pathogen in humans.
Project description:We report the application of illumina sequencing technology for high-throughput profiling of histone acetyltransferase Mof in mouse embryonic stem cells. By obtaining over four billion bases of sequence from chromatin immunoprecipitated DNA, we generated genome-wide chromatin-state maps of mouse embryonic stem cells. We find that Mof widely binds to mouse genome and mark genes that are expressed, poised for expression, or stably repressed. Mof binds to both gene promoters and coding regions, exhibiting two modes of binding. This study provides a framework for understanding the function of Mof in regulating ESC core transcription network. Examination of Mof binding patterns in ES cells