Dataset Information

Sequencing and characterizing the genome of Estrella lausannensis as an undergraduate project: training students and biological insights.


ABSTRACT: With the widespread availability of high-throughput sequencing technologies, sequencing projects have become pervasive in the molecular life sciences. The huge bulk of data generated daily must be analyzed further by biologists with skills in bioinformatics and by "embedded bioinformaticians," i.e., bioinformaticians integrated in wet lab research groups. Thus, students interested in molecular life sciences must be trained in the main steps of genomics: sequencing, assembly, annotation and analysis. To reach that goal, a practical course has been set up for master's students at the University of Lausanne: the "Sequence a genome" class. At the beginning of the academic year, a few bacterial species whose genomes are unknown are provided to the students, who sequence and assemble the genome(s) and perform manual annotation. Here, we report the progress of the first class from September 2010 to June 2011 and the results obtained by seven master's students who specifically assembled and annotated the genome of Estrella lausannensis, an obligate intracellular bacterium related to Chlamydia. The draft genome of Estrella is composed of 29 scaffolds encompassing 2,819,825 bp that encode 2233 putative proteins. Estrella also possesses a 9136 bp plasmid that carries 14 genes, among which we found an integrase and a toxin/antitoxin module. Like all other members of the Chlamydiales order, Estrella possesses a highly conserved type III secretion system, considered a key virulence factor. The annotation of the Estrella genome also allowed the characterization of the metabolic abilities of this strictly intracellular bacterium. Altogether, the students provided the scientific community with the Estrella genome sequence and a preliminary understanding of the biology of this recently discovered bacterial genus, while learning to use cutting-edge technologies for sequencing and to perform bioinformatics analyses.

SUBMITTER: Bertelli C 

PROVIDER: S-EPMC4333871 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4255356 | biostudies-literature
| S-EPMC8117324 | biostudies-literature
| S-EPMC6754024 | biostudies-literature
| S-EPMC4690566 | biostudies-literature
| S-EPMC7706025 | biostudies-literature
| S-EPMC7564944 | biostudies-literature
| S-EPMC7547838 | biostudies-literature
| S-EPMC3439555 | biostudies-literature
| S-EPMC10663973 | biostudies-literature
| S-EPMC6326404 | biostudies-other