Project description:Comparative genomic analysis of important signaling pathways in Caenorhabditis briggsae and Caenorhabditis elegans reveals both conserved features and differences. To build a framework to address the significance of these features, we determined the C. briggsae embryonic cell lineage using the tools StarryNite and AceTree. We traced both cell divisions and cell positions for all cells through all but the last round of cell division, and for selected cells through the final round. We found the lineage to be remarkably similar to that of C. elegans. Not only did the founder cells give rise to similar numbers of progeny, but the relative cell division timing and positions were also largely maintained. These lineage similarities appear to give rise to similar cell fates, as judged both by the positions of lineally equivalent cells and by the patterns of cell deaths in both species. However, some reproducible differences were seen; for example, the P4 cell cycle is more than 40% longer in C. briggsae than in C. elegans (p<0.01). The extensive conservation of embryonic development between such divergent species suggests that the substantial evolutionary distance between them has not altered these early developmental cellular events, although the developmental defects of trans-species hybrids suggest that the details of the underlying molecular pathways have diverged sufficiently that they are no longer interchangeable.
Project description:Accurate identification of synteny blocks is an important step in comparative genomics towards the understanding of genome architecture and expression. Most computer programs developed in the last decade for identifying synteny blocks have limitations. To address these limitations, we recently developed a robust program called OrthoCluster, and an online database OrthoClusterDB. In this work, we have demonstrated the application of OrthoCluster in identifying synteny blocks between the genomes of Caenorhabditis elegans and Caenorhabditis briggsae, two closely related hermaphrodite nematodes. Initial identification and analysis of synteny blocks using OrthoCluster enabled us to systematically improve the genome annotation of C. elegans and C. briggsae, identifying 52 potential novel genes in C. elegans, 582 in C. briggsae, and 949 novel orthologous relationships between these two species. Using the improved annotation, we have detected 3,058 perfect synteny blocks that contain no mismatches between C. elegans and C. briggsae. Among these synteny blocks, the majority are mapped to homologous chromosomes, as previously reported. The largest perfect synteny block contains 42 genes, which spans 201.2 kb in Chromosome V of C. elegans. On average, perfect synteny blocks span 18.8 kb in length. When some mismatches (interruptions) are allowed, synteny blocks ("imperfect synteny blocks") that are much larger in size are identified. We have shown that the majority (80%) of the C. elegans and C. briggsae genomes are covered by imperfect synteny blocks. The largest imperfect synteny block spans 6.14 Mb in Chromosome X of C. elegans and there are 11 synteny blocks that are larger than 1 Mb in size. On average, imperfect synteny blocks span 63.6 kb in length, larger than previously reported. We have demonstrated that OrthoCluster can be used to accurately identify synteny blocks and have found that synteny blocks between C. elegans and C. briggsae are almost threefold larger than previously identified.
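To make the notion of a "perfect synteny block" concrete, the following is a minimal sketch of one way to find runs of genes that stay adjacent in both genomes with no interruptions. It is not OrthoCluster's algorithm: the function name `perfect_blocks`, its parameters, and the toy gene identifiers are hypothetical, and the sketch assumes a single chromosome per genome, one-to-one orthologs, and no strand information.

```python
def perfect_blocks(order_a, order_b, orthologs, min_genes=2):
    """Return maximal runs of genes from order_a whose orthologs are
    adjacent in order_b, in either the same or the reversed order.

    order_a, order_b: gene identifiers in chromosomal order (hypothetical input).
    orthologs: dict mapping a gene in A to its single ortholog in B.
    A gene without an ortholog counts as a mismatch and ends the current run.
    """
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    blocks = []
    current = []      # genes of the run being built
    direction = 0     # +1 collinear, -1 inverted, 0 not yet known
    prev_pos = None   # position in B of the last gene added

    def flush():
        # Keep the run only if it is long enough to count as a block.
        if len(current) >= min_genes:
            blocks.append(list(current))
        current.clear()

    for gene in order_a:
        partner = orthologs.get(gene)
        p = pos_b.get(partner) if partner is not None else None
        if p is None:
            flush()
            direction, prev_pos = 0, None
            continue
        if current:
            step = p - prev_pos
            if step in (1, -1) and direction in (0, step):
                current.append(gene)
                direction, prev_pos = step, p
                continue
            flush()
        # Start a new run at this gene.
        current.append(gene)
        direction, prev_pos = 0, p

    flush()
    return blocks


# Toy example: a4 has no ortholog, and a5-a6 are inverted in genome B.
order_a = ["a1", "a2", "a3", "a4", "a5", "a6"]
order_b = ["b1", "b2", "b3", "b6", "b5", "bX"]
orthologs = {"a1": "b1", "a2": "b2", "a3": "b3", "a5": "b5", "a6": "b6"}
blocks = perfect_blocks(order_a, order_b, orthologs)
print(blocks)
```

Allowing a bounded number of interruptions before ending a run would give something closer to the "imperfect" blocks described above, at the cost of extra bookkeeping that this sketch omits.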
Project description:We investigated the possibility of positive selection acting on members of the putative seven-pass chemoreceptor superfamily in Caenorhabditis elegans, which comprises approximately 1,300 genes encoding seven-pass G protein-coupled receptors (GPCRs). Using a maximum-likelihood approach, we conducted statistical tests for evidence of codon sites where the ratio of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site (d(N)/d(S)) was >1. Evidence for positive selection was found only for the srz family, about which virtually nothing specific is known. We extended the annotation of the srz gene family, establishing gene models for 60 srz genes in C. elegans and 28 srz genes in Caenorhabditis briggsae. d(N)/d(S) ratios varied dramatically in different regions of the SRZ proteins, peaking in predicted extracellular regions. These regions included 23 sites where evidence of positive selection was highly significant, corresponding remarkably well with regions implicated in ligand binding in other GPCR family members. We interpret these results as indicating that the srz family is under positive selection, probably driven by ligand binding.
Project description:We have used the genetic resources of Arabidopsis thaliana to generate mutant lines that have reactivated TE expression. We used these lines with long-read Oxford Nanopore sequencing technology to capture Transposable Element (TE) mRNAs for TE transcript annotation.
Project description:We explored changes at gene-level or transcript-level in embryonic stem cells, before and after in vitro differentiation with retinoic acid. RNA was sequenced both via Illumina short reads, and with Oxford Nanopore Technology with cDNA and direct RNA sequencing.
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads, which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). The data for this experiment contain sequence files generated on the Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are FASTQ files generated with Nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are FASTQ files generated on the Illumina platform. In this study, we compared the efficiency of the two methods in identifying transposon insertion sites.
Project description:Understanding the evolutionary processes responsible for shaping genetic variation within and between species requires separating the effects of mutation and selection. Differences between the patterns of genetic variation observed in nature and when mutations are allowed to accumulate in the relative absence of selection can reveal biases imposed by selection. We characterize the genetic variation at dinucleotide microsatellite repeats in four sets of 250-generation mutation accumulation (MA) lines, two in the species Caenorhabditis briggsae and two in Caenorhabditis elegans, and compare the mutational variation with the standing variation in those species. We also compare the mutational properties of microsatellites with the cumulative effects of mutations on fitness in the same lines. Integrated over the whole genome, we infer that the mutation rate of C. briggsae is about twice that of C. elegans, consistent with the cumulative mutational effects on fitness. The mutational spectrum (ratio of insertions to deletions) differs between repeat types and, in some cases, between species. The per-locus mutation rate is significantly positively correlated with the standing genetic variation at the same locus in both species, providing justification for the common practice of using the standing genetic variance as a surrogate for the mutation rate.
Project description:Patterns of similarity between genomes of related species reflect the distribution of selective constraint within DNA. We analyzed alignments of 142 orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae and found a mosaic pattern with regions of high similarity (phylogenetic footprints) interspersed with non-alignable sequences. Footprints cover approximately 20% of intergenic regions, often occur in clumps and are rare within 5' UTRs but common within 3' UTRs. The footprints have a higher ratio of transitions to transversions than expected at random and a higher GC content than the rest of the intergenic region. The number of footprints and the GC content of footprints within an intergenic region are higher when genes are oriented so that their 5' ends form the boundaries of the intergenic region. Overall, the patterns and characteristics identified here, along with other comparative and experimental studies, suggest that many footprints have a regulatory function, although other types of function are also possible. These conclusions may be quite general across eukaryotes, and the characteristics of conserved regulatory elements determined from genomic comparisons can be useful in prediction of regulation sites within individual DNA sequences.
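The two footprint statistics mentioned above, the transition-to-transversion ratio and GC content, can be illustrated with a short sketch. This is not the authors' pipeline: the function names `ts_tv_counts` and `gc_content` and the toy sequences are hypothetical, and the sketch assumes a pairwise alignment supplied as two equal-length strings with `-` marking gaps.

```python
# Purine-purine (A<->G) and pyrimidine-pyrimidine (C<->T) changes are transitions;
# every other mismatch between two unambiguous bases is a transversion.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}
BASES = set("ACGT")


def ts_tv_counts(aligned_a, aligned_b):
    """Count transitions and transversions between two aligned sequences.

    Columns containing a gap or an ambiguous base are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    ts = tv = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a not in BASES or b not in BASES or a == b:
            continue
        if (a, b) in TRANSITIONS:
            ts += 1
        else:
            tv += 1
    return ts, tv


def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of a sequence."""
    bases = [c for c in seq.upper() if c in BASES]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(c in "GC" for c in bases) / len(bases)


# Toy alignment (made-up sequences, not data from the study).
ts, tv = ts_tv_counts("AACCGGTT", "GACTGCTA")
print(ts, tv, gc_content("AACCGGTT"))
```

Under a model in which all substitutions are equally likely there are twice as many possible transversions as transitions, which is the kind of baseline against which an excess of transitions in footprints would be judged.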