Project description:Background: Transposable elements (TEs) are a major component of metazoan genomes and are associated with a variety of mechanisms that shape genome architecture and evolution. Despite the ever-growing number of insect genomes sequenced to date, our understanding of the diversity and evolution of insect TEs remains poor. Results: Here, we present a standardized characterization and an order-level comparison of arthropod TE repertoires, encompassing 62 insect and 11 outgroup species. The insect TE repertoire contains TEs of almost every class previously described, and in some cases even TEs previously reported only from vertebrates and plants. Additionally, we identified a large fraction of unclassifiable TEs. We found high variation in TE content, ranging from less than 6% in the Antarctic midge (Diptera), the honey bee and the turnip sawfly (Hymenoptera) to more than 58% in the malaria mosquito (Diptera) and the migratory locust (Orthoptera), and a possible relationship between the content and diversity of TEs and the genome size. Conclusion: While most insect orders exhibit a characteristic TE composition, we also observed intraordinal differences, e.g., in Diptera, Hymenoptera, and Hemiptera. Our findings shed light on common patterns and reveal lineage-specific differences in content and evolution of TEs in insects. We anticipate our study to provide the basis for future comparative research on the insect TE repertoire.
Project description:BACKGROUND: Transposable elements (TEs) have the potential to impact genome structure, function and evolution in profound ways. To understand the contribution of TEs to the Heliconius melpomene genome, we queried the H. melpomene draft sequence to identify repetitive sequences. RESULTS: We determined that TEs comprise ~25% of the genome. The predominant class of TEs (~12% of the genome) was the non-long terminal repeat (non-LTR) retrotransposons, including a novel SINE family. However, this was only slightly higher than the content derived from DNA transposons, which are diverse, with several families having mobilized in the recent past. Compared to the only other well-studied lepidopteran genome, Bombyx mori, H. melpomene exhibits a higher DNA transposon content and a distinct repertoire of retrotransposons. We also found that H. melpomene exhibits a high rate of TE turnover, with few older elements accumulating in the genome. CONCLUSIONS: Our analysis represents the first complete, de novo characterization of TE content in a butterfly genome and suggests that, while TEs are able to invade and multiply, TEs have an overall deleterious effect and/or that maintaining a small genome is advantageous. Our results also hint that analysis of additional lepidopteran genomes will reveal substantial TE diversity within the group.
Project description:Transposable elements (TEs) are ubiquitous in arthropods. However, analyses of large-scale and long-term coevolution between TEs and host genomes remain scarce in arthropods. Here, we chose 14 representative Arthropoda species from eight orders spanning more than 500 million years of evolution. By developing an unbiased TE annotation pipeline, we obtained 87 to 2266 TE reference sequences per species, which is a considerable improvement compared to the reference TEs previously annotated in Repbase. We find that TE loads vary among species and were previously underestimated. Highly species- and time-specific expansions and contractions, together with intraspecific sequence diversification, are the leading drivers of long terminal repeat (LTR) dynamics in Lepidoptera. Terminal inverted repeats (TIRs) proliferated substantially in five species with large genomes. A phylogenetic comparison reveals that the loads of multiple TE subfamilies are positively correlated with genome sizes. We also identified a few horizontally transferred TE candidates across nine species. In addition, we set up the Arthropod Transposable Elements database (ArTEdb) to provide TE references and annotations. Collectively, our results provide high-quality TE references and uncover that TE loads and expansion histories vary greatly among arthropods, which implies that TEs are an important driving force shaping the evolution of genomes through gain and loss.
Project description:Maize was originally domesticated in a tropical environment but is now widely cultivated at temperate latitudes. Temperate and tropical maize populations have diverged both genotypically and phenotypically. Tropical maize lines grown in temperate environments usually exhibit delayed flowering, pollination, and seed set, which reduces their grain yield relative to temperate-adapted maize lines. One potential mechanism by which temperate maize may have adapted to a new environment is novel transposable element insertions, which can influence gene regulation. Recent advances in sequencing technology have made it possible to study variation in transposon content and insertion location in large sets of maize lines. In total, 274,408 non-redundant TEs (NRTEs) were identified using resequencing data generated from 83 maize inbred lines. The locations of DNA TEs and copia-superfamily retrotransposons showed significant positive correlations with gene density and genetic recombination rates, whereas gypsy-superfamily retrotransposons showed a negative correlation with these two parameters. Compared to tropical maize, temperate maize had fewer unique NRTEs but higher insertion frequency, lower background recombination rates, and higher linkage disequilibrium, with more NRTEs close to flowering and stress-related genes in the genome. Association mapping demonstrated that the presence/absence of 48 NRTEs was associated with flowering time and that expression of neighboring genes differed between haplotypes where an NRTE was present or absent. This study suggests that NRTEs may have played an important role in creating the variation in gene regulation that enabled the rapid adaptation of maize to diverse environments.
Project description:Sequencing the giga-genomes of several pine species has enabled comparative genomic analyses of these outcrossing tree species. Previous studies have revealed the wide distribution and extraordinary diversity of transposable elements (TEs) that occupy the large intergenic spaces in conifer genomes. In this study, we analyzed the distribution of TEs in gene regions of the assembled genomes of Pinus taeda and Pinus lambertiana using high-performance computing resources. The quality of draft genomes and the genome annotation have significant consequences for the investigation of TEs and these aspects are discussed. Several TE families frequently inserted into genes or their flanks were identified in both species' genomes. Potentially important sequence motifs were identified in TEs that could bind additional regulatory factors, promoting gene network formation with faster or enhanced transcription initiation. Node genes that contain many TEs were observed in multiple potential transposable element-associated networks. This study demonstrated the increased accumulation of TEs in the introns of stress-responsive genes of pines and suggests the possibility of rewiring them into responsive networks and sub-networks interconnected with node genes containing multiple TEs. Many such regulatory influences could lead to the adaptive environmental response clines that are characteristic of naturally spread pine populations.
Project description:Transposable elements, as major components of most eukaryotic organisms' genomes, define their structural organization and plasticity. They supply host genomes with functional elements; for example, binding sites of the pleiotropic master transcription factor p53 have been identified in LINE1, Alu and LTR repeats in the human genome. Similarly, in this report we reveal the role of the zebrafish (Danio rerio) EnSpmN6_DR non-autonomous DNA transposon in shaping the repertoire of p53 target genes. In several instances, the multiple copies of EnSpmN6_DR and their embedded p53 responsive elements drive p53-dependent transcriptional modulation of the adjacent gene, whose human ortholog has frequently been annotated as a p53 target. These transposons predominantly define a set of target genes whose human orthologs contribute to neuronal morphogenesis, axonogenesis, synaptic transmission and the regulation of programmed cell death. Consistent with these biological functions, the orthologs of the EnSpmN6_DR-colonized loci are enriched for genes expressed in the amygdala, the hippocampus and the brain cortex. Our data pinpoint a remarkable example of convergent evolution: the exaptation of lineage-specific transposons to shape p53-regulated neuronal morphogenesis-related pathways in both a hominid and a teleost fish.
Project description:Transposable elements (TEs) are widespread across eukaryotic genomes, yet their content varies widely between different species. Factors shaping the diversity of TEs are poorly understood. Understanding the evolution of TEs is difficult because their sequences diversify rapidly and TEs are often transferred through non-conventional means such as horizontal gene transfer. We developed a method to track TE evolution using network analysis to visualise TE sequence and TE content across different genomes. We illustrate our method by first using a monopartite network to study the sequence evolution of Tc1/mariner elements across focal species. We identify a connection between two subfamilies associated with convergent acquisition of a domain from a protein-coding gene. Second, we use a bipartite network to study how TE content across species is shaped by epigenetic silencing mechanisms. We show that the presence of Piwi-interacting RNAs is associated with differences in network topology after controlling for phylogenetic effects. Together, our method demonstrates how a network-based approach can identify hitherto unknown properties of TE evolution across species.
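The bipartite representation mentioned in the preceding description can be illustrated with a minimal sketch. It is not the authors' method: the function names, the presence threshold (copy number above zero) and the toy species and family labels are all assumptions made for illustration. Species and TE families form the two node sets, and a simple projection weights each pair of species by the number of TE families they share.

```python
from collections import defaultdict
from itertools import combinations


def build_bipartite(content):
    """Turn {species: {te_family: copy_number}} into species-family edges.

    An edge is kept only when the copy number is above zero
    (an assumed presence threshold, not taken from the source).
    """
    return [
        (species, family)
        for species, families in content.items()
        for family, copies in families.items()
        if copies > 0
    ]


def project_species(edges):
    """Weight each species pair by the number of TE families they share."""
    by_family = defaultdict(set)
    for species, family in edges:
        by_family[family].add(species)
    weights = defaultdict(int)
    for members in by_family.values():
        for a, b in combinations(sorted(members), 2):
            weights[(a, b)] += 1
    return dict(weights)


# Hypothetical toy data, for illustration only.
toy_content = {
    "sp_A": {"Tc1": 10, "Gypsy": 3},
    "sp_B": {"Tc1": 2, "Gypsy": 0},
    "sp_C": {"Tc1": 1, "Gypsy": 7},
}
toy_edges = build_bipartite(toy_content)
toy_projection = project_species(toy_edges)
```

In this toy case `sp_A` and `sp_C` share both families, so their projected edge carries the highest weight. A real analysis would also need the phylogenetic control that the description mentions, which this sketch omits.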
Project description:Transposable elements are a major component of most eukaryotic genomes. Here, we present a new approach which allows us to study patterns of natural selection in the evolution of transposable elements over short time scales. The method uses the alignment of all elements with intact gag/pol genes of a transposable element family from a single genome. We predict that the ratio of nonsynonymous to synonymous variants in the alignment should decrease as a function of the frequency of the variants, because elements with nonsynonymous variants that reduce transposition will have fewer progeny. We apply our method to Sirevirus long-terminal repeat retrotransposons that are abundant in maize and other plant species and show that the ratio of nonsynonymous to synonymous variants declines as variant frequency increases, indicating that negative selection is acting strongly on the Sirevirus genome. The asymptotic value of this ratio suggests that at least 85% of all nonsynonymous mutations in the transposable element reduce transposition. Crucially, these patterns in the ratio are only predicted to occur if the gene products from a particular transposable element insertion preferentially promote the transposition of the same insertion. Overall, by using large numbers of intact elements, this study sheds new light on the selective processes that act on transposable elements.
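The core quantity in the preceding description, the ratio of nonsynonymous to synonymous variants as a function of variant frequency, can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the published method: it assumes an in-frame, gap-free alignment, treats the most common codon in each column as the reference, compares whole codons rather than individual sites, and uses raw counts without normalizing by the number of synonymous and nonsynonymous sites. All function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_variants(seqs):
    """Return (frequency, is_nonsynonymous) for each non-majority codon.

    Assumes all sequences have equal length and start in frame.
    Codons with ambiguous or gap characters are skipped.
    """
    n = len(seqs)
    variants = []
    for pos in range(0, len(seqs[0]) - 2, 3):
        counts = Counter(s[pos:pos + 3] for s in seqs)
        consensus = counts.most_common(1)[0][0]
        if consensus not in CODON_TABLE:
            continue
        for codon, count in counts.items():
            if codon == consensus or codon not in CODON_TABLE:
                continue
            nonsyn = CODON_TABLE[codon] != CODON_TABLE[consensus]
            variants.append((count / n, nonsyn))
    return variants


def ratio_by_frequency(variants, bin_edges):
    """Nonsynonymous/synonymous count ratio per frequency bin (lo, hi].

    Returns None for bins that contain no synonymous variants.
    """
    ratios = []
    for lo, hi in zip(bin_edges, bin_edges[1:]):
        in_bin = [nonsyn for freq, nonsyn in variants if lo < freq <= hi]
        n_nonsyn = sum(in_bin)
        n_syn = len(in_bin) - n_nonsyn
        ratios.append(n_nonsyn / n_syn if n_syn else None)
    return ratios


# Hypothetical toy alignment, for illustration only.
toy_seqs = ["ATGAAA", "ATGAAA", "ATGAAG", "CTGAAA"]
toy_variants = codon_variants(toy_seqs)
toy_ratios = ratio_by_frequency(toy_variants, [0.0, 0.5, 1.0])
```

Under the prediction stated in the description, the per-bin ratio from a real alignment would be expected to fall as the bins move toward higher variant frequency.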