Dataset Information

A Map of 3′ DNA Transduction Variants Mediated by Non-LTR Retroelements on 3202 Human Genomes


ABSTRACT:

Simple Summary

During the transcription of non-LTR retroelements, such as LINEs and SVAs, RNA polymerase may ignore the transcriptional termination signal at the 3′ end. Transcription then terminates at another downstream signal, creating a chimeric readthrough transcript. Termed 3′ DNA transduction, this process copies the 3′ flanking region along with the retroelement sequence to a new genomic locus, which influences the structure of the genome and occasionally has a functional impact. To discover putative non-LTR retroelement-driven 3′ DNA transductions, we analyzed the new dataset (n = 3202) of the 1000 Genomes Project. Our results indicate that 3′ transduction driven by non-LTR retroelements is a relatively common phenomenon in the human genome and that its discovery deserves more attention in genome projects.

Abstract

Mobile elements are among the major structural constituents of the human genome and comprise more than half of it; of these, Alu, L1, and SVA elements are still active and continue to generate new copies. One of the major characteristics of L1 and SVA elements is their ability to co-mobilize adjacent downstream sequences to new loci in a process called 3′ DNA transduction. Transductions influence the structure and content of the genome in different ways, such as increasing genome variation, exon shuffling, and gene duplication. Moreover, given their mutagenic potential, 3′ transductions are often involved in tumorigenesis or in the development of some diseases. In this study, we analyzed 3202 genomes sequenced at high coverage by the New York Genome Center to catalog and characterize putative 3′ transduced segments mediated by L1s and SVAs. Here, we present a genome-wide map of inter- and intrachromosomal 3′ transduction variants, including their genomic and functional location, length, progenitor location, and allelic frequency across 26 populations. In total, we identified 7103 polymorphic L1s and 3040 polymorphic SVAs. Of these, 268 and 162 variants were annotated as high-confidence L1 and SVA 3′ transductions, respectively, with lengths that ranged from 7 to 997 nucleotides. Specific loci on chromosomes X, 6, 7, and 6_GL000253v2_alt, among others, acted as master L1s and SVAs that gave rise to more transductions than other loci. Together, our results demonstrate the dynamic nature of transduction events within the genome and among individuals and their contribution to the structural variation of the human genome.
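
The counts quoted in the abstract can be turned into rough proportions. The following is a minimal sketch in Python, not part of the original record: the variable and function names are hypothetical, and the only inputs are the four counts stated above (7103 polymorphic L1s, 3040 polymorphic SVAs, 268 and 162 high-confidence transductions).

```python
# Counts taken from the abstract; names below are illustrative assumptions.
POLYMORPHIC = {"L1": 7103, "SVA": 3040}
HIGH_CONFIDENCE_TRANSDUCTIONS = {"L1": 268, "SVA": 162}


def transduction_fraction(family: str) -> float:
    """Share of a family's polymorphic insertions annotated as
    high-confidence 3' transductions."""
    return HIGH_CONFIDENCE_TRANSDUCTIONS[family] / POLYMORPHIC[family]


for family in POLYMORPHIC:
    # Prints "L1: 3.8%" and "SVA: 5.3%"
    print(f"{family}: {transduction_fraction(family):.1%}")
```

Under these numbers, roughly 3.8% of polymorphic L1s and 5.3% of polymorphic SVAs carry a high-confidence 3′ transduction.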

SUBMITTER: Halabian R 

PROVIDER: S-EPMC9311842 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6298018 | biostudies-literature
| S-EPMC2996953 | biostudies-literature
| S-EPMC2790886 | biostudies-literature
| S-EPMC2774666 | biostudies-literature
| S-EPMC4131045 | biostudies-literature
| S-EPMC4380235 | biostudies-literature
| S-EPMC3636483 | biostudies-literature
| S-EPMC10576055 | biostudies-literature
| S-EPMC6274825 | biostudies-literature
| S-EPMC4498404 | biostudies-literature