Unknown

Dataset Information

0

TE-greedy-nester: structure-based detection of LTR retrotransposons and their nesting.


ABSTRACT:

Motivation

Transposable elements (TEs) in eukaryotes often get inserted into one another, forming sequences that become a complex mixture of full-length elements and their fragments. The reconstruction of full-length elements and the order in which they have been inserted is important for genome and transposon evolution studies. However, the accumulation of mutations and genome rearrangements over evolutionary time makes this process error-prone and decreases the efficiency of software aiming to recover all nested full-length TEs.

Results

We created software that uses a greedy recursive algorithm to mine increasingly fragmented copies of full-length LTR retrotransposons in assembled genomes and other sequence data. The software called TE-greedy-nester considers not only sequence similarity but also the structure of elements. This new tool was tested on a set of natural and synthetic sequences and its accuracy was compared to similar software. We found TE-greedy-nester to be superior in a number of parameters, namely computation time and full-length TE recovery in highly nested regions.

Availability and implementation

http://gitlab.fi.muni.cz/lexa/nested.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Lexa M 

PROVIDER: S-EPMC7755421 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

TE-greedy-nester: structure-based detection of LTR retrotransposons and their nesting.

Lexa Matej M   Jedlicka Pavel P   Vanat Ivan I   Cervenansky Michal M   Kejnovsky Eduard E  

Bioinformatics (Oxford, England) 20201201 20


<h4>Motivation</h4>Transposable elements (TEs) in eukaryotes often get inserted into one another, forming sequences that become a complex mixture of full-length elements and their fragments. The reconstruction of full-length elements and the order in which they have been inserted is important for genome and transposon evolution studies. However, the accumulation of mutations and genome rearrangements over evolutionary time makes this process error-prone and decreases the efficiency of software a  ...[more]

Similar Datasets

| S-EPMC3248453 | biostudies-literature
| S-EPMC2253517 | biostudies-literature
| S-EPMC2686373 | biostudies-literature
| S-EPMC9456015 | biostudies-literature
| S-EPMC4139406 | biostudies-literature
| S-EPMC6022998 | biostudies-literature
| S-EPMC11204925 | biostudies-literature
| S-EPMC6218219 | biostudies-literature
| S-EPMC4608283 | biostudies-literature
| S-EPMC1858694 | biostudies-literature