Unknown

Dataset Information

0

Detecting long tandem duplications in genomic sequences.


ABSTRACT:

Background

Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication.

Results

In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem duplication arrays of moderate to longer length regions, with possibly locally weak similarities, directly at the DNA level. On the A. thaliana genome, using a reference set of tandem duplicated genes built using TAIR,(a) we show that ReD Tandem is able to predict a large fraction of recently duplicated genes (dS ?ConclusionsReD Tandem allows to identify large tandem duplications without any annotation, leading to agnostic identification of tandem duplications. This approach nicely complements the usual protein gene based which ignores duplications involving non coding regions. It is however inherently restricted to relatively recent duplications. By recovering otherwise ignored events, ReD Tandem gives a more comprehensive view of existing evolutionary processes and may also allow to improve existing annotations.

SUBMITTER: Audemard E 

PROVIDER: S-EPMC3464658 | biostudies-literature | 2012 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting long tandem duplications in genomic sequences.

Audemard Eric E   Schiex Thomas T   Faraut Thomas T  

BMC bioinformatics 20120508


<h4>Background</h4>Detecting duplication segments within completely sequenced genomes provides valuable information to address genome evolution and in particular the important question of the emergence of novel functions. The usual approach to gene duplication detection, based on all-pairs protein gene comparisons, provides only a restricted view of duplication.<h4>Results</h4>In this paper, we introduce ReD Tandem, a software using a flow based chaining algorithm targeted at detecting tandem du  ...[more]

Similar Datasets

| S-EPMC3751903 | biostudies-literature
| S-EPMC8275333 | biostudies-literature
| S-EPMC5103432 | biostudies-literature
| S-EPMC4555851 | biostudies-literature
| S-EPMC7274563 | biostudies-literature
| S-EPMC3488214 | biostudies-literature
| S-EPMC6680281 | biostudies-literature
| S-EPMC8361843 | biostudies-literature
2014-06-30 | GSE57246 | GEO
2017-11-23 | GSE103624 | GEO