Dataset Information

Identifying repeat domains in large genomes.


ABSTRACT: We present a graph-based method for the analysis of repeat families in a repeat library. We build a repeat domain graph that decomposes a repeat library into repeat domains, short subsequences shared by multiple repeat families, and reveals the mosaic structure of repeat families. Our method recovers documented mosaic repeat structures and suggests additional putative ones. Our method is useful for elucidating the evolutionary history of repeats and annotating de novo generated repeat libraries.
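The idea of a repeat domain (a short subsequence shared by several repeat families) can be illustrated with a toy sketch. This is not the authors' algorithm: it swaps in exact shared k-mers as a crude stand-in for domains, and the family names, sequences, and the function names `shared_kmers` and `family_graph` are hypothetical. Real repeat copies diverge, so a practical method would need approximate matching rather than exact k-mer identity.

```python
from collections import defaultdict
from itertools import combinations


def shared_kmers(library, k):
    """Map each k-mer to the families containing it, keeping only
    k-mers that occur in at least two families."""
    owners = defaultdict(set)
    for family, seq in library.items():
        for i in range(len(seq) - k + 1):
            owners[seq[i:i + k]].add(family)
    return {kmer: fams for kmer, fams in owners.items() if len(fams) >= 2}


def family_graph(library, k):
    """Return {(family_a, family_b): [shared k-mers]}; an edge means the
    two families share at least one exact k-mer."""
    edges = defaultdict(list)
    for kmer, fams in shared_kmers(library, k).items():
        for a, b in combinations(sorted(fams), 2):
            edges[(a, b)].append(kmer)
    return {pair: sorted(kmers) for pair, kmers in edges.items()}


# Hypothetical toy library: famC is built as a mosaic of a prefix of famA
# and a suffix of famB, so it should link to both.
toy_library = {
    "famA": "ACGTACGGTTCA",
    "famB": "TTGACCATGGCA",
    "famC": "ACGTACGGCCATGGCA",
}
graph = family_graph(toy_library, k=8)
for pair, kmers in sorted(graph.items()):
    print(pair, kmers)
```

In this toy case famC connects to both famA and famB while famA and famB share nothing, which is the kind of mosaic pattern the abstract describes.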

SUBMITTER: Zhi D 

PROVIDER: S-EPMC1431705 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

Publications

Identifying repeat domains in large genomes.

Degui Zhi, Benjamin J Raphael, Alkes L Price, Haixu Tang, Pavel A Pevzner

Genome Biology, 2006-01-31, issue 1


Similar Datasets

| S-EPMC2533575 | biostudies-literature
| PRJEB34435 | ENA
| S-EPMC7479135 | biostudies-literature
| S-EPMC6171319 | biostudies-literature
| S-EPMC5430533 | biostudies-literature
| S-EPMC8283788 | biostudies-literature
| S-EPMC6609731 | biostudies-literature
| S-EPMC4864936 | biostudies-literature
| S-EPMC2275221 | biostudies-literature
| S-EPMC2605191 | biostudies-literature