Unknown

Dataset Information

0

I-ADHoRe 3.0--fast and sensitive detection of genomic homology in extremely large data sets.


ABSTRACT: Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1?h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein-protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution.

SUBMITTER: Proost S 

PROVIDER: S-EPMC3258164 | biostudies-literature | 2012 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

i-ADHoRe 3.0--fast and sensitive detection of genomic homology in extremely large data sets.

Proost Sebastian S   Fostier Jan J   De Witte Dieter D   Dhoedt Bart B   Demeester Piet P   Van de Peer Yves Y   Vandepoele Klaas K  

Nucleic acids research 20111118 2


Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a softwa  ...[more]

Similar Datasets

| S-EPMC2375126 | biostudies-literature
| S-EPMC3439324 | biostudies-literature
| S-EPMC3965454 | biostudies-literature
| S-EPMC6581055 | biostudies-literature
| S-EPMC7826169 | biostudies-literature
| S-EPMC6886512 | biostudies-literature
| S-EPMC10046006 | biostudies-literature
| S-EPMC4682108 | biostudies-literature
| S-EPMC6344855 | biostudies-literature
| S-EPMC5389551 | biostudies-literature