
Dataset Information


GET_PANGENES: calling pangenes from plant genome alignments confirms presence-absence variation.


ABSTRACT: Crop pangenomes made from individual cultivar assemblies promise easy access to conserved genes, but genome content variability and inconsistent identifiers hamper their exploration. To address this, we define pangenes, which summarize a species' coding potential and link back to original annotations. The protocol get_pangenes performs whole genome alignments (WGA) to call syntenic gene models based on coordinate overlaps. A benchmark with small and large plant genomes shows that pangenes recapitulate phylogeny-based orthologies and produce complete soft-core gene sets. Moreover, WGAs support lift-over and help confirm gene presence-absence variation. Source code and documentation: https://github.com/Ensembl/plant-scripts
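
The idea of calling gene models "based on coordinate overlaps" can be illustrated with a minimal Python sketch. This is not the get_pangenes implementation: the Gene record, the call_pairs helper, the gene identifiers, and the 0.5 overlap threshold are all assumptions made for illustration. It supposes that genes from genome A have already been projected onto genome B coordinates by a whole genome alignment, and pairs them with genes annotated on B when their intervals overlap sufficiently.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    gene_id: str
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def overlap_bp(a: Gene, b: Gene) -> int:
    """Number of bases shared by two intervals on the same sequence."""
    if a.chrom != b.chrom:
        return 0
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def call_pairs(mapped, annotated, min_frac=0.5):
    """Pair genes whose overlap covers at least min_frac of the shorter one.

    min_frac is a hypothetical threshold chosen for this sketch.
    """
    pairs = []
    for m in mapped:
        for g in annotated:
            shared = overlap_bp(m, g)
            shorter = min(m.end - m.start + 1, g.end - g.start + 1)
            if shared / shorter >= min_frac:
                pairs.append((m.gene_id, g.gene_id))
    return pairs


# Hypothetical genes from genome A, already projected onto genome B coordinates
mapped = [Gene("A.g1", "chr1", 100, 500), Gene("A.g2", "chr1", 2000, 2600)]
# Hypothetical genes annotated on genome B
annotated = [Gene("B.g1", "chr1", 150, 520), Gene("B.g2", "chr2", 2000, 2600)]

pairs = call_pairs(mapped, annotated)
print(pairs)
```

In this toy input A.g1 is paired with B.g1, while A.g2 finds no overlapping model on the same sequence. An unpaired gene like this would only be a candidate for presence-absence variation; the abstract indicates that the alignments themselves (for example via lift-over) are used to help confirm such cases.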

SUBMITTER: Contreras-Moreira B 

PROVIDER: S-EPMC10552430 | biostudies-literature | 2023 Oct

REPOSITORIES: biostudies-literature


Publications

GET_PANGENES: calling pangenes from plant genome alignments confirms presence-absence variation.

Contreras-Moreira B, Saraf S, Naamati G, Casas AM, Amberkar SS, Flicek P, Jones AR, Dyer S

Genome Biology, 2023 Oct 5, issue 1



Similar Datasets

| S-EPMC4066771 | biostudies-literature
| S-EPMC7653742 | biostudies-literature
| S-EPMC9949116 | biostudies-literature
| S-EPMC2780416 | biostudies-literature
| S-EPMC3433342 | biostudies-literature
| S-EPMC10273549 | biostudies-literature
| S-EPMC7430858 | biostudies-literature
| S-EPMC5053417 | biostudies-literature
| S-EPMC8495401 | biostudies-literature
| S-EPMC2590607 | biostudies-literature