Unknown

Dataset Information

0

A New Catalog of Structural Variants in 1,301 A. thaliana Lines from Africa, Eurasia, and North America Reveals a Signature of Balancing Selection at Defense Response Genes.


ABSTRACT: Genomic variation in the model plant Arabidopsis thaliana has been extensively used to understand evolutionary processes in natural populations, mainly focusing on single-nucleotide polymorphisms. Conversely, structural variation has been largely ignored in spite of its potential to dramatically affect phenotype. Here, we identify 155,440 indels and structural variants ranging in size from 1 bp to 10 kb, including presence/absence variants (PAVs), inversions, and tandem duplications in 1,301 A. thaliana natural accessions from Morocco, Madeira, Europe, Asia, and North America. We show evidence for strong purifying selection on PAVs in genes, in particular for housekeeping genes and homeobox genes, and we find that PAVs are concentrated in defense-related genes (R-genes, secondary metabolites) and F-box genes. This implies the presence of a "core" genome underlying basic cellular processes and a "flexible" genome that includes genes that may be important in spatially or temporally varying selection. Further, we find an excess of intermediate frequency PAVs in defense response genes in nearly all populations studied, consistent with a history of balancing selection on this class of genes. Finally, we find that PAVs in genes involved in the cold requirement for flowering (vernalization) and drought response are strongly associated with temperature at the sites of origin.

SUBMITTER: Goktay M 

PROVIDER: S-EPMC8042739 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2020-08-31 | PRJEB38975 | EVA
| S-EPMC8662644 | biostudies-literature
| S-EPMC5714260 | biostudies-literature
| S-EPMC6965331 | biostudies-literature
| S-EPMC5680632 | biostudies-literature
| S-EPMC4355296 | biostudies-literature
| S-EPMC6192627 | biostudies-literature
| S-EPMC5005231 | biostudies-literature
2010-11-01 | GSE24034 | GEO
| S-EPMC5665159 | biostudies-literature