Unknown

Dataset Information

0

Variation Patterns of NLR Clusters in Arabidopsis thaliana Genomes.


ABSTRACT: The nucleotide-binding domain and leucine-rich repeat (NLR) gene family is highly expanded in the plant lineage with extensive sequence and structure polymorphisms. To survey the landscape of NLR expansion, we mined the published long-read data generated by the resistance gene enrichment sequencing of 64 diverse Arabidopsis thaliana accessions. We found that the hot spots of massive multi-gene NLR cluster expansion did not typically span the whole cluster; instead, they were restricted to a handful of, or only one, dominant radiation(s). All sequences in such a radiation were distinct from other genes in the cluster but not from each other in the clade, making it difficult to assign trustworthy reference-based orthologies when multiple reference genes were present in the radiation. Consequently, NLR genes can be broadly divided into two types: radiating or high-fidelity, where high-fidelity genes are well conserved and well separated from other clades. A similar distinction could be made for NLR clusters, depending on whether cluster size was determined primarily by extensive radiation or the presence of numerous high-fidelity genes. We also identified groups of well-conserved NLR clades that were missing from the Columbia-0 reference genome. This suggests that the classification of NLRs using gene IDs from a single reference accession can rarely capture all major paralogs in a cluster accurately and representatively and that a reference-agnostic perspective is required to properly characterize these additional variations. Finally, we present a quantitative visualization method for differentiating these situations in a given clade of interest.

SUBMITTER: Lee RRQ 

PROVIDER: S-EPMC7747988 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Variation Patterns of NLR Clusters in <i>Arabidopsis thaliana</i> Genomes.

Lee Rachelle R Q RRQ   Chae Eunyoung E  

Plant communications 20200624 4


The nucleotide-binding domain and leucine-rich repeat (NLR) gene family is highly expanded in the plant lineage with extensive sequence and structure polymorphisms. To survey the landscape of NLR expansion, we mined the published long-read data generated by the resistance gene enrichment sequencing of 64 diverse <i>Arabidopsis thaliana</i> accessions. We found that the hot spots of massive multi-gene NLR cluster expansion did not typically span the whole cluster; instead, they were restricted to  ...[more]

Similar Datasets

| S-EPMC6684095 | biostudies-literature
| S-EPMC1431726 | biostudies-literature
| S-EPMC7826191 | biostudies-literature
| S-EPMC5850868 | biostudies-literature
| S-EPMC6709784 | biostudies-literature
| S-EPMC4301183 | biostudies-literature
| S-EPMC1892575 | biostudies-literature
| S-EPMC3121819 | biostudies-other
| S-EPMC4856438 | biostudies-literature
| S-EPMC1865379 | biostudies-literature