Differential Domain Distribution of gnomAD- and Disease-Linked Connexin Missense Variants.
Ontology highlight
ABSTRACT: Twenty-one human genes encode connexins, a family of homologous proteins making gap junction (GJ) channels, which mediate direct intercellular communication to synchronize tissue/organ activities. Genetic variants in more than half of the connexin genes are associated with dozens of different Mendelian inherited diseases. With rapid advances in DNA sequencing technology, more variants are being identified not only in families and individuals with diseases but also in people in the general population without any apparent linkage to Mendelian inherited diseases. Nevertheless, it remains challenging to classify the pathogenicity of a newly identified connexin variant. Here, we analyzed the disease- and Genome Aggregation Database (gnomAD, as a proxy of the general population)-linked variants in the coding region of the four disease-linked α connexin genes. We found that the most abundant and position-sensitive missense variants showed distinct domain distribution preference between disease- and gnomAD-linked variants. Plotting missense variants on topological and structural models revealed that disease-linked missense variants are highly enriched on the structurally stable/resolved domains, especially the pore-lining domains, while the gnomAD-linked missense variants are highly enriched in the structurally unstable/unresolved domains, especially the carboxyl terminus. In addition, disease-linked variants tend to be on highly conserved residues and those positions show evolutionary co-variation, while the gnomAD-linked missense variants are likely on less conserved residue positions and on positions without co-variation. Collectively, the revealed distribution patterns of disease- and gnomAD-linked missense variants further our understanding of the GJ structure-biological function relationship, which is valuable for classifying the pathogenicity of newly identified connexin variants.
SUBMITTER: Bai D
PROVIDER: S-EPMC8346055 | biostudies-literature |
REPOSITORIES: biostudies-literature
ACCESS DATA