Unknown

Dataset Information

0

Dissecting a hidden gene duplication: the Arabidopsis thaliana SEC10 locus.


ABSTRACT: Repetitive sequences present a challenge for genome sequence assembly, and highly similar segmental duplications may disappear from assembled genome sequences. Having found a surprising lack of observable phenotypic deviations and non-Mendelian segregation in Arabidopsis thaliana mutants in SEC10, a gene encoding a core subunit of the exocyst tethering complex, we examined whether this could be explained by a hidden gene duplication. Re-sequencing and manual assembly of the Arabidopsis thaliana SEC10 (At5g12370) locus revealed that this locus, comprising a single gene in the reference genome assembly, indeed contains two paralogous genes in tandem, SEC10a and SEC10b, and that a sequence segment of 7 kb in length is missing from the reference genome sequence. Differences between the two paralogs are concentrated in non-coding regions, while the predicted protein sequences exhibit 99% identity, differing only by substitution of five amino acid residues and an indel of four residues. Both SEC10 genes are expressed, although varying transcript levels suggest differential regulation. Homozygous T-DNA insertion mutants in either paralog exhibit a wild-type phenotype, consistent with proposed extensive functional redundancy of the two genes. By these observations we demonstrate that recently duplicated genes may remain hidden even in well-characterized genomes, such as that of A. thaliana. Moreover, we show that the use of the existing A. thaliana reference genome sequence as a guide for sequence assembly of new Arabidopsis accessions or related species has at least in some cases led to error propagation.

SUBMITTER: Vukasinovic N 

PROVIDER: S-EPMC3984084 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Dissecting a hidden gene duplication: the Arabidopsis thaliana SEC10 locus.

Vukašinović Nemanja N   Cvrčková Fatima F   Eliáš Marek M   Cole Rex R   Fowler John E JE   Žárský Viktor V   Synek Lukáš L  

PloS one 20140411 4


Repetitive sequences present a challenge for genome sequence assembly, and highly similar segmental duplications may disappear from assembled genome sequences. Having found a surprising lack of observable phenotypic deviations and non-Mendelian segregation in Arabidopsis thaliana mutants in SEC10, a gene encoding a core subunit of the exocyst tethering complex, we examined whether this could be explained by a hidden gene duplication. Re-sequencing and manual assembly of the Arabidopsis thaliana  ...[more]

Similar Datasets

| S-EPMC7792519 | biostudies-literature
| S-EPMC2710639 | biostudies-literature
| S-EPMC6750192 | biostudies-literature
| S-EPMC3136272 | biostudies-literature
| S-EPMC6260828 | biostudies-literature
| S-EPMC4437517 | biostudies-literature
| S-EPMC1952470 | biostudies-literature
| S-EPMC2650789 | biostudies-literature
| S-EPMC1461127 | biostudies-other
| S-EPMC2826654 | biostudies-literature