Unknown

Dataset Information

0

A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.


ABSTRACT: In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequencing data. The best assembly comprises 69 nucleome sequences and displays a contig length of up to 16 Mbp. Compared to an earlier Illumina short read-based NGS assembly (AthNd-1_v1), a 75 fold increase in contiguity was observed for AthNd-1_v2c. To assign contig locations independent from the Col-0 gold standard reference sequence, we used genetic anchoring to generate a de novo assembly. In addition, we assembled the chondrome and plastome sequences. Detailed analyses of AthNd-1_v2c allowed reliable identification of large genomic rearrangements between A. thaliana accessions contributing to differences in the gene sets that distinguish the genotypes. One of the differences detected identified a gene that is lacking from the Col-0 gold standard sequence. This de novo assembly extends the known proportion of the A. thaliana pan-genome.

SUBMITTER: Pucker B 

PROVIDER: S-EPMC6529160 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.

Pucker Boas B   Holtgräwe Daniela D   Stadermann Kai Bernd KB   Frey Katharina K   Huettel Bruno B   Reinhardt Richard R   Weisshaar Bernd B  

PloS one 20190521 5


In addition to the BAC-based reference sequence of the accession Columbia-0 from the year 2000, several short read assemblies of THE plant model organism Arabidopsis thaliana were published during the last years. Also, a SMRT-based assembly of Landsberg erecta has been generated that identified translocation and inversion polymorphisms between two genotypes of the species. Here we provide a chromosome-arm level assembly of the A. thaliana accession Niederzenz-1 (AthNd-1_v2c) based on SMRT sequen  ...[more]

Similar Datasets

| S-EPMC4948326 | biostudies-literature
| S-EPMC7137110 | biostudies-literature
| S-EPMC1904369 | biostudies-literature
| S-EPMC146221 | biostudies-other
| S-EPMC8214408 | biostudies-literature
| S-EPMC2604966 | biostudies-literature
| S-EPMC8692795 | biostudies-literature
| S-EPMC7056972 | biostudies-literature
| S-EPMC3583977 | biostudies-literature
| S-EPMC8322157 | biostudies-literature