Unknown

Dataset Information

0

A high-contiguity Brassica nigra genome localizes active centromeres and defines the ancestral Brassica genome.


ABSTRACT: It is only recently, with the advent of long-read sequencing technologies, that we are beginning to uncover previously uncharted regions of complex and inherently recursive plant genomes. To comprehensively study and exploit the genome of the neglected oilseed Brassica nigra, we generated two high-quality nanopore de novo genome assemblies. The N50 contig lengths for the two assemblies were 17.1 Mb (12 contigs), one of the best among 324 sequenced plant genomes, and 0.29 Mb (424 contigs), respectively, reflecting recent improvements in the technology. Comparison with a de novo short-read assembly corroborated genome integrity and quantified sequence-related error rates (0.2%). The contiguity and coverage allowed unprecedented access to low-complexity regions of the genome. Pericentromeric regions and coincidence of hypomethylation enabled localization of active centromeres and identified centromere-associated ALE family retro-elements that appear to have proliferated through relatively recent nested transposition events (<1 Ma). Genomic distances calculated based on synteny relationships were used to define a post-triplication Brassica-specific ancestral genome, and to calculate the extensive rearrangements that define the evolutionary distance separating B. nigra from its diploid relatives.

SUBMITTER: Perumal S 

PROVIDER: S-EPMC7419231 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7385995 | biostudies-literature
| S-EPMC4536001 | biostudies-literature
| S-EPMC449767 | biostudies-literature
| S-EPMC5803254 | biostudies-literature
| S-EPMC4199417 | biostudies-literature
| S-EPMC7731534 | biostudies-literature
| PRJNA18141 | ENA
| PRJNA175368 | ENA
| S-EPMC3765694 | biostudies-literature
| S-EPMC1199546 | biostudies-literature