Unknown

Dataset Information

0

CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome.


ABSTRACT: Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining crucial traits. However, many major crops have gigabase-scale genomes, which are not well-suited to WGS. Existing cost-effective sequencing approaches including re-sequencing, exome-sequencing and restriction enzyme-based methods all have difficulty in obtaining long novel genomic sequences from highly divergent population with large genome size. The present study presented a reference-independent core genome targeted sequencing approach, CGT-seq, which employed epigenomic information from both active and repressive epigenetic marks to guide the assembly of the core genome mainly composed of promoter and intragenic regions. This method was relatively easily implemented, and displayed high sensitivity and specificity for capturing the core genome of bread wheat. 95% intragenic and 89% promoter region from wheat were covered by CGT-seq read. We further demonstrated in rice that CGT-seq captured hundreds of novel genes and regulatory sequences from a previously unsequenced ecotype. Together, with specific enrichment and sequencing of regions within and nearby genes, CGT-seq is a time- and resource-effective approach to profiling functionally relevant regions in sequenced and non-sequenced populations with large genomes.

SUBMITTER: Qi M 

PROVIDER: S-EPMC6182137 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

CGT-seq: epigenome-guided de novo assembly of the core genome for divergent populations with large genome.

Qi Meifang M   Li Zijuan Z   Liu Chunmei C   Hu Wenyan W   Ye Luhuan L   Xie Yilin Y   Zhuang Yili Y   Zhao Fei F   Teng Wan W   Zheng Qi Q   Fan Zhenjun Z   Xu Lin L   Lang Zhaobo Z   Tong Yiping Y   Zhang Yijing Y  

Nucleic acids research 20181001 18


Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparison of the genetic architecture among divergent populations or between crops and wild relatives is essential for obtaining functional components determining crucial traits. However, many major crops have gigabase-scale genomes, which are not well-suited to WGS. Existing cost-effective sequencing approaches including re-sequencing, exome-sequencing and re  ...[more]

Similar Datasets

| S-EPMC5681816 | biostudies-other
| S-EPMC3746961 | biostudies-literature
| S-EPMC4315662 | biostudies-literature
| S-EPMC4058956 | biostudies-literature
2018-05-27 | GSE107827 | GEO
| S-EPMC6072799 | biostudies-literature
| S-EPMC2143711 | biostudies-other
2022-11-23 | GSE212159 | GEO
| S-EPMC3469330 | biostudies-literature
| S-EPMC4177539 | biostudies-literature