Unknown

Dataset Information

0

The unexpected depths of genome-skimming data: A case study examining Goodeniaceae floral symmetry genes.


ABSTRACT: Premise of the study:The use of genome skimming allows systematists to quickly generate large data sets, particularly of sequences in high abundance (e.g., plastomes); however, researchers may be overlooking data in low abundance that could be used for phylogenetic or evo-devo studies. Here, we present a bioinformatics approach that explores the low-abundance portion of genome-skimming next-generation sequencing libraries in the fan-flowered Goodeniaceae. Methods:Twenty-four previously constructed Goodeniaceae genome-skimming Illumina libraries were examined for their utility in mining low-copy nuclear genes involved in floral symmetry, specifically the CYCLOIDEA (CYC)-like genes. De novo assemblies were generated using multiple assemblers, and BLAST searches were performed for CYC1, CYC2, and CYC3 genes. Results:Overall Trinity, SOAPdenovo-Trans, and SOAPdenovo implementing lower k-mer values uncovered the most data, although no assembler consistently outperformed the others. Using SOAPdenovo-Trans across all 24 data sets, we recovered four CYC-like gene groups (CYC1, CYC2, CYC3A, and CYC3B) from a majority of the species. Alignments of the fragments included the entire coding sequence as well as upstream and downstream regions. Discussion:Genome-skimming data sets can provide a significant source of low-copy nuclear gene sequence data that may be used for multiple downstream applications.

SUBMITTER: Berger BA 

PROVIDER: S-EPMC5664964 | biostudies-literature | 2017 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

The unexpected depths of genome-skimming data: A case study examining Goodeniaceae floral symmetry genes.

Berger Brent A BA   Han Jiahong J   Sessa Emily B EB   Gardner Andrew G AG   Shepherd Kelly A KA   Ricigliano Vincent A VA   Jabaily Rachel S RS   Howarth Dianella G DG  

Applications in plant sciences 20171020 10


<h4>Premise of the study</h4>The use of genome skimming allows systematists to quickly generate large data sets, particularly of sequences in high abundance (e.g., plastomes); however, researchers may be overlooking data in low abundance that could be used for phylogenetic or evo-devo studies. Here, we present a bioinformatics approach that explores the low-abundance portion of genome-skimming next-generation sequencing libraries in the fan-flowered Goodeniaceae.<h4>Methods</h4>Twenty-four previ  ...[more]

Similar Datasets

| S-EPMC4858217 | biostudies-literature
| S-EPMC240701 | biostudies-literature
| S-EPMC2851953 | biostudies-literature
| PRJEB62527 | ENA
| S-EPMC6206312 | biostudies-literature
| S-EPMC3315478 | biostudies-literature
| S-EPMC5403951 | biostudies-literature
| S-EPMC6191499 | biostudies-literature
| S-EPMC6353098 | biostudies-literature
| S-EPMC9828177 | biostudies-literature