Unknown

Dataset Information

0

16Stimator: statistical estimation of ribosomal gene copy numbers from draft genome assemblies.


ABSTRACT: The 16S rRNA gene (16S) is an accepted marker of bacterial taxonomic diversity, even though differences in copy number obscure the relationship between amplicon and organismal abundances. Ancestral state reconstruction methods can predict 16S copy numbers through comparisons with closely related reference genomes; however, the database of closed genomes is limited. Here, we extend the reference database of 16S copy numbers to de novo assembled draft genomes by developing 16Stimator, a method to estimate 16S copy numbers when these repetitive regions collapse during assembly. Using a read depth approach, we estimate 16S copy numbers for 12 endophytic isolates from Arabidopsis thaliana and confirm estimates by qPCR. We further apply this approach to draft genomes deposited in NCBI and demonstrate accurate copy number estimation regardless of sequencing platform, with an overall median deviation of 14%. The expanded database of isolates with 16S copy number estimates increases the power of phylogenetic correction methods for determining organismal abundances from 16S amplicon surveys.

SUBMITTER: Perisin M 

PROVIDER: S-EPMC4796925 | biostudies-literature | 2016 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

16Stimator: statistical estimation of ribosomal gene copy numbers from draft genome assemblies.

Perisin Matthew M   Vetter Madlen M   Gilbert Jack A JA   Bergelson Joy J  

The ISME journal 20150911 4


The 16S rRNA gene (16S) is an accepted marker of bacterial taxonomic diversity, even though differences in copy number obscure the relationship between amplicon and organismal abundances. Ancestral state reconstruction methods can predict 16S copy numbers through comparisons with closely related reference genomes; however, the database of closed genomes is limited. Here, we extend the reference database of 16S copy numbers to de novo assembled draft genomes by developing 16Stimator, a method to  ...[more]

Similar Datasets

| S-EPMC7607385 | biostudies-literature
| S-EPMC9710636 | biostudies-literature
| S-EPMC9470731 | biostudies-literature
| S-EPMC10100071 | biostudies-literature
| PRJEB60432 | ENA
| S-EPMC10759652 | biostudies-literature
| S-EPMC2099498 | biostudies-literature
| S-EPMC2851716 | biostudies-literature
| S-EPMC7770131 | biostudies-literature
| S-EPMC8248862 | biostudies-literature