Unknown

Dataset Information

0

An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes.


ABSTRACT: The zebrafish is ideal for studying embryogenesis and is increasingly applied to model human disease. In these contexts, RNA-sequencing (RNA-seq) provides mechanistic insights by identifying transcriptome changes between experimental conditions. Application of RNA-seq relies on accurate transcript annotation for a genome of interest. Here, we find discrepancies in analysis from RNA-seq datasets quantified using Ensembl and RefSeq zebrafish annotations. These issues were due, in part, to variably annotated 3' untranslated regions and thousands of gene models missing from each annotation. Since these discrepancies could compromise downstream analyses and biological reproducibility, we built a more comprehensive zebrafish transcriptome annotation that addresses these deficiencies. Our annotation improves detection of cell type-specific genes in both bulk and single cell RNA-seq datasets, where it also improves resolution of cell clustering. Thus, we demonstrate that our new transcriptome annotation can outperform existing annotations, providing an important resource for zebrafish researchers.

SUBMITTER: Lawson ND 

PROVIDER: S-EPMC7486121 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

An improved zebrafish transcriptome annotation for sensitive and comprehensive detection of cell type-specific genes.

Lawson Nathan D ND   Li Rui R   Shin Masahiro M   Grosse Ann A   Yukselen Onur O   Stone Oliver A OA   Kucukural Alper A   Zhu Lihua L  

eLife 20200824


The zebrafish is ideal for studying embryogenesis and is increasingly applied to model human disease. In these contexts, RNA-sequencing (RNA-seq) provides mechanistic insights by identifying transcriptome changes between experimental conditions. Application of RNA-seq relies on accurate transcript annotation for a genome of interest. Here, we find discrepancies in analysis from RNA-seq datasets quantified using Ensembl and RefSeq zebrafish annotations. These issues were due, in part, to variably  ...[more]

Similar Datasets

| S-EPMC3223729 | biostudies-literature
2020-07-28 | GSE152759 | GEO
| S-EPMC9501657 | biostudies-literature
| S-EPMC6508516 | biostudies-literature
| S-EPMC6120630 | biostudies-literature
| S-EPMC6105091 | biostudies-literature
| S-EPMC7595951 | biostudies-literature
| PRJNA640350 | ENA
| S-EPMC3400636 | biostudies-literature
| S-EPMC5282741 | biostudies-literature