Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

Structural annotation of equine protein-coding genes determined by mRNA sequencing

ABSTRACT: The horse, like a majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for a majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and for a preliminary assessment of gene expression patterns. Sequencing of mRNA from eight equine tissues generated 293,758,105 thirty five-base sequence tags, equaling 10.28 giga-basepairs of total sequence data. The tag alignments represent approximately 208X coverage of the equine mRNA transcriptome and confirmed transcriptional activity for roughly 90% of the protein-coding gene structures predicted by Ensembl and NCBI. Tag coverage was sufficient to define structural annotation for 11,356 genes, while also identifying an additional 456 transcripts with exon/intron features that are not listed by either Ensembl or NCBI. Genomic locus data and intervals for the protein-coding genes predicted by the Ensembl and NCBI annotation pipelines were combined with 75,116 RNA-seq derived transcriptional units to generate a consensus equine protein-coding gene set of 20,302 defined loci. Gene ontology annotation was used to compare the functional and structural categories of genes expressed in either a tissue-restricted pattern or broadly across all tissue samples. Examination of 8 equine RNA samples representing 6 distinct tissues

ORGANISM(S): Equus caballus

SUBMITTER: Stephen Coleman

PROVIDER: E-GEOD-21925 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Publications

Structural annotation of equine protein-coding genes determined by mRNA sequencing.

Coleman S J SJ Zeng Z Z Wang K K Luo S S Khrebtukova I I Mienaltowski M J MJ Schroth G P GP Liu J J MacLeod J N JN

Animal genetics 20101201

The horse, like the majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for the majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and fo ...[more]

PMID: 21070285

Dataset Information

Structural annotation of equine protein-coding genes determined by mRNA sequencing

Publications

Structural annotation of equine protein-coding genes determined by mRNA sequencing.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Structural annotation of equine protein-coding genes determined by mRNA sequencing
2010-11-10 | GSE21925 | GEO

Analysis of Unannotated Equine Transcripts Identified by mRNA Sequencing
2013-07-30 | E-GEOD-46858 | biostudies-arrayexpress

Analysis of Unannotated Equine Transcripts Identified by mRNA Sequencing
2013-07-30 | GSE46858 | GEO

NimbleGen 42M data for the HuRef individual
2013-02-12 | E-GEOD-20289 | biostudies-arrayexpress

RNA-seq of tissue panel samples from zebrafish (Danio rerio), medaka (Oryzias latipes), and rainbow trout (Oncorhynchus mykiss)
2020-04-30 | E-MTAB-8959 | biostudies-arrayexpress

RNA-Seq of liver tissue samples from northern pike (Esox lucius), coho salmon (Oncorhynchus kisutch) and Arctic charr (Salvelinus alpinus)
2020-04-30 | E-MTAB-8962 | biostudies-arrayexpress

Stallion sperm transcriptome as revealed by microarray analysis and RNA sequencing
2013-02-19 | E-GEOD-38725 | biostudies-arrayexpress

Global Transcriptome Characterization and Assembly of thermophilic ascomycete Chaetomium thermophilum
2019-12-31 | GSE116834 | GEO

Membrane enriched proteome of transfected SW480 cells
2013-05-30 | PXD000230 | Pride

RECURRENT SETBP1 MUTATIONS IN ATYPICAL CHRONIC MYELOID LEUKEMIA
2012-12-07 | E-GEOD-42146 | biostudies-arrayexpress