Unknown

Dataset Information

0

Identification of long non-coding RNA in the horse transcriptome.


ABSTRACT: Efforts to resolve the transcribed sequences in the equine genome have focused on protein-coding RNA. The transcription of the intergenic regions, although detected via total RNA sequencing (RNA-seq), has yet to be characterized in the horse. The most recent equine transcriptome based on RNA-seq from several tissues was a prime opportunity to obtain a concurrent long non-coding RNA (lncRNA) database.This lncRNA database has a breadth of eight tissues and a depth of over 20 million reads for select tissues, providing the deepest and most expansive equine lncRNA database. Utilizing the intergenic reads and three categories of novel genes from a previously published equine transcriptome pipeline, we better describe these groups by annotating the lncRNA candidates. These lncRNA candidates were filtered using an approach adapted from human lncRNA annotation, which removes transcripts based on size, expression, protein-coding capability and distance to the start or stop of annotated protein-coding transcripts.Our equine lncRNA database has 20,800 transcripts that demonstrate characteristics unique to lncRNA including low expression, low exon diversity and low levels of sequence conservation. These candidate lncRNA will serve as a baseline lncRNA annotation and begin to describe the RNA-seq reads assigned to the intergenic space in the horse.

SUBMITTER: Scott EY 

PROVIDER: S-EPMC5496257 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identification of long non-coding RNA in the horse transcriptome.

Scott E Y EY   Mansour T T   Bellone R R RR   Brown C T CT   Mienaltowski M J MJ   Penedo M C MC   Ross P J PJ   Valberg S J SJ   Murray J D JD   Finno C J CJ  

BMC genomics 20170704 1


<h4>Background</h4>Efforts to resolve the transcribed sequences in the equine genome have focused on protein-coding RNA. The transcription of the intergenic regions, although detected via total RNA sequencing (RNA-seq), has yet to be characterized in the horse. The most recent equine transcriptome based on RNA-seq from several tissues was a prime opportunity to obtain a concurrent long non-coding RNA (lncRNA) database.<h4>Results</h4>This lncRNA database has a breadth of eight tissues and a dept  ...[more]

Similar Datasets

| S-EPMC3582448 | biostudies-literature
| S-EPMC7299297 | biostudies-literature
2024-09-05 | PXD045625 | Pride
| 2693518 | ecrin-mdr-crc
| S-EPMC6050698 | biostudies-literature
| S-EPMC6670153 | biostudies-literature
2022-03-01 | PXD007121 | Pride
| S-EPMC3849168 | biostudies-literature
| S-EPMC8278092 | biostudies-literature
2024-09-06 | PXD055610 |