
Dataset Information


Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq.


ABSTRACT: The Tibetan Schizothoracinae fish Gymnocypris przewalskii can adapt to the extreme plateau environment, making it an ideal biological material for evolutionary biology research. However, the lack of a well-annotated reference genome has limited the study of the molecular genetics of G. przewalskii. To characterize its transcriptome features, we first used long-read sequencing technology in combination with RNA-seq for transcriptomic analysis. A total of 159,053 full-length (FL) transcripts were captured by Iso-Seq, with a mean length of 3,445 bp and an N50 of 4,348 bp. Of all FL transcripts, 145,169 were well annotated in public databases and 134,537 contained complete open reading frames. Based on all-vs.-all BLAST, 4,149 pairs of alternative splicing events and 13,293 long non-coding RNAs were detected; three randomly selected alternative splicing events were confirmed by RT-PCR and sequencing. A total of 118,185 perfect simple sequence repeats were identified from the FL transcripts. The FL transcriptome may provide a basis for further research on G. przewalskii.

SUBMITTER: Li X 

PROVIDER: S-EPMC8320875 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6704404 | biostudies-literature
| S-EPMC4931018 | biostudies-literature
| S-EPMC6105124 | biostudies-literature
| S-EPMC6240054 | biostudies-literature
| S-EPMC6821003 | biostudies-literature
| S-EPMC6787290 | biostudies-literature
2022-05-31 | PXD031213 | Pride
| S-EPMC5550469 | biostudies-other
| S-EPMC6912988 | biostudies-literature
| S-EPMC9723614 | biostudies-literature