Unknown

Dataset Information

0

Assessment and improvement of Indian-origin rhesus macaque and Mauritian-origin cynomolgus macaque genome annotations using deep transcriptome sequencing data.


ABSTRACT:

Background

The genome annotations of rhesus (Macaca mulatta) and cynomolgus (Macaca fascicularis) macaques, two of the most common non-human primate animal models, are limited.

Methods

We analyzed large-scale macaque RNA-based next-generation sequencing (RNAseq) data to identify un-annotated macaque transcripts.

Results

For both macaque species, we uncovered thousands of novel isoforms for annotated genes and thousands of un-annotated intergenic transcripts enriched with non-coding RNAs. We also identified thousands of transcript sequences which are partially or completely 'missing' from current macaque genome assemblies. We showed that many newly identified transcripts were differentially expressed during SIV infection of rhesus macaques or during Ebola virus infection of cynomolgus macaques.

Conclusions

For two important macaque species, we uncovered thousands of novel isoforms and un-annotated intergenic transcripts including coding and non-coding RNAs, polyadenylated and non-polyadenylated transcripts. This resource will greatly improve future macaque studies, as demonstrated by their applications in infectious disease studies.

SUBMITTER: Peng X 

PROVIDER: S-EPMC4176519 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6314804 | biostudies-literature
| S-EPMC7595107 | biostudies-literature
2011-10-14 | GSE29629 | GEO
2011-10-13 | E-GEOD-29629 | biostudies-arrayexpress
| S-EPMC3077881 | biostudies-literature
| S-EPMC7266356 | biostudies-literature
| S-EPMC4378995 | biostudies-literature
| S-EPMC1790710 | biostudies-literature
| PRJNA244048 | ENA