Unknown

Dataset Information

0

Sequence diversity analyses of an improved rhesus macaque genome enhance its biomedical utility.


ABSTRACT: The rhesus macaque (Macaca mulatta) is the most widely studied nonhuman primate (NHP) in biomedical research. We present an updated reference genome assembly (Mmul_10, contig N50 = 46 Mbp) that increases the sequence contiguity 120-fold and annotate it using 6.5 million full-length transcripts, thus improving our understanding of gene content, isoform diversity, and repeat organization. With the improved assembly of segmental duplications, we discovered new lineage-specific genes and expanded gene families that are potentially informative in studies of evolution and disease susceptibility. Whole-genome sequencing (WGS) data from 853 rhesus macaques identified 85.7 million single-nucleotide variants (SNVs) and 10.5 million indel variants, including potentially damaging variants in genes associated with human autism and developmental delay, providing a framework for developing noninvasive NHP models of human disease.

SUBMITTER: Warren WC 

PROVIDER: S-EPMC7818670 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5135103 | biostudies-literature
| S-EPMC1790710 | biostudies-literature
| S-EPMC3218825 | biostudies-literature
| S-EPMC5663730 | biostudies-literature
| S-EPMC5604784 | biostudies-literature
| S-EPMC5930483 | biostudies-literature
| S-EPMC4214606 | biostudies-literature
| S-EPMC3426473 | biostudies-literature
| S-EPMC4839223 | biostudies-literature
| S-EPMC4238990 | biostudies-literature