Dataset Information

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

ABSTRACT: We present a high-quality de novo genome assembly (rheMacS) of the Chinese rhesus macaque (Macaca mulatta) using long-read sequencing and multiplatform scaffolding approaches. Compared to the current Indian rhesus macaque reference genome (rheMac8), rheMacS increases sequence contiguity 75-fold, closing 21,940 of the remaining assembly gaps (60.8 Mbp). We improve gene annotation by generating more than two million full-length transcripts from ten different tissues by long-read RNA sequencing. We sequence resolve 53,916 structural variants (96% novel) and identify 17,000 ape-specific structural variants (ASSVs) based on comparison to ape genomes. Many ASSVs map within ChIP-seq predicted enhancer regions where apes and macaque show diverged enhancer activity and gene expression. We further characterize a subset that may contribute to ape- or great-ape-specific phenotypic traits, including taillessness, brain volume expansion, improved manual dexterity, and large body size. The rheMacS genome assembly serves as an ideal reference for future biomedical and evolutionary studies.

SUBMITTER: He Y

PROVIDER: S-EPMC6749001 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

He Yaoxi Y Luo Xin X Zhou Bin B Hu Ting T Meng Xiaoyu X Audano Peter A PA Kronenberg Zev N ZN Eichler Evan E EE Jin Jie J Guo Yongbo Y Yang Yanan Y Qi Xuebin X Su Bing B

Nature communications 20190917 1

We present a high-quality de novo genome assembly (rheMacS) of the Chinese rhesus macaque (Macaca mulatta) using long-read sequencing and multiplatform scaffolding approaches. Compared to the current Indian rhesus macaque reference genome (rheMac8), rheMacS increases sequence contiguity 75-fold, closing 21,940 of the remaining assembly gaps (60.8 Mbp). We improve gene annotation by generating more than two million full-length transcripts from ten different tissues by long-read RNA sequencing. We ...[more]

PMID: 31530812

Dataset Information

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

Publications

Long-read assembly of the Chinese rhesus macaque genome and identification of ape-specific structural variants.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Defining genetic diversity of rhesus macaque Fcγ receptors with long-read RNA sequencing.
| S-EPMC10803544 | biostudies-literature

Phenotypic Characterization of Chinese Rhesus Macaque Plasmablasts for Cloning Antigen-Specific Monoclonal Antibodies.
| S-EPMC6798180 | biostudies-literature

Adaptive Structural Variants Contributing to Human Brain Development Revealed by 1,026 Rhesus Macaque Genomes
2024-04-23 | GSE221928 | GEO

Comparative Genomic Analysis Identifies Great-Ape-Specific Structural Variants and Their Evolutionary Relevance.
| S-EPMC10461412 | biostudies-literature

Systematic Profiling of Full-Length Ig and TCR Repertoire Diversity in Rhesus Macaque through Long Read Transcriptome Sequencing.
| S-EPMC7276939 | biostudies-literature

Long-read sequencing and de novo assembly of a Chinese genome.
| S-EPMC4931320 | biostudies-literature

Hematological and biochemical parameters for Chinese rhesus macaque.
| S-EPMC6748566 | biostudies-literature

Limitations of the rhesus macaque draft genome assembly and annotation.
| S-EPMC3426473 | biostudies-literature

Long-read assembly of major histocompatibility complex and killer cell immunoglobulin-like receptor genome regions in cynomolgus macaque.
| S-EPMC9707422 | biostudies-literature