Project description:In the present study we analyzed centromeric localization on chromosome 12 in different orangutan and results showed that each individual exhibits a different arrangement of CENP-A binding domains.
Project description:Bonobo (Pan paniscus), Sumatran orangutan (Pongo abelii) Y chromosome sequencing and assembly and the improvement of the gorilla Y draft assembly
Project description:We couple long-read sequence assembly, full-length cDNA sequencing, and a multi-platform scaffolding approach to produce ab initio chimpanzee and orangutan genome assemblies where most genes are complete, gaps are closed, and novel gene models are identified. We further analyzed the overlap between structural variants in the human genome and gene expression differences in human and chimpanzee cells, including iPS-derived organoid radial glia cells.
Project description:Comparing the molecular and cellular properties among primates is crucial to better understand human evolution and biology. However, it is difficult or ethically even impossible to collect matched tissues from many primates, especially during development. An alternative is to model different cell types and their development using induced pluripotent stem cells (iPSCs). These can be generated from many tissue sources, but non-invasive sampling would decisively broaden the spectrum of non-human primates that can be investigated. Here, we report the generation of primate iPSCs from urine samples. We first validate and optimize the procedure using human urine samples and show that Sendai virus transduction of reprogramming factors into urinary cells efficiently generates integration-free iPSCs, which maintain their pluripotency under feeder-free culture conditions. We demonstrate that this method is also applicable to gorilla and orangutan urinary cells isolated from a non-sterile zoo floor. We characterize the urinary cells, iPSCs and derived neural progenitor cells using karyotyping, immunohistochemistry, differentiation assays and RNA-sequencing. We show that the urine-derived human iPSCs are indistinguishable from well characterized PBMC-derived human iPSCs and that the gorilla and orangutan iPSCs are well comparable to the human iPSCs. In summary, this study introduces a novel and efficient approach to generate primate iPSCs non-invasively from urinary samples.
Project description:Testis RNA-Seq datasets from bonobo (Pan paniscus) and Bornean orangutan (Pongo pygmaeus pygmaeus) for the assembly and expression of Y-chromosomal ampliconic genes
Project description:We compared a 5 week time course of cortical organoid differentiation across human, chimpanzee, orangutan, and rhesus using bulk RNAseq. In addition, single cell RNAseq was performed on a subset of time points from human cells in weeks 0, 1, 2, and 5.
Project description:Accurate annotation of transcript isoforms is crucial to understand gene functions, but automated methods for reconstructing full-length transcripts from RNA sequencing (RNA-seq) data remain imprecise. We developed Bookend, a software package for transcript assembly that incorporates data from different RNA-seq techniques, with a focus on identifying and utilizing RNA 5′ and 3′ ends. Through end-guided assembly with Bookend we demonstrate that correct modeling of transcript start and end sites is essential for precise transcript assembly. Furthermore, we discovered that utilization of end-labeled reads present in full-length single-cell RNA-seq (scRNA-seq) datasets dramatically improves the precision of transcript assembly in single cells. Finally, we show that hybrid assembly across short-read, long-read, and end-capture RNA-seq datasets from Arabidopsis, as well as meta-assembly of RNA-seq from single mouse embryonic stem cells (mESCs) can produce end-to-end transcript annotations of comparable quality to reference annotations in these model organisms.
Project description:Aegilops tauschii is the donor of the wheat D subgenome and an important genetic resource for wheat. The assembly of Ae. tauschii acc. AL8/78 reference genome sequence Aet v4.0 was therefore an important milestone for wheat biology and breeding. The combination of the > 4.2 Gb size of the Ae. tauschii genome and > 84% of recently evolved repeated sequences make sequencing this genome challenging. Here, we report further advances in the development of the Ae. tauschii acc. AL8/78 genome sequence. Two new genome-wide optical maps were constructed and employed in the revision of pseudomolecules and estimations of gap lengths. Gaps were closed with contigs of single-molecule Pacific Biosciences reads. The number of gaps in Aet v5.0 decreased by 38,899 compared to Aet v4.0. Transposable elements and protein-coding genes were reannotated. The number of high-confidence genes was reduced from 38,886 in Aet v4.0 to 32,980 in Aet v5.0. A nonredundant set of 478 biologically important genes including many of known function in wheat was manually annotated. Sixty-one microRNA precursor and 60 phasiRNA loci were discovered, annotated, and their expression was characterized. Also characterized was expression of other small RNAs, such as hc-siRNAs and tRFs. This upgraded genome sequence will facilitate the use of Ae. tauschii in wheat breeding and biological research. Aegilops tauschii is the donor of the wheat D subgenome and an important genetic resource for wheat. The assembly of Ae. tauschii acc. AL8/78 reference genome sequence Aet v4.0 was therefore an important milestone for wheat biology and breeding. The combination of the > 4.2 Gb size of the Ae. tauschii genome and > 84% of recently evolved repeated sequences make sequencing this genome challenging. Here, we report further advances in the development of the Ae. tauschii acc. AL8/78 genome sequence. Two new genome-wide optical maps were constructed and employed in the revision of pseudomolecules and estimations of gap lengths. Gaps were closed with contigs of single-molecule Pacific Biosciences reads. The number of gaps in Aet v5.0 decreased by 38,899 compared to Aet v4.0. Transposable elements and protein-coding genes were reannotated. The number of high-confidence genes was reduced from 38,886 in Aet v4.0 to 32,980 in Aet v5.0. A nonredundant set of 478 biologically important genes including many of known function in wheat was manually annotated. Sixty-one microRNA precursor and 60 phasiRNA loci were discovered, annotated, and their expression was characterized. Also characterized was expression of other small RNAs, such as hc-siRNAs and tRFs. This upgraded genome sequence will facilitate the use of Ae. tauschii in wheat breeding and biological research. Aegilops tauschii is the donor of the wheat D subgenome and an important genetic resource for wheat. The assembly of Ae. tauschii acc. AL8/78 reference genome sequence Aet v4.0 was therefore an important milestone for wheat biology and breeding. The combination of the > 4.2 Gb size of the Ae. tauschii genome and > 84% of recently evolved repeated sequences make sequencing this genome challenging. Here, we report further advances in the development of the Ae. tauschii acc. AL8/78 genome sequence. Two new genome-wide optical maps were constructed and employed in the revision of pseudomolecules and estimations of gap lengths. Gaps were closed with contigs of single-molecule Pacific Biosciences reads. The number of gaps in Aet v5.0 decreased by 38,899 compared to Aet v4.0. Transposable elements and protein-coding genes were reannotated. The number of high-confidence genes was reduced from 38,886 in Aet v4.0 to 32,980 in Aet v5.0. A nonredundant set of 478 biologically important genes including many of known function in wheat was manually annotated. Sixty-one microRNA precursor and 60 phasiRNA loci were discovered, annotated, and their expression was characterized. Also characterized was expression of other small RNAs, such as hc-siRNAs and tRFs. This upgraded genome sequence will facilitate the use of Ae. tauschii in wheat breeding and biological research.