Project description:Recently developed methods that utilize partitioning of long genomic DNA fragments, and barcoding of shorter fragments derived from them, have succeeded in retaining long-range information in short sequencing reads. These so-called read cloud approaches represent a powerful, accurate, and cost-effective alternative to single-molecule long-read sequencing. We developed software, GROC-SVs, that takes advantage of read clouds for structural variant detection and assembly. We apply the method to two 10x Genomics data sets, one chromothriptic sarcoma with several spatially separated samples, and one breast cancer cell line, all Illumina-sequenced to high coverage. Comparison to short-fragment data from the same samples, and validation by mate-pair data from a subset of the sarcoma samples, demonstrate substantial improvement in specificity of breakpoint detection compared to short-fragment sequencing, at comparable sensitivity, and vice versa. The embedded long-range information also facilitates sequence assembly of a large fraction of the breakpoints; importantly, consecutive breakpoints that are closer than the average length of the input DNA molecules can be assembled together and their order and arrangement reconstructed, with some events exhibiting remarkable complexity. These features facilitated an analysis of the structural evolution of the sarcoma. In the chromothripsis, rearrangements occurred before copy number amplifications, and using the phylogenetic tree built from point mutation data, we show that single nucleotide variants and structural variants are not correlated. We predict significant future advances in structural variant science using 10x data analyzed with GROC-SVs and other read cloud-specific methods.
Project description:The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective fashion. Here, we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67X coverage, Sample GSM1551550). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Aedes aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that virtually all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, accurate, and can be applied to many species.
Project description:Kynureninase is a member of a large family of catalytically diverse but structurally homologous pyridoxal 5'-phosphate (PLP) dependent enzymes known as the aspartate aminotransferase superfamily or alpha-family. The Homo sapiens and other eukaryotic constitutive kynureninases preferentially catalyze the hydrolytic cleavage of 3-hydroxy-l-kynurenine to produce 3-hydroxyanthranilate and l-alanine, while l-kynurenine is the substrate of many prokaryotic inducible kynureninases. The human enzyme was cloned with an N-terminal hexahistidine tag, expressed, and purified from a bacterial expression system using Ni metal ion affinity chromatography. Kinetic characterization of the recombinant enzyme reveals classic Michaelis-Menten behavior, with a Km of 28.3 ± 1.9 µM and a specific activity of 1.75 µmol min⁻¹ mg⁻¹ for 3-hydroxy-dl-kynurenine. Crystals of recombinant kynureninase that diffracted to 2.0 Å were obtained, and the atomic structure of the PLP-bound holoenzyme was determined by molecular replacement using the Pseudomonas fluorescens kynureninase structure (PDB entry 1qz9) as the phasing model. A structural superposition with the P. fluorescens kynureninase revealed that these two structures resemble the "open" and "closed" conformations of aspartate aminotransferase. The comparison illustrates the dynamic nature of these proteins' small domains and reveals a role for Arg-434 similar to its role in other AAT alpha-family members. Docking of 3-hydroxy-l-kynurenine into the human kynureninase active site suggests that Asn-333 and His-102 are involved in substrate binding and molecular discrimination between inducible and constitutive kynureninase substrates.
Project description:As the evolution of miRNA genes has been found to be one of the important factors in the formation of modern humans, we performed a comparative analysis of the evolution of miRNA genes in two archaic hominines, Homo sapiens neanderthalensis and Homo sapiens denisova, and elucidated the expression of their target mRNAs in the brain. A comparative analysis of the genomes of primates, including species in the genus Homo, identified a group of miRNA genes having fixed substitutions with important implications for the evolution of Homo sapiens neanderthalensis and Homo sapiens denisova. The mRNAs targeted by miRNAs with mutations specific for Homo sapiens denisova exhibited enhanced expression during postnatal brain development in modern humans. By contrast, the expression of mRNAs targeted by miRNAs bearing variations specific for Homo sapiens neanderthalensis was shown to be enhanced in prenatal brain development. Our results highlight the importance of changes in miRNA gene sequences in the course of Homo sapiens denisova and Homo sapiens neanderthalensis evolution. The genetic alterations of miRNAs regulating the spatiotemporal expression of multiple genes in the prenatal and postnatal brain may contribute to the progressive evolution of brain function, which is consistent with the observations of fine technical and typological properties of tools and decorative items reported from archaeological Denisovan sites. The data also suggest that differential spatiotemporal regulation of gene products promoted by the subspecies-specific mutations in the miRNA genes might have occurred in the brains of Homo sapiens denisova and Homo sapiens neanderthalensis, potentially contributing to the cultural differences between these two archaic hominines.
Project description:Purpose: We investigated the evidence of recent positive selection in the human phototransduction system at the single nucleotide polymorphism (SNP) and gene levels. Methods: SNP genotyping data from the International HapMap Project for European, Eastern Asian, and African populations were used to discover differences in haplotype length and allele frequency between these populations. Numeric selection metrics were computed for each SNP and aggregated into gene-level metrics to measure evidence of recent positive selection. The level of recent positive selection in phototransduction genes was evaluated and compared to a set of genes shown previously to be under recent selection, and a set of highly conserved genes, as positive and negative controls, respectively. Results: Six of 20 phototransduction genes evaluated had gene-level selection metrics above the 90th percentile: RGS9, GNB1, RHO, PDE6G, GNAT1, and SLC24A1. The selection signal across these genes was found to be of similar magnitude to the positive control genes and much greater than the negative control genes. Conclusions: There is evidence for selective pressure in the genes involved in retinal phototransduction, and traces of this selective pressure can be demonstrated using SNP-level and gene-level metrics of allelic variation. We hypothesize that the selective pressure on these genes was related to their role in low light vision and retinal adaptation to ambient light changes. Uncovering the underlying genetics of evolutionary adaptations in phototransduction not only allows greater understanding of vision and visual diseases, but also the development of patient-specific diagnostic and intervention strategies.
Project description:Single-nucleus RNA sequencing (snRNA-seq) was used to profile the transcriptome of 16,015 nuclei in human adult testis. This dataset includes five samples from two different individuals. This dataset is part of a larger evolutionary study of adult testis at the single-nucleus level (97,521 single-nuclei in total) across mammals including 10 representatives of the three main mammalian lineages: human, chimpanzee, bonobo, gorilla, gibbon, rhesus macaque, marmoset, mouse (placental mammals); grey short-tailed opossum (marsupials); and platypus (egg-laying monotremes). Corresponding data were generated for a bird (red junglefowl, the progenitor of domestic chicken), to be used as an evolutionary outgroup.