Project description:An archaeological bone fragment from Baishiya Karst Cave, China, was identified as stemming from a hominin through ZooMS (Zooarchaeology by Mass Spectrometry). Shotgun palaeoproteomic analyses were thereafter conducted on the specimen to refine the taxonomic identification and perform phylogenetic analyses. The reconstruted proteome shows that the newly described Baishiya Karst Cave individual, Xiahe 2, is most closely related to the high-coverage published genome from a Denisovan individual.
Project description:Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we have explored the use of short fragment libraries (85-110bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. These short fragment libraries were enriched for 1.69 Mb of exonic sequences, using custom 244K microarrays, and sequenced using AB/SOLiD. High enrichment specificity (60 M-bM-^@M-^S 75%) was obtained at 67-213x average coverage, with 77-92% and 90-98% of targeted regions covered with more than 25% and 10% of the average coverage, respectively. As a more appropriate measure of the evenness of coverage, which is relatively independent of sequencing depth, we introduce the evenness of coverage parameter E. E values up to 75% were achieved. To verify the accuracy of SNP/mutation detection we evaluated 384 known non-reference SNPs in the targeted regions. At ~ 200x average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls while 3.6% of targeted regions were marked as unsurveyed. A total of 1197 new variants were detected. Verification revealed only 8 false positive calls, resulting in an overall false positive rate of less than 1 per ~200,000 bp (0.0005%, equivalent to an overall phred score of 55). 4 samples + capture design file
Project description:Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we have explored the use of short fragment libraries (85-110bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. These short fragment libraries were enriched for 1.69 Mb of exonic sequences, using custom 244K microarrays, and sequenced using AB/SOLiD. High enrichment specificity (60 – 75%) was obtained at 67-213x average coverage, with 77-92% and 90-98% of targeted regions covered with more than 25% and 10% of the average coverage, respectively. As a more appropriate measure of the evenness of coverage, which is relatively independent of sequencing depth, we introduce the evenness of coverage parameter E. E values up to 75% were achieved. To verify the accuracy of SNP/mutation detection we evaluated 384 known non-reference SNPs in the targeted regions. At ~ 200x average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls while 3.6% of targeted regions were marked as unsurveyed. A total of 1197 new variants were detected. Verification revealed only 8 false positive calls, resulting in an overall false positive rate of less than 1 per ~200,000 bp (0.0005%, equivalent to an overall phred score of 55).
Project description:Using in silico modelling, such Genome-Scale Metabolic Network (GSMN), represents a promising approach to predicting and understanding the potential for specialised metabolite production in a given organism. To address these questions, we reconstructed a new high-quality GSMN for the Penicillium rubens Wisconsin 54-1255 strain, a commonly used model organism. Our reconstruction, iPrub22, adheres to current convention standards and quality criteria, incorporating updated functional annotations, orthology searches with different GSMN templates, data from previous reconstructions, and manual curation steps targeting basal and specialised metabolites. With a MEMOTE score of 74% and a metabolic coverage of 45%, iPrub22 includes 5,464 metabolites interconnected by 5,919 reactions, of which 5,033 are supported by at least one genomic sequence. Of the metabolites present in iPrub22, 13% are categorised as belonging to specialised metabolism.
Project description:DNA copy number variations occur within populations and aberrations can cause disease. We sought to develop an improved lab-automatable, cost-efficient, accurate platform to profile DNA copy number. We developed a sequencing-based assay of nuclear, mitochondrial, and telomeric DNA copy number that draws on the unbiased nature of next-generation sequencing and incorporates techniques developed for RNA expression profiling. To demonstrate this platform, we assayed UMC-11 cells using 5 million 33 nt reads and found tremendous copy number variation, including regions of single and homogeneous deletions and amplifications to 29 copies; 5 times more mitochondria and 4 times less telomeric sequence than a pool of non-diseased, blood-derived DNA; and that UMC-11 was derived from a male individual. The described assay outputs absolute copy number, outputs an error estimate (p-value), and is more accurate than array-based platforms at high copy number. The platform enables profiling of mitochondrial levels and telomeric length. The assay is lab-automatable and has a genomic resolution and cost that are tunable based on the number of sequence reads. DNA genome sequencing at roughly 0.03 coverage to identify genomic copy number variations
Project description:To define the sequence preference of SALL4 C2H2 zinc finger domains, we performed SELEX coupled with high-throughput sequencing (HT-SELEX) using the purified SALL4 ZFC1, ZFC2 and ZFC4 domains combined with no protein control experiment. We re-sequenced the libraries from E-MTAB-9236 with very high coverage to estimate the minimum number of reads required per sample for accurate results.
Project description:The development of CRISPR genetic screening tools has improved functional genomics, as these tools enable precise genomic editing, provide broad access to genomic regions beyond protein-coding genes, and have fewer off-target effects than other functional genomics modalities, allowing for novel applications with smaller library sizes compared to prior technologies. Pooled functional genomics screens require high cellular coverage per perturbation to accurately quantify phenotypes and average out phenotype-independent variability across the population. While more compact libraries have decreased the number of cells needed for a given screen, the cell coverage required for large-scale CRISPR screens still poses technical hurdles to screen in more challenging systems, such as iPSC-derived and primary cells. A major factor that influences cell coverage is screening library uniformity, as larger variation in individual guide RNA abundance requires higher cell coverage to reliably measure low-abundance guides. In this work, we have systematically optimized guide RNA cloning procedures to decrease bias. We implement these protocols to demonstrate that CRISPRi screens using 10-fold fewer cells than the current standard provides equivalent statistically significant hit-calling results to screens run at higher coverage, opening the possibility of conducting genome-wide and other large-scale CRISPR screens in technically challenging models.