Project description:Most glioblastoma studies incorporate the layer of tumor molecular subtype based on the four-subtype classification system proposed in 2010. Nevertheless, there is no universally recognized and convenient tool for glioblastoma molecular subtyping, and each study applies a different set of markers and/or approaches that cause inconsistencies in data comparability and reproducibility between studies. Thus, this study aimed to create an applicable user-friendly tool for glioblastoma classification, with high accuracy, while using a significantly smaller number of variables. The study incorporated a TCGA microarray, sequencing datasets, and an independent cohort of 56 glioblastomas (LUHS cohort). The models were constructed by applying the Agilent G4502 dataset, and they were tested using the Affymetrix HG-U133a and Illumina Hiseq cohorts, as well as the LUHS cases. Two classification models were constructed by applying a logistic regression classification algorithm, based on the mRNA levels of twenty selected genes. The classifiers were translated to a RT-qPCR assay and validated in an independent cohort of 56 glioblastomas. The classification accuracy of the 20-gene and 5-gene classifiers varied between 90.7-91% and 85.9-87.7%, respectively. With this work, we propose a cost-efficient three-class (classical, mesenchymal, and proneural) tool for glioblastoma molecular classification based on the mRNA analysis of only 5-20 genes, and we provide the basic information for classification performance starting from the wet-lab stage. We hope that the proposed classification tool will enable data comparability between different research groups.
Project description:How to use bioinformatics methods to quickly and accurately locate the effective targets of traditional Chinese medicine monomer (TCM) is still an urgent problem needing to be solved. Here, we used high-throughput sequencing to identify the genes that were up-regulated after cells were treated with TCM monomers and used bioinformatics methods to analyze which transcription factors activated these genes. Then, the binding proteins of these transcription factors were analyzed and cross-analyzed with the docking proteins predicted by small molecule reverse docking software to quickly and accurately determine the monomer's targets. Followeding this method, we predicted that the TCM monomer Daphnoretin (DT) directly binds to JAK2 with a binding energy of -5.43 kcal/mol, and activates the JAK2/STAT3 signaling transduction pathway. Subsequent Western blotting and in vitro binding and kinase experiments further validated our bioinformatics predictions. Our method provides a new approach for quickly and accurately locating the effective targets of TCM monomers, and we also have discovered for the first time that TCM monomer DT is an agonist of JAK2.
Project description:BackgroundOur aim was to build a skill assessment system, providing objective feedback to trainees based on the motion metrics of laparoscopic surgical instruments.MethodsParticipants performed tissue dissection around the aorta (tissue dissection task) and renal parenchymal closure (parenchymal-suturing task), using swine organs in a box trainer under a motion capture (Mocap) system. Two experts assessed the recorded movies, according to the formula of global operative assessment of laparoscopic skills (GOALS: score range, 5-25), and the mean scores were utilized as objective variables in the regression analyses. The correlations between mean GOALS scores and Mocap metrics were evaluated, and potential Mocap metrics with a Spearman's rank correlation coefficient value exceeding 0.4 were selected for each GOALS item estimation. Four regression algorithms, support vector regression (SVR), principal component analysis (PCA)-SVR, ridge regression, and partial least squares regression, were utilized for automatic GOALS estimation. Model validation was conducted by nested and repeated k-fold cross validation, and the mean absolute error (MAE) was calculated to evaluate the accuracy of each regression model.ResultsForty-five urologic, 9 gastroenterological, and 3 gynecologic surgeons, 4 junior residents, and 9 medical students participated in the training. In both tasks, a positive correlation was observed between the speed-related parameters (e.g., velocity, velocity range, acceleration, jerk) and mean GOALS scores, with a negative correlation between the efficiency-related parameters (e.g., task time, path length, number of opening/closing operations) and mean GOALS scores. Among the 4 algorithms, SVR showed the highest accuracy in the tissue dissection task ([Formula: see text]), and PCA-SVR in the parenchymal-suturing task ([Formula: see text]), based on 100 iterations of the validation process of automatic GOALS estimation.ConclusionWe developed a machine learning-based GOALS scoring system in wet lab training, with an error of approximately 1-2 points for the total score, and motion metrics that were explainable to trainees. Our future challenges are the further improvement of onsite GOALS feedback, exploring the educational benefit of our model and building an efficient training program.
Project description:BackgroundDespite the large volume of genome sequencing data produced by next-generation sequencing technologies and the highly sophisticated software dedicated to handling these types of data, gaps are commonly found in draft genome assemblies. The existence of gaps compromises our ability to take full advantage of the genome data. This study aims to identify a practical approach for biologists to complete their own genome assemblies using commonly available tools and resources.ResultsA pipeline was developed to assemble complete genomes primarily from the next generation sequencing (NGS) data. The input of the pipeline is paired-end Illumina sequence reads, and the output is a high quality complete genome sequence. The pipeline alternates the employment of computational and biological methods in seven steps. It combines the strengths of de novo assembly, reference-based assembly, customized programming, public databases utilization, and wet lab experimentation. The application of the pipeline is demonstrated by the completion of a bacterial genome, Thermotoga sp. strain RQ7, a hydrogen-producing strain.ConclusionsThe developed pipeline provides an example of effective integration of computational and biological principles. It highlights the complementary roles that in silico and wet lab methodologies play in bioinformatical studies. The constituting principles and methods are applicable to similar studies on both prokaryotic and eukaryotic genomes.
Project description:IntroductionWe undertook a systematic review of the use of wet lab (animal and cadaveric) simulation models in urological training, with an aim to establishing a level of evidence (LoE) for studies and level of recommendation (LoR) for models, as well as evaluating types of validation.MethodsMedline, EMBASE, and Cochrane databases were searched for English-language studies using search terms including a combination of "surgery," "surgical training," and "medical education." These results were combined with "wet lab," "animal model," "cadaveric," and "in-vivo." Studies were then assigned a LoE and LoR if appropriate as per the education-modified Oxford Centre for Evidence-Based Medicine classification.ResultsA total of 43 articles met the inclusion criteria. There was a mean of 23.1 (±19.2) participants per study with a median of 20. Overall, the studies were largely of low quality, with 90.7% of studies being lower than LoE 2a (n=26 for LoE 2b and n=13 for LoE 3). The majority (72.1%, n=31) of studies were in animal models and 27.9% (n=12) were in cadaveric models.ConclusionsSimulation in urological education is becoming more prevalent in the literature, however, there is a focus on animal rather than cadaveric simulation, possibly due to cost and ethical considerations. Studies are also predominately of a low LoE; higher LoEs, especially randomized controlled studies, are needed.
Project description:The systematic perturbation of genomes using CRISPR/Cas9 deciphers gene function at an unprecedented rate, depth and ease. Commercially available sgRNA libraries typically contain tens of thousands of pre-defined constructs, resulting in a complexity challenging to handle. In contrast, custom sgRNA libraries comprise gene sets of self-defined content and size, facilitating experiments under complex conditions such as in vivo systems. To streamline and upscale cloning of custom libraries, we present CLUE, a bioinformatic and wet-lab pipeline for the multiplexed generation of pooled sgRNA libraries. CLUE starts from lists of genes or pasted sequences provided by the user and designs a single synthetic oligonucleotide pool containing various libraries. At the core of the approach, a barcoding strategy for unique primer binding sites allows amplifying different user-defined libraries from one single oligonucleotide pool. We prove the approach to be straightforward, versatile and specific, yielding uniform sgRNA distributions in all resulting libraries, virtually devoid of cross-contaminations. For in silico library multiplexing and design, we established an easy-to-use online platform at www.crispr-clue.de. All in all, CLUE represents a resource-saving approach to produce numerous high quality custom sgRNA libraries in parallel, which will foster their broad use across molecular biosciences.
Project description:Humanized monoclonal antibodies (mAbs) are among the most promising modern therapeutics, but defined engineering strategies are still not available. Antibody humanization often leads to a loss of affinity, as it is the case for our model antibody Ab2/3H6 (PDB entry 3BQU). Identifying appropriate back-to-mouse mutations is needed to restore binding affinity, but highly challenging. In order to get more insight, we have applied molecular dynamics simulations and correlated them to antibody binding and expression in wet lab experiments. In this study, we discuss six mAb variants and investigate a tyrosine conglomeration, an isopolar substitution and the improvement of antibody binding towards wildtype affinity. In the 3D structure of the mouse wildtype, residue R94h is surrounded by three tyrosines which form a so-called 'tyrosine cage'. We demonstrate that the tyrosine cage has a supporting function for the CDRh3 loop conformation. The isopolar substitution is not able to mimic the function appropriately. Finally, we show that additional light chain mutations can restore binding to wildtype-comparable level, and also improve the expression of the mAb significantly. We conclude that the variable light chain of Ab2/3H6 is of underestimated importance for the interaction with its antigen mAb 2F5.
Project description:The varietal authentication of wines is fundamental for assessing wine quality, and it is part of its compositional profiling. The availability of historical, cultural and chemical composition information is extremely important for quality evaluation. DNA-based techniques are a powerful tool for proving the varietal composition of a wine. SSR-amplification of genomic residual Vitis vinifera DNA, namely Wine DNA Fingerprinting (WDF) is able to produce strong, analytical evidence concerning the monovarietal nature of a wine, and for blended wines by generating the probability of the presence/absence of a certain variety, all in association with a dedicated bioinformatics elaboration of genotypes associated with possible varietal candidates. Together with WDF we could exploit Bioinformatics techniques, due to the number of grape genomes grown. In this paper, the use of WDF and the development of a bioinformatics tool for allelic data validation, retrieved from the amplification of 7 to 10 SSRs markers in the Vitis vinifera genome, are reported. The wines were chosen based on increasing complexity; from monovarietal, experimental ones, to commercial monovarietals, to blended commercial wines. The results demonstrate that WDF, after calculation of different distance matrices and Neighbor-Joining input data, followed by Principal Component Analysis (PCA) can effectively describe the varietal nature of wines. In the unknown blended wines the WDF profiles were compared to possible varietal candidates (Merlot, Pinot Noir, Cabernet Sauvignon and Zinfandel), and the output graphs show the most probable varieties used in the blend as closeness to the tested wine. This pioneering work should be meant as to favor in perspective the multidisciplinary building-up of on-line databanks and bioinformatics toolkits on wine. The paper concludes with a discussion on an integrated decision support system based on bioinformatics, chemistry and cultural data to assess wine quality.
Project description:Whole genome sequencing of bacterial isolates has become a daily task in many laboratories, generating incredible amounts of data. However, data acquisition is not an end in itself; the goal is to acquire high-quality data useful for understanding genetic relationships. Having a method that could rapidly determine which of the many available run metrics are the most important indicators of overall run quality and having a way to monitor these during a given sequencing run would be extremely helpful to this effect. Therefore, we compared various run metrics across 486 MiSeq runs, from five different machines. By performing a statistical analysis using principal components analysis and a K-means clustering algorithm of the metrics, we were able to validate metric comparisons among instruments, allowing for the development of a predictive algorithm, which permits one to observe whether a given MiSeq run has performed adequately. This algorithm is available in an Excel spreadsheet: that is, MiSeq Instrument & Run (In-Run) Forecast. Our tool can help verify that the quantity/quality of the generated sequencing data consistently meets or exceeds recommended manufacturer expectations. Patterns of deviation from those expectations can be used to assess potential run problems and plan preventative maintenance, which can save valuable time and funding resources.