Project description:ObjectiveThe cause and mechanism of non-obstructive azoospermia (NOA) is complicated; therefore, an effective therapy strategy is yet to be developed. This study aimed to analyse the pathogenesis of NOA at the molecular biological level and to identify the core regulatory genes, which could be utilised as potential biomarkers.MethodsThree NOA microarray datasets (GSE45885, GSE108886, and GSE145467) were collected from the GEO database and merged into training sets; a further dataset (GSE45887) was then defined as the validation set. Differential gene analysis, consensus cluster analysis, and WGCNA were used to identify preliminary signature genes; then, enrichment analysis was applied to these previously screened signature genes. Next, 4 machine learning algorithms (RF, SVM, GLM, and XGB) were used to detect potential biomarkers that are most closely associated with NOA. Finally, a diagnostic model was constructed from these potential biomarkers and visualised as a nomogram. The differential expression and predictive reliability of the biomarkers were confirmed using the validation set. Furthermore, the competing endogenous RNA network was constructed to identify the regulatory mechanisms of potential biomarkers; further, the CIBERSORT algorithm was used to calculate immune infiltration status among the samples.ResultsA total of 215 differentially expressed genes (DEGs) were identified between NOA and control groups (27 upregulated and 188 downregulated genes). The WGCNA results identified 1123 genes in the MEblue module as target genes that are highly correlated with NOA positivity. The NOA samples were divided into 2 clusters using consensus clustering; further, 1027 genes in the MEblue module, which were screened by WGCNA, were considered to be target genes that are highly correlated with NOA classification. The 129 overlapping genes were then established as signature genes. The XGB algorithm that had the maximum AUC value (AUC=0.946) and the minimum residual value was used to further screen the signature genes. IL20RB, C9orf117, HILS1, PAOX, and DZIP1 were identified as potential NOA biomarkers. This 5 biomarker model had the highest AUC value, of up to 0.982, compared to other single biomarker models; additionally, the results of this biomarker model were verified in the validation set.ConclusionsAs IL20RB, C9orf117, HILS1, PAOX, and DZIP1 have been determined to possess the strongest association with NOA, these five genes could be used as potential therapeutic targets for NOA patients. Furthermore, the model constructed using these five genes, which possessed the highest diagnostic accuracy, may be an effective biomarker model that warrants further experimental validation.

Project description:BackgroundBreast cancer (BC) ranks first in incidence among women, with approximately 2 million new cases per year. Therefore, it is essential to investigate emerging targets for BC patients' diagnosis and prognosis.MethodsWe analyzed gene expression data from 99 normal and 1,081 BC tissues in The Cancer Genome Atlas (TCGA) database. Differentially expressed genes (DEGs) were identified using "limma" R package, and relevant modules were chosen through Weighted Gene Coexpression Network Analysis (WGCNA). Intersection genes were obtained by matching DEGs to WGCNA module genes. Functional enrichment studies were performed on these genes using Gene Ontology (GO), Disease Ontology (DO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Biomarkers were screened via Protein-Protein Interaction (PPI) networks and multiple machine-learning algorithms. The Gene Expression Profiling Interactive Analysis (GEPIA), The University of ALabama at Birmingham CANcer (UALCAN), and Human Protein Atlas (HPA) databases were employed to examine mRNA and protein expression of eight biomarkers. Kaplan-Meier mapper tool assessed their prognostic capabilities. Key biomarkers were analyzed via single-cell sequencing, and their relationship with immune infiltration was examined using Tumor Immune Estimation Resource (TIMER) database and "xCell" R package. Lastly, drug prediction was conducted based on the identified biomarkers.ResultsWe identified 1,673 DEGs and 542 important genes through differential analysis and WGCNA, respectively. Intersection analysis revealed 76 genes, which play significant roles in immune-related viral infection and IL-17 signaling pathways. DIX domain containing 1 (DIXDC1), Dual specificity phosphatase 6 (DUSP6), Pyruvate dehydrogenase kinase 4 (PDK4), C-X-C motif chemokine ligand 12 (CXCL12), Interferon regulatory factor 7 (IRF7), Integrin subunit alpha 7 (ITGA7), NIMA related kinase 2 (NEK2), and Nuclear receptor subfamily 3 group C member 1 (NR3C1) were selected as BC biomarkers using machine-learning algorithms. NEK2 was the most critical gene for diagnosis. Prospective drugs targeting NEK2 include etoposide and lukasunone.ConclusionsOur study identified DIXDC1, DUSP6, PDK4, CXCL12, IRF7, ITGA7, NEK2, and NR3C1 as potential diagnostic biomarkers for BC, with NEK2 having the highest potential to aid in diagnosis and prognosis in clinical settings.

Project description:ObjectiveAbdominal aortic aneurysm (AAA) is a life-threatening vascular condition. This study aimed to discover new indicators for the early detection of AAA and explore the possible involvement of immune cell activity in its development.MethodsSourced from the Gene Expression Omnibus, the AAA microarray datasets GSE47472 and GSE57691 were combined to generate the training set. Additionally, a separate dataset (GSE7084) was designated as the validation set. Enrichment analyses were carried out to explore the underlying biological mechanisms using Disease Ontology, Kyoto Encyclopedia of Genes and Genomes, and Gene Ontology. We then utilized weighted gene co-expression network analysis (WGCNA) along with 3 machine learning techniques: least absolute shrinkage and selection operator, support vector machine-recursive feature elimination, and random forest, to identify feature genes for AAA. Moreover, data were validated using the receiver operating characteristic (ROC) curve, with feature genes defined as those having an area under the curve above 85% and a p-value below 0.05. Finally, the single sample gene set enrichment analysis algorithm was applied to probe the immune landscape in AAA and its connection to the selected feature genes.ResultsWe discovered 72 differentially expressed genes (DEGs) when comparing healthy and AAA samples, including 36 upregulated and 36 downregulated genes. Functional enrichment analysis revealed that the DEGs associated with AAA are primarily involved in inflammatory regulation and immune response. By intersecting the result of 3 machine learning algorithms and WGCNA, 3 feature genes were identified, including MRAP2, PPP1R14A, and PLN genes. The diagnostic performance of all these genes was strong, as revealed by the ROC analysis. A significant increase in 15 immune cell types in AAA samples was observed, based on the analysis of immune cell infiltration. In addition, the 3 feature genes show a strong linkage with different types of immune cells.ConclusionThree feature genes (MRAP2, PPP1R14A, and PLN) related to the development of AAA were identified. These genes are linked to immune cell activity and the inflammatory microenvironment, providing potential biomarkers for early detection and a basis for further research into AAA progression.

Project description:BackgroundAs the leading cause of chronic kidney disease, diabetic kidney disease (DKD) is an enormous burden for all healthcare systems around the world. However, its early diagnosis has no effective methods.MethodsFirst, gene expression data in GEO database were extracted, and the differential genes of diabetic tubulopathy were obtained. Immune-related genesets were generated by WGCNA and immune cell infiltration analyses. Then, differentially expressed immune-related cuproptosis genes (DEICGs) were derived by the intersection of differential genes and genes related to cuproptosis and immune. To investigate the functions of DEICGs, volcano plots and GO term enrichment analysis was performed. Machine learning and protein-protein interaction (PPI) network analysis helped to finally screen out hub genes. The diagnostic efficacy of them was evaluated by GSEA analysis, receiver operating characteristic (ROC) curve, single-cell RNA sequencing and the Nephroseq website. The expression of hub genes at the animal level by STZ -induced and db/db DKD mouse models was further verified.ResultsFinally, three hub genes, including FSTL1, CX3CR1 and AGR2 that were up-regulated in both the test set GSE30122 and the validation set GSE30529, were screened. The areas under the curve (AUCs) of ROC curves of hub genes were 0.911, 0.935 and 0.922, respectively, and 0.946 when taking as a whole. Correlation analysis showed that the expression level of three hub genes demonstrated their negative relationship with GFR, while those of FSTL1 displayed a positive correlation with the level of serum creatinine. GSEA was enriched in inflammatory and immune-related pathways. Single-nucleus RNA sequencing indicated the main distribution of FSTL1 in podocyte and mesangial cells, the high expression of CX3CR1 in leukocytes and the main localization of AGR2 in the loop of Henle. In mouse models, all three hub genes were increased in both STZ-induced and db/db DKD models.ConclusionMachine learning was combined with WGCNA, immune cell infiltration and PPI analyses to identify three hub genes associated with cuproptosis, immunity and diabetic nephropathy, which all have great potential as diagnostic markers for DKD and even predict disease progression.

Dataset Information

Identifying functional subtypes of IgA nephropathy based on three machine learning algorithms and WGCNA

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets