ABSTRACT: Colon adenocarcinoma (COAD) is among the most common digestive system malignancies worldwide, yet its pathogenesis and gene signatures remain unclear. This study explored the genetic characteristics and molecular mechanisms underlying colon cancer development. Three gene expression data sets were obtained from the Gene Expression Omnibus (GEO) database. GEO2R was used to identify differentially expressed genes (DEGs) between COAD and normal tissues, and the intersection of the DEGs across the data sets was taken. Metascape was used for functional enrichment analysis, and STRING was used to build protein-protein interaction (PPI) networks. Hub genes were identified and analysed using Cytoscape, followed by survival analysis and expression analysis of the hub genes. ROC curve analysis was performed to further test their diagnostic efficacy. Finally, alterations in the hub genes were predicted and analysed with cBioPortal. Altogether, 436 DEGs were detected. The DEGs were mainly enriched in cell cycle phase transition, nuclear division, meiotic nuclear division, and cytokinesis. Based on the PPI networks, 20 hub genes were selected. Among them, 6 hub genes (CCNB1, CCNA2, AURKA, NCAPG, DLGAP5, and CENPE) showed significant prognostic value in colon cancer (P < 0.05), while 5 hub genes (CDK1, CCNB1, CCNA2, MAD2L1, and DLGAP5) were associated with early colon cancer diagnosis, and ROC curve analysis showed good diagnostic accuracy. In conclusion, integrated bioinformatics analysis identified hub genes that may reveal the potential mechanisms of carcinogenesis and progression of colon cancer. These hub genes might serve as novel biomarkers for the early diagnosis, treatment, and prognosis of colon cancer.
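
Two of the computational steps summarised above, intersecting DEG lists across data sets and scoring a gene's diagnostic accuracy by the area under the ROC curve, can be illustrated with a minimal sketch. This is not the study's pipeline (which used GEO2R and other tools). The helper names `intersect_degs` and `roc_auc`, the placeholder genes `GENE_X` and `GENE_Y`, and all expression values are hypothetical and chosen only for illustration.

```python
def intersect_degs(deg_lists):
    """Return the genes reported as DEGs in every data set."""
    sets = [set(genes) for genes in deg_lists]
    return set.intersection(*sets)


def roc_auc(tumor_values, normal_values):
    """Area under the ROC curve via the Mann-Whitney formulation.

    Equals the probability that a randomly chosen tumor sample has a
    higher expression value than a randomly chosen normal sample,
    with ties counted as one half.
    """
    wins = 0.0
    for t in tumor_values:
        for n in normal_values:
            if t > n:
                wins += 1.0
            elif t == n:
                wins += 0.5
    return wins / (len(tumor_values) * len(normal_values))


# Toy DEG lists for three hypothetical data sets (not the study's data).
deg_lists = [
    ["CDK1", "CCNB1", "AURKA", "GENE_X"],
    ["CDK1", "CCNB1", "AURKA", "GENE_Y"],
    ["CDK1", "CCNB1", "GENE_X", "GENE_Y"],
]
common = intersect_degs(deg_lists)

# Toy expression values for one gene in tumor and normal samples (invented).
auc = roc_auc([8.1, 7.9, 9.0, 6.5], [5.2, 6.0, 6.5, 4.8])

print(sorted(common))
print(auc)
```

An AUC near 1 means the gene's expression separates tumor from normal samples almost perfectly, while 0.5 corresponds to chance. In practice an established statistics library would usually replace the hand-written loop.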