Ontology highlight
ABSTRACT:
SUBMITTER: Salehi AR
PROVIDER: S-EPMC10908853 | biostudies-literature | 2024 Mar
REPOSITORIES: biostudies-literature
Salehi Amir Reza AR Khedmati Majid M
Scientific reports 20240302 1
In this paper, a Cluster-based Synthetic minority oversampling technique (SMOTE) Both-sampling (CSBBoost) ensemble algorithm is proposed for classifying imbalanced data. In this algorithm, a combination of over-sampling, under-sampling, and different ensemble algorithms, including Extreme Gradient Boosting (XGBoost), random forest, and bagging, is employed in order to achieve a balanced dataset and address the issues including redundancy of data after over-sampling, information loss in under-sam ...[more]