Ontology highlight
ABSTRACT:
SUBMITTER: Badawi S
PROVIDER: S-EPMC10147969 | biostudies-literature | 2023 Jun
REPOSITORIES: biostudies-literature
Badawi Soran S Saeed Ari M AM Ahmed Sara A SA Abdalla Peshraw Ahmed PA Hassan Diyari A DA
Data in brief 20230413
The rapid growth of technology has massively increased the amount of text data. The data can be mined and utilized for numerous natural language processing (NLP) tasks, particularly text classification. The core part of text classification is collecting the data for predicting a good model. This paper collects Kurdish News Dataset Headlines (KNDH) for text classification. The dataset consists of 50000 news headlines which are equally distributed among five classes, with 10000 headlines for each ...[more]