Ontology highlight
ABSTRACT:
SUBMITTER: Ashraf N
PROVIDER: S-EPMC9044368 | biostudies-literature | 2022
REPOSITORIES: biostudies-literature
Ashraf Noman N Khan Lal L Butt Sabur S Chang Hsien-Tsung HT Sidorov Grigori G Gelbukh Alexander A
PeerJ. Computer science 20220422
Urdu is a widely used language in South Asia and worldwide. While there are similar datasets available in English, we created the first multi-label emotion dataset consisting of 6,043 tweets and six basic emotions in the Urdu Nastalíq script. A multi-label (ML) classification approach was adopted to detect emotions from Urdu. The morphological and syntactic structure of Urdu makes it a challenging problem for multi-label emotion detection. In this paper, we build a set of baseline classifiers su ...[more]