Ontology highlight
ABSTRACT:
SUBMITTER: Yasir M
PROVIDER: S-EPMC8683192 | biostudies-literature | 2021
REPOSITORIES: biostudies-literature
Yasir Muhammad M Chen Li L Khatoon Amna A Malik Muhammad Amir MA Abid Fazeel F
Computational intelligence and neuroscience 20211210
Mixed script identification is a hindrance for automated natural language processing systems. Mixing cursive scripts of different languages is a challenge because NLP methods like POS tagging and word sense disambiguation suffer from noisy text. This study tackles the challenge of mixed script identification for mixed-code dataset consisting of Roman Urdu, Hindi, Saraiki, Bengali, and English. The language identification model is trained using word vectorization and RNN variants. Moreover, throu ...[more]