ABSTRACT:
SUBMITTER: Zerrouki T
PROVIDER: S-EPMC5310197 | biostudies-literature | 2017 Apr
REPOSITORIES: biostudies-literature
Data in brief 20170203
Diacritics are often omitted in Arabic script. This omission is a handicap for new learners reading Arabic, for text-to-speech conversion systems, and for the reading and semantic analysis of Arabic texts. Automatic diacritization systems are the best solution to this problem, but such automation needs resources, namely diacritized texts, to train and evaluate these systems. In this paper, we describe our corpus of Arabic diacritized texts, called Tashkeela. It can be used as a linguisti ...