Unknown

Dataset Information

0

Semi-supervised morphosyntactic classification of Old Icelandic.


ABSTRACT: We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.

SUBMITTER: Urban K 

PROVIDER: S-EPMC4100772 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Semi-supervised morphosyntactic classification of Old Icelandic.

Urban Kryztof K   Tangherlini Timothy R TR   Vijūnas Aurelijus A   Broadwell Peter M PM  

PloS one 20140716 7


We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantag  ...[more]

Similar Datasets

| S-EPMC8444075 | biostudies-literature
| S-EPMC4579132 | biostudies-literature
| S-EPMC8490428 | biostudies-literature
2022-09-03 | GSE168264 | GEO
| S-EPMC9178802 | biostudies-literature
| S-EPMC7856146 | biostudies-literature
| S-EPMC7551840 | biostudies-literature
| S-EPMC6638773 | biostudies-literature
| S-EPMC4537225 | biostudies-literature
2019-11-13 | GSE140262 | GEO