Dryad

IceMorph morphological analysis data files

Data files

Jun 09, 2014 version; files: 17.39 MB

Abstract

This dataset consists of four main resources: a concatenated dictionary of Old Icelandic, parsed for word class and inflectional detail; a corpus of Old Icelandic sagas in plain text, chunked by chapter; a tagged version of the same text, produced by the IceMorph system; and two training corpora, labeled "Expert" and "Gold", for training and testing a machine learning module.