CST Lemmatiser

CST's lemmatiser reduces all words in a text to their base form, the lemma. It is language independent in the sense that it can be trained for different inflected languages. What is needed is a list of lemmas, their infected forms and, if possible, their POS-tag. The online version of the tool is available for Danish, Dutch, English, French, German, Greek, Icelandic, Latin, Polish, and Russian.


Languages: French (fr), German (de), Dutch; Flemish (nl), Icelandic (is), Polish (pl), Modern Greek (1453-) (el), Danish (da), Russian (ru), English (en)