NLTK 
NLTK is a platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning,
People who looked at this resource also viewed the following:
- Compilation of Portuguese-Romanian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Irish-Slovak parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Lithuanian-Polish parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- COVID-19 EU presscorner v2 dataset. Multilingual (CEF languages)