Croatian and English monolingual corpus from Croatian web resources

"Croatian and English monolingual corpus from Croatian web resources" compiled from corpora listed in ReadMe file by Consortium of National Language Technology Platform (NLTP) Project (Action number: 2018-EU-IA-0082). Published under CC-BY-SA-4.0 license.'}

Monolingual corpus of Croatian web resources collected during NLTP project.
Resource size:
Croatian: 1131719 sentences, 24303220 words
English: 218436 sentences, 5711041 words