Latvian and English monolingual corpus from Latvian web resources

"Latvian and English monolingual corpus from Latvian web resources" compiled from corpora listed in ReadMe file by Consortium of National Language Technology Platform (NLTP) Project (Action number: 2018-EU-IA-0082). Published under CC-BY-SA-4.0 license.'}

Monolingual corpus Latvian web resources collected during NLTP project.
Resource size:
Latvian : 153 667 sentences, 2 106 839 words
English: 80 403 sentences, 1 080 017 words