Latvian and English monolingual corpus from Latvian web resources 
"Latvian and English monolingual corpus from Latvian web resources", compiled from the corpora listed in the ReadMe file by the Consortium of the National Language Technology Platform (NLTP) Project (Action number: 2018-EU-IA-0082). Published under the CC-BY-SA-4.0 license.
Monolingual corpus from Latvian web resources, collected during the NLTP project.
Resource size:
Latvian: 153 667 sentences, 2 106 839 words
English: 80 403 sentences, 1 080 017 words
People who looked at this resource also viewed the following:
- Compilation of German-French parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- Albanian web corpus MaCoCu-sq 1.0
- Compilation of Bulgarian-Italian parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- ParaCrawl release 8 Danish-English - deferred files