Multilingual (EN, MK, SQ) corpus from websites of government of North Macedonia v.1.0 
Multilingual dataset (EN, MK, SQ) based on the content of websites of the government of North Macedonia. It includes 177103 Translation Units in total. It was generated by crawling the websites in February 2021, detecting pairs of parallel documents, identifying parallel sentence pairs and filtering the results. The number of TUs are:
75701 en-mk
23412 en-sq
77990 mk-sq
People who looked at this resource also viewed the following:
- English-Albanian corpus from websites of national Agencies v.1.0
- HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
- Compilation of Latvian-Maltese parallel corpora resources used for training of NTEU Machine Translation engines.
- Manufactured data based on ParaCrawl release 8 German-English, law terms