Bilingual (EN, MK) corpus v.1.02 based on WikiMatrix

Bilinugal dataset (EN, MK) based on the WikiMatrix coprus which is constructed as described in "WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia". It was filtered with the purpose of removing TUs with limited or no use. It includes 770067 Translation Units.