16 Language Resources

 Bilingual (EN, MK) corpus v.1.02 based on WikiMatrix
Number of downloads 3 Number of views 17
  • English
  • Macedonian
  • CC-BY-SA-3.0
 Bilingual (EN, MK) corpus v.1.05 based on WikiMatrix
Number of downloads 2 Number of views 17
  • English
  • Macedonian
  • CC-BY-SA-3.0
 COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-MK)
Number of downloads 4 Number of views 34
  • English
  • Macedonian
  • CC-BY-SA-3.0
 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 102 Number of views 190
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 22
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 92
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in Moses format.
Number of downloads 5 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
Number of downloads 4 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in Moses-like format.
Number of downloads 5 Number of views 12
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
Number of downloads 15 Number of views 21
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI