5 Language Resources

 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads: 108. Number of views: 205.
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • License: CC-BY-SA-3.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads: 5. Number of views: 25.
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • License: CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads: 44. Number of views: 111.
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • License: CC-BY-NC-SA-4.0
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in Moses-like format.
Number of downloads: 5. Number of views: 13.
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • License: Open Under-PSI
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
Number of downloads: 16. Number of views: 23.
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • License: Open Under-PSI