7 Language Resources

Order by:

 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 108 Number of views 205
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 21 Number of views 59
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 OpenEdition culture-related publications. Multilingual (AR, DE, EL, EN, ES, FR, HR, IT, NL, PL, PT, RO, RU, SL, SV) collection of TMX files.
Number of downloads 7 Number of views 44
  • Arabic
  • Croatian
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 25
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 44 Number of views 111
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
Number of downloads 8 Number of views 21
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Number of downloads 14 Number of views 51
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI