13 Language Resources

 COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AR)
Number of downloads 21 Number of views 68
  • Arabic
  • English
  • CC-BY-SA-3.0
 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 108 Number of views 205
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
Number of downloads 23 Number of views 103
  • Arabic
  • Bengali
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Malagasy
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
Number of downloads 16 Number of views 45
  • Arabic
  • Chinese
  • English
  • French
  • Italian
  • Korean
  • Persian
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-SA-4.0
 COVID-19 Voltaire dataset v1. Bilingual (EN-AR)
Number of downloads 5 Number of views 30
  • Arabic
  • English
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 2 Number of views 16
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 12 Number of views 35
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TMX format.
Number of downloads 9 Number of views 25
  • Arabic
  • English
  • French
  • German
  • Russian
  • CC-BY-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.0) in TMX format.
Number of downloads 3 Number of views 12
  • Arabic
  • English
  • French
  • German
  • Russian
  • CC-BY-4.0
 OpenEdition culture-related publications. Multilingual (AR, DE, EL, EN, ES, FR, HR, IT, NL, PL, PT, RO, RU, SL, SV) collection of TMX files.
Number of downloads 7 Number of views 44
  • Arabic
  • Croatian
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0