Resource Type:

Corpus:
Lexical/Conceptual:
Language Description:
Tool/Service:

11 Language Resources

Order by:

 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 103 Number of views 191
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
Number of downloads 23 Number of views 98
  • Arabic
  • Bengali
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Malagasy
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 2 Number of views 15
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 11 Number of views 28
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 16 Number of views 41
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 22
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 92
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in Moses format.
Number of downloads 5 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
Number of downloads 4 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
Number of downloads 6 Number of views 18
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Number of downloads 14 Number of views 43
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI