Filter by:

Corpus (15)
Text (15)
Parallel (15)
TMX (12)
Other (3)
ELRC3.0 (15)

Resource Type:

Corpus:
Lexical/Conceptual:
Language Description:
Tool/Service:

15 Language Resources

Order by:

 COVID-19 CDC dataset v2. Multilingual (EN, ES, FR, PT, IT, DE, KO, RU, ZH, UK, VI)
Number of downloads 18 Number of views 25
  • Chinese
  • English
  • French
  • German
  • Italian
  • Korean
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Ukrainian
  • Vietnamese
  • Public Domain
 COVID-19 Government of Canada dataset v2. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO, KO, RU, ZH, UK, VI, TA, TL)
Number of downloads 10 Number of views 26
  • Chinese
  • English
  • French
  • German
  • Italian
  • Korean
  • Modern Greek (1453-)
  • Philippine languages
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Tamil
  • Ukrainian
  • Vietnamese
  • CC-BY-NC-4.0
 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 103 Number of views 191
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
Number of downloads 23 Number of views 98
  • Arabic
  • Bengali
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Malagasy
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
Number of downloads 13 Number of views 35
  • Arabic
  • Chinese
  • English
  • French
  • Italian
  • Korean
  • Persian
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-SA-4.0
 COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 2 Number of views 15
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 11 Number of views 28
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 16 Number of views 41
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 OpenEdition culture-related publications. Multilingual (AR, DE, EL, EN, ES, FR, HR, IT, NL, PL, PT, RO, RU, SL, SV) collection of TMX files.
Number of downloads 7 Number of views 34
  • Arabic
  • Croatian
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 22
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 92
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in Moses format.
Number of downloads 5 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
Number of downloads 4 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
Number of downloads 6 Number of views 18
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Number of downloads 14 Number of views 43
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI