Filter by:

TMX (20)
Corpus (20)
Text (20)
Parallel (20)
ELRC3.0 (20)
COVID-19 (11)

Resource Type:

Corpus:
Lexical/Conceptual:
Language Description:
Tool/Service:

20 Language Resources

Order by:

 COVID-19 CDC dataset v2. Multilingual (EN, ES, FR, PT, IT, DE, KO, RU, ZH, UK, VI)
Number of downloads 18 Number of views 26
  • Chinese
  • English
  • French
  • German
  • Italian
  • Korean
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Ukrainian
  • Vietnamese
  • Public Domain
 COVID-19 Government of Canada dataset v2. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO, KO, RU, ZH, UK, VI, TA, TL)
Number of downloads 10 Number of views 27
  • Chinese
  • English
  • French
  • German
  • Italian
  • Korean
  • Modern Greek (1453-)
  • Philippine languages
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Tamil
  • Ukrainian
  • Vietnamese
  • CC-BY-NC-4.0
 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 103 Number of views 193
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
Number of downloads 23 Number of views 98
  • Arabic
  • Bengali
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Malagasy
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19 POLISH-GOV v2 dataset. Multilingual (EN, PL, FR, DE, VI, RU, UK)
Number of downloads 6 Number of views 13
  • English
  • French
  • German
  • Polish
  • Russian
  • Ukrainian
  • Vietnamese
  • Open Under-PSI
 COVID-19 UDSC-PL dataset. Multilingual (EN, PL, RU, UK)
Number of downloads 5 Number of views 23
  • English
  • Polish
  • Russian
  • Ukrainian
  • Open Under-PSI
 COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
Number of downloads 13 Number of views 36
  • Arabic
  • Chinese
  • English
  • French
  • Italian
  • Korean
  • Persian
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-SA-4.0
 COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 2 Number of views 15
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 11 Number of views 29
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 WIPO dataset v1. Multilingual (EN, ES, FR, DE, PT, RU)
Number of downloads 1 Number of views 18
  • English
  • French
  • German
  • Portuguese
  • Russian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19 WIPO dataset v2. Multilingual (EN, ES, FR, DE, PT, RU, AR, ZH)
Number of downloads 4 Number of views 11
  • English
  • French
  • German
  • Portuguese
  • Russian
  • Spanish; Castilian
  • CC-BY-3.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 17 Number of views 42
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TMX format.
Number of downloads 9 Number of views 25
  • Arabic
  • English
  • French
  • German
  • Russian
  • CC-BY-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.0) in TMX format.
Number of downloads 1 Number of views 11
  • Arabic
  • English
  • French
  • German
  • Russian
  • CC-BY-4.0
 OpenEdition culture-related publications. Multilingual (AR, DE, EL, EN, ES, FR, HR, IT, NL, PL, PT, RO, RU, SL, SV) collection of TMX files.
Number of downloads 7 Number of views 35
  • Arabic
  • Croatian
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 94
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar UK-EN-RU
Number of downloads 16 Number of views 59
  • English
  • Russian
  • Ukrainian
  • CC-BY-NC-SA-4.0
 Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
Number of downloads 4 Number of views 12
  • Bulgarian
  • Czech
  • Danish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Russian
  • Slovak
  • Swedish
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Number of downloads 14 Number of views 43
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI