13 Language Resources
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
108
205
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
23
103
- Arabic
- Bengali
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Malagasy
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Spanish; Castilian
- CC-BY-3.0
COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
16
45
- Arabic
- Chinese
- English
- French
- Italian
- Korean
- Persian
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-SA-4.0
COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
2
16
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
12
35
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TMX format.
9
25
- Arabic
- English
- French
- German
- Russian
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TSV/Moses-like format.
3
11
- Arabic
- English
- French
- German
- Russian
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.0) in TMX format.
3
12
- Arabic
- English
- French
- German
- Russian
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.0) in TSV/MOSES-like format.
1
9
- Arabic
- English
- German
- Russian
- CC-BY-4.0
OpenEdition culture-related publications. Multilingual (AR, DE, EL, EN, ES, FR, HR, IT, NL, PL, PT, RO, RU, SL, SV) collection of TMX files.
7
44
- Arabic
- Croatian
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0