Filter by:
English (6)
French (6)
Italian (5)
German (4)
Portuguese (4)
Russian (4)
Swedish (4)
Albanian (3)
Bulgarian (3)
Croatian (3)
Czech (3)
Estonian (3)
Finnish (3)
Hungarian (3)
Latvian (3)
Lithuanian (3)
Macedonian (3)
Polish (3)
Slovak (3)
Slovenian (3)
Ukrainian (3)
Arabic (2)
Chinese (2)
Icelandic (2)
Korean (2)
Norwegian Bokmål (2)
Persian (2)
Turkish (2)
Vietnamese (2)
Afrikaans (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bengali (1)
Bosnian (1)
Danish (1)
Dutch; Flemish (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Indonesian (1)
Malayalam (1)
Norwegian (1)
Serbian (1)
Tagalog (1)
Tamil (1)
Telugu (1)
Thai (1)
Urdu (1)
Multilingual (6)
Corpus (6)
Text (6)
SOCIAL QUESTIONS (4)
SCIENCE (2)
E Health (4)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
6 Language Resources
Order by:
COVID-19 GEMR of UNESCO dataset. Multilingual (EN, FR, ES)
2
16
- English
- French
- Spanish; Castilian
- CC-BY-SA-4.0
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
103
193
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
13
35
- Arabic
- Chinese
- English
- French
- Italian
- Korean
- Persian
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
5
22
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
40
94
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
Wikipedia monolingual collections of COVID-19 and health-related documents.
30
54
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Spanish; Castilian
- Swedish
- Ukrainian
- CC-BY-SA-3.0