Filter by:
Spanish; Castilian (10)
Ukrainian (10)
English (10)
French (10)
Italian (10)
German (9)
Portuguese (8)
Russian (7)
Swedish (7)
Polish (6)
Bulgarian (5)
Chinese (5)
Croatian (5)
Czech (5)
Danish (5)
Dutch; Flemish (5)
Korean (5)
Latvian (5)
Slovak (5)
Vietnamese (5)
Estonian (4)
Lithuanian (4)
Finnish (3)
Hungarian (3)
Irish (3)
Norwegian Bokmål (3)
Persian (3)
Turkish (3)
Albanian (2)
Arabic (2)
Bengali (2)
Indonesian (2)
Slovenian (2)
Tamil (2)
Urdu (2)
Afrikaans (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bosnian (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Macedonian (1)
Malayalam (1)
Maltese (1)
Norwegian (1)
Serbian (1)
Tagalog (1)
Tai languages (1)
Telugu (1)
Thai (1)
Corpus (10)
Text (10)
CC- BY- SA-3.0 (2)
Open Under- PSI (2)
CC- BY-3.0 (1)
CC- BY-4.0 (1)
CC- BY- NC-4.0 (1)
Public Domain (1)
Multilingual (10)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
10 Language Resources
Order by:
COVID-19 CDC dataset v2. Multilingual (EN, ES, FR, PT, IT, DE, KO, RU, ZH, UK, VI)
18
26
- Chinese
- English
- French
- German
- Italian
- Korean
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Ukrainian
- Vietnamese
- Public Domain
COVID-19 Government of Canada dataset v2. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO, KO, RU, ZH, UK, VI, TA, TL)
10
27
- Chinese
- English
- French
- German
- Italian
- Korean
- Modern Greek (1453-)
- Philippine languages
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Tamil
- Ukrainian
- Vietnamese
- CC-BY-NC-4.0
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
103
193
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
13
36
- Arabic
- Chinese
- English
- French
- Italian
- Korean
- Persian
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-SA-4.0
EU acts in Ukrainian
47
90
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Ukrainian
- CC-BY-4.0
Global Voices monolingual collections of COVID-19-related documents.
36
53
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Spanish; Castilian
- Swedish
- Ukrainian
- CC-BY-3.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
17
42
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
7
18
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- French
- German
- Irish
- Italian
- Latvian
- Lithuanian
- Norwegian Bokmål
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Ukrainian
- Open Under-PSI
Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
14
43
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- French
- German
- Irish
- Italian
- Latvian
- Lithuanian
- Norwegian Bokmål
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Ukrainian
- Open Under-PSI
Wikipedia monolingual collections of COVID-19 and health-related documents.
30
54
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Spanish; Castilian
- Swedish
- Ukrainian
- CC-BY-SA-3.0