Filter by:
Russian (31)
English (31)
French (21)
German (21)
Italian (15)
Portuguese (15)
Spanish; Castilian (15)
Polish (14)
Czech (11)
Arabic (10)
Ukrainian (10)
Swedish (9)
Bulgarian (8)
Dutch; Flemish (8)
Latvian (8)
Slovak (8)
Croatian (7)
Estonian (7)
Lithuanian (7)
Norwegian Bokmål (7)
Danish (6)
Finnish (6)
Vietnamese (6)
Chinese (5)
Korean (5)
Macedonian (5)
Persian (5)
Turkish (5)
Albanian (4)
Hungarian (4)
Icelandic (4)
Slovenian (4)
Bengali (3)
Indonesian (2)
Irish (2)
Maltese (2)
Serbian (2)
Tamil (2)
Urdu (2)
Afrikaans (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bosnian (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Malagasy (1)
Malayalam (1)
Norwegian (1)
Tagalog (1)
Tai languages (1)
Telugu (1)
Thai (1)
Corpus (31)
Text (31)
Open Under- PSI (8)
CC- BY-3.0 (5)
CC- BY-4.0 (5)
CC- BY- SA-3.0 (2)
CC- BY- NC-4.0 (1)
Public Domain (1)
Multilingual (25)
Bilingual (6)
Parallel (31)
SOCIAL QUESTIONS (25)
SCIENCE (6)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
31 Language Resources (Page 1 of 2)
« Previous | Next »Order by:
COVID-19 CDC dataset v2. Multilingual (EN, ES, FR, PT, IT, DE, KO, RU, ZH, UK, VI)
18
25
- Chinese
- English
- French
- German
- Italian
- Korean
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Ukrainian
- Vietnamese
- Public Domain
COVID-19 Government of Canada dataset v2. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO, KO, RU, ZH, UK, VI, TA, TL)
10
26
- Chinese
- English
- French
- German
- Italian
- Korean
- Modern Greek (1453-)
- Philippine languages
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Tamil
- Ukrainian
- Vietnamese
- CC-BY-NC-4.0
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
103
191
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
23
98
- Arabic
- Bengali
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Malagasy
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Spanish; Castilian
- CC-BY-3.0
COVID-19 POLISH-GOV v2 dataset. Multilingual (EN, PL, FR, DE, VI, RU, UK)
6
13
- English
- French
- German
- Polish
- Russian
- Ukrainian
- Vietnamese
- Open Under-PSI
COVID-19 UDSC-PL dataset. Multilingual (EN, PL, RU, UK)
5
22
- English
- Polish
- Russian
- Ukrainian
- Open Under-PSI
COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
13
35
- Arabic
- Chinese
- English
- French
- Italian
- Korean
- Persian
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-SA-4.0
COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
2
15
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
11
28
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
COVID-19 WIPO dataset v1. Multilingual (EN, ES, FR, DE, PT, RU)
1
17
- English
- French
- German
- Portuguese
- Russian
- Spanish; Castilian
- CC-BY-3.0
COVID-19 WIPO dataset v2. Multilingual (EN, ES, FR, DE, PT, RU, AR, ZH)
4
10
- English
- French
- German
- Portuguese
- Russian
- Spanish; Castilian
- CC-BY-3.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
17
41
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TMX format.
9
25
- Arabic
- English
- French
- German
- Russian
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1b (v.1.05) in TSV/Moses-like format.
3
10
- Arabic
- English
- French
- German
- Russian
- CC-BY-4.0
« Previous | Next »