Filter by:
Italian (55)
Spanish; Castilian (55)
English (55)
French (54)
German (52)
Portuguese (49)
Modern Greek (1453-) (46)
Polish (45)
Swedish (42)
Czech (41)
Dutch; Flemish (41)
Croatian (38)
Bulgarian (37)
Latvian (37)
Slovak (37)
Estonian (36)
Finnish (36)
Lithuanian (36)
Danish (35)
Hungarian (35)
Slovenian (35)
Maltese (31)
Irish (26)
Norwegian Bokmål (19)
Icelandic (14)
Russian (13)
Ukrainian (10)
Albanian (8)
Arabic (7)
Turkish (6)
Chinese (5)
Korean (5)
Macedonian (5)
Persian (5)
Vietnamese (5)
Bengali (3)
Indonesian (2)
Norwegian (2)
Serbian (2)
Tamil (2)
Urdu (2)
Afrikaans (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bosnian (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Malagasy (1)
Malayalam (1)
Tagalog (1)
Tai languages (1)
Telugu (1)
Thai (1)
Corpus (54)
Text (55)
CC- BY-4.0 (20)
CC- BY- NC- ND-4.0 (13)
Open Under- PSI (7)
Public Domain (5)
CC- BY-3.0 (2)
CC- BY- NC-4.0 (2)
CC- BY- SA-3.0 (2)
Multilingual (55)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
55 Language Resources (Page 1 of 3)
« Previous | Next »Order by:
Bilinguis Free Books v.1.04. Multilingual (CS, DE, EN, ES, FI, FR, IT, NL, PL, PT) corpus from the http://bilinguis.com/ website.
23
49
- Czech
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Polish
- Portuguese
- Spanish; Castilian
- Public Domain
COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)
66
166
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 CDC dataset v1. Multilingual (EN, ES, FR, PT, IT)
1
23
- English
- French
- Italian
- Portuguese
- Spanish; Castilian
- Public Domain
COVID-19 CDC dataset v2. Multilingual (EN, ES, FR, PT, IT, DE, KO, RU, ZH, UK, VI)
21
33
- Chinese
- English
- French
- German
- Italian
- Korean
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Ukrainian
- Vietnamese
- Public Domain
COVID-19 EC-EUROPA v1 dataset. Multilingual (CEF languages)
27
106
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 EU presscorner v1 dataset. Multilingual (CEF languages)
25
99
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 EU presscorner v2 dataset. Multilingual (CEF languages)
31
97
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 EUR-LEX dataset . Multilingual (CEF languages)
48
105
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 EUROPARL dataset v1. Multilingual (24 CEF languages)
25
85
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 EUROPARL dataset v2. Multilingual (24 CEF languages)
25
94
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 Government of Canada dataset v1. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO)
2
9
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Spanish; Castilian
- CC-BY-NC-4.0
COVID-19 Government of Canada dataset v2. Multilingual (EN, FR, DE, ES, EL, IT, PL, PT, RO, KO, RU, ZH, UK, VI, TA, TL)
17
37
- Chinese
- English
- French
- German
- Italian
- Korean
- Modern Greek (1453-)
- Philippine languages
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Tamil
- Ukrainian
- Vietnamese
- CC-BY-NC-4.0
COVID-19 HEALTH-AU dataset. Multilingual (EN, ES, IT, EL)
2
18
- English
- Italian
- Modern Greek (1453-)
- Spanish; Castilian
- Open Under-PSI
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
108
205
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
COVID-19 OSHA-EUROPA dataset v1. Multilingual (CEF languages plus IS and NB but not Irish)
17
32
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
23
103
- Arabic
- Bengali
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Malagasy
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Spanish; Castilian
- CC-BY-3.0
COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in Moses-like format
2
11
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in TMX format
8
16
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in TSV format
4
9
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
16
45
- Arabic
- Chinese
- English
- French
- Italian
- Korean
- Persian
- Philippine languages
- Portuguese
- Russian
- Spanish; Castilian
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-SA-4.0
« Previous | Next »