Filter by:
Dutch; Flemish (46)
French (46)
English (46)
German (46)
Portuguese (40)
Italian (39)
Polish (39)
Spanish; Castilian (39)
Czech (37)
Modern Greek (1453-) (35)
Croatian (34)
Swedish (34)
Bulgarian (33)
Danish (33)
Latvian (33)
Slovak (33)
Estonian (32)
Finnish (32)
Lithuanian (32)
Hungarian (31)
Slovenian (31)
Maltese (29)
Irish (24)
Norwegian Bokmål (17)
Icelandic (12)
Russian (8)
Albanian (6)
Arabic (6)
Turkish (5)
Ukrainian (5)
Persian (4)
Bengali (3)
Macedonian (3)
Chinese (2)
Indonesian (2)
Korean (2)
Norwegian (2)
Serbian (2)
Vietnamese (2)
Afrikaans (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bosnian (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Malagasy (1)
Malayalam (1)
Tagalog (1)
Tai languages (1)
Tamil (1)
Telugu (1)
Thai (1)
Urdu (1)
Corpus (46)
Text (46)
CC- BY-4.0 (16)
CC- BY- NC- ND-4.0 (13)
Open Under- PSI (13)
CC- BY-3.0 (1)
CC- BY- SA-3.0 (1)
Public Domain (1)
Multilingual (46)
Parallel (46)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
46 Language Resources (Page 2 of 3)
« Previous | Next »Order by:
COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
11
29
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
192
186
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
EU acts in Ukrainian
47
90
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Ukrainian
- CC-BY-4.0
EU press Corner 2000-2020 v.0.9 in Moses-like format
2
23
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2000-2020 v.0.9 in TMX format
8
26
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2000-2020 v.0.9 in TSV format
2
20
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2020 v.0.9 in Moses-like format
1
19
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2020 v.0.9 in TMX format
10
29
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2020 v.0.9 in TSV format
2
19
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
17
42
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part1, v.0).
19
35
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part 1 , v.1).
14
33
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual corpus from the European Vaccination Information Portal
49
126
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus from the Publications Office of the EU on the medical domain
69
124
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus from the Publications Office of the EU on the medical domain v.2
38
116
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TMX format.
2
8
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TSV/MOSES-like format.
2
11
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TMX format.
3
7
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TSV/MOSES-like format.
5
9
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020), provided in Moses format.
3
13
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0