Filter by:
French (42)
Latvian (42)
English (42)
German (42)
Bulgarian (41)
Czech (41)
Slovak (41)
Lithuanian (40)
Polish (40)
Spanish; Castilian (40)
Italian (39)
Portuguese (39)
Swedish (39)
Estonian (38)
Croatian (37)
Danish (37)
Finnish (37)
Modern Greek (1453-) (37)
Dutch; Flemish (35)
Hungarian (35)
Slovenian (34)
Maltese (33)
Irish (28)
Norwegian Bokmål (17)
Icelandic (16)
Albanian (9)
Russian (8)
Macedonian (7)
Ukrainian (5)
Bengali (2)
Chinese (2)
Indonesian (2)
Korean (2)
Norwegian (2)
Persian (2)
Turkish (2)
Vietnamese (2)
Afrikaans (1)
Arabic (1)
Azerbaijani (1)
Basque (1)
Belarusian (1)
Bosnian (1)
Esperanto (1)
Galician (1)
Hebrew (1)
Hindi (1)
Malayalam (1)
Serbian (1)
Tagalog (1)
Tai languages (1)
Tamil (1)
Telugu (1)
Thai (1)
Urdu (1)
Corpus (42)
Text (42)
Multilingual (42)
Parallel (42)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
42 Language Resources (Page 2 of 3)
« Previous | Next »Order by:
EU press Corner 2020 v.0.9 in Moses-like format
1
19
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2020 v.0.9 in TMX format
10
29
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
EU press Corner 2020 v.0.9 in TSV format
2
19
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-ND-4.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
17
42
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part1, v.0).
19
35
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part 1 , v.1).
14
33
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual corpus from the European Vaccination Information Portal
49
126
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus from the Publications Office of the EU on the medical domain
69
124
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus from the Publications Office of the EU on the medical domain v.2
38
116
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TMX format.
2
8
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TSV/MOSES-like format.
2
11
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TMX format.
3
7
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TSV/MOSES-like format.
5
9
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020), provided in Moses format.
3
13
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
5
22
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
40
95
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in Moses format.
5
12
- Bulgarian
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Russian
- Slovak
- Swedish
- Open Under-PSI
Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
4
12
- Bulgarian
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Russian
- Slovak
- Swedish
- Open Under-PSI
Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in Moses-like format.
5
12
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
15
21
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI