Filter by:
Icelandic (16)
English (16)
French (12)
German (12)
Polish (12)
Finnish (11)
Lithuanian (11)
Swedish (11)
Bulgarian (10)
Czech (10)
Danish (10)
Estonian (10)
Italian (10)
Latvian (10)
Norwegian Bokmål (10)
Slovak (10)
Spanish; Castilian (10)
Croatian (9)
Hungarian (9)
Maltese (9)
Portuguese (9)
Slovenian (9)
Dutch; Flemish (8)
Albanian (4)
Irish (4)
Macedonian (3)
Russian (2)
Norwegian (1)
Turkish (1)
TMX (16)
Corpus (16)
Text (16)
Multilingual (13)
Bilingual (3)
Parallel (16)
Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Language Description: | |
Tool/Service: |
16 Language Resources
Order by:
Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-IS).
45
66
- English
- Icelandic
- CC-BY-4.0
COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)
67
167
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 landlaeknir dataset v1. Multilingual (EN, IS, PL)
1
20
- English
- Icelandic
- Polish
- Open Under-PSI
COVID-19 landlaeknir dataset v2. Multilingual (EN, IS, PL, DE, ES, FR, LT)
6
22
- English
- French
- German
- Icelandic
- Lithuanian
- Polish
- Spanish; Castilian
- Open Under-PSI
COVID-19 OSHA-EUROPA dataset v1. Multilingual (CEF languages plus IS and NB but not Irish)
18
33
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 SST-DK dataset v1. Multilingual (EN, TR, SV, PL, NB, IS, FR, FI, DE, DA)
6
16
- Danish
- English
- Finnish
- French
- German
- Icelandic
- Norwegian Bokmål
- Polish
- Swedish
- Turkish
- Open Under-PSI
ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
202
203
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part1, v.0).
21
42
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part 1 , v.1).
17
43
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TMX format.
4
10
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TMX format.
5
9
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
44
112
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
4
16
- Bulgarian
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Russian
- Slovak
- Swedish
- Open Under-PSI
Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
16
26
- Albanian
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- Open Under-PSI