12 Language Resources
Bilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020) (EN-IS).
45
65
- English
- Icelandic
- CC-BY-4.0
COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)
65
156
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Irish
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
COVID-19 OSHA-EUROPA dataset v1. Multilingual (CEF languages plus IS and NB but not Irish)
14
23
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
192
186
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TMX format.
2
8
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TSV/MOSES-like format.
2
11
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TMX format.
3
7
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TSV/MOSES-like format.
5
9
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020), provided in Moses format.
3
13
- Bulgarian
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Maltese
- Modern Greek (1453-)
- Norwegian Bokmål
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
5
22
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
40
95
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0