17 Language Resources

Order by:

 COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)
Number of downloads 64 Number of views 150
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 OSHA-EUROPA dataset v1. Multilingual (CEF languages plus IS and NB but not Irish)
Number of downloads 14 Number of views 22
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
Number of downloads 192 Number of views 184
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 16 Number of views 41
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part1, v.0).
Number of downloads 19 Number of views 35
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI
 Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part 1 , v.1).
Number of downloads 14 Number of views 32
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI
 Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TMX format.
Number of downloads 2 Number of views 8
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.05) in TSV/MOSES-like format.
Number of downloads 2 Number of views 11
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TMX format.
Number of downloads 3 Number of views 7
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 Multilingual corpus in HEALTH (COVID-19) domain part_1a (v.1.0) in TSV/MOSES-like format.
Number of downloads 5 Number of views 9
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020), provided in Moses format.
Number of downloads 3 Number of views 13
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 22
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 92
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in Moses-like format.
Number of downloads 5 Number of views 12
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI
 Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
Number of downloads 15 Number of views 21
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
Number of downloads 6 Number of views 18
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI
 Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Number of downloads 14 Number of views 43
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • French
  • German
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Ukrainian
  • Open Under-PSI