Filter by:

Corpus (30)
Text (30)
Parallel (30)
ELRC3.0 (28)
COVID-19 (15)

Resource Type:

Corpus:
Lexical/Conceptual:
Language Description:
Tool/Service:

30 Language Resources (Page 1 of 2)

« Previous | Next »Order by:

 Bilinguis Free Books v.1.04. Multilingual (CS, DE, EN, ES, FI, FR, IT, NL, PL, PT) corpus from the http://bilinguis.com/ website.
Number of downloads 21 Number of views 40
  • Czech
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Italian
  • Polish
  • Portuguese
  • Spanish; Castilian
  • Public Domain
 COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)
Number of downloads 64 Number of views 153
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EC-EUROPA v1 dataset. Multilingual (CEF languages)
Number of downloads 27 Number of views 100
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EU presscorner v1 dataset. Multilingual (CEF languages)
Number of downloads 24 Number of views 98
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EU presscorner v2 dataset. Multilingual (CEF languages)
Number of downloads 29 Number of views 87
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EUR-LEX dataset . Multilingual (CEF languages)
Number of downloads 46 Number of views 91
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EUROPARL dataset v1. Multilingual (24 CEF languages)
Number of downloads 25 Number of views 84
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 EUROPARL dataset v2. Multilingual (24 CEF languages)
Number of downloads 25 Number of views 87
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 GOV-LUX dataset v3. Multilingual (EN, FR, DE, PT, NL)
Number of downloads 1 Number of views 13
  • Dutch; Flemish
  • English
  • French
  • German
  • Portuguese
  • Open Under-PSI
 COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Number of downloads 103 Number of views 193
  • Afrikaans
  • Albanian
  • Arabic
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Lithuanian
  • Macedonian
  • Malay (macrolanguage)
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tagalog
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese
  • CC-BY-SA-3.0
 COVID-19 OSHA-EUROPA dataset v1. Multilingual (CEF languages plus IS and NB but not Irish)
Number of downloads 14 Number of views 22
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 COVID-19 Parallel Global Voices dataset. Multilingual (EN, ES, FR, IT, EL, RU, AR, MG, NL, SR, BN, PT, PL, DE, RO, CS)
Number of downloads 23 Number of views 98
  • Arabic
  • Bengali
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Malagasy
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Spanish; Castilian
  • CC-BY-3.0
 COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in Moses-like format
Number of downloads 1 Number of views 7
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in TMX format
Number of downloads 4 Number of views 10
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 COVID-19-related multilingual corpus from EU press Corner 2020 v.0.9 in TSV format
Number of downloads 2 Number of views 7
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 2 Number of views 15
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
Number of downloads 11 Number of views 28
  • Arabic
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Turkish
  • CC-BY-NC-ND-4.0
 ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
Number of downloads 192 Number of views 185
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0
 HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
Number of downloads 17 Number of views 42
  • Albanian
  • Bengali
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Filipino; Pilipino
  • Finnish
  • French
  • German
  • Hungarian
  • Indonesian
  • Italian
  • Korean
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Tai languages
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-ND-3.0
 Multilingual corpus from the European Vaccination Information Portal
Number of downloads 49 Number of views 126
  • Bulgarian
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Irish
  • Italian
  • Latvian
  • Lithuanian
  • Maltese
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-4.0

« Previous | Next »