7 Language Resources

Order by:

 COVID-19 USAHELLO dataset v1. Bilingual (EN, ES)
Number of downloads 1 Number of views 18
  • English
  • Spanish; Castilian
  • CC-BY-NC-SA-4.0
 COVID-19 USAHELLO dataset v2. Multilingual (EN, AR, ES, FA, FR, IT, KO, PT, RU, TL, TR, UK, UR, VI, ZH)
Number of downloads 13 Number of views 35
  • Arabic
  • Chinese
  • English
  • French
  • Italian
  • Korean
  • Persian
  • Philippine languages
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • CC-BY-NC-SA-4.0
 Multilingual English, French, Polish to Ukrainian Parallel Corpus (processed)
Number of downloads 20 Number of views 54
  • English
  • French
  • Polish
  • Ukrainian
  • CC-BY-NC-SA-4.0
 Polish-Ukrainian Parallel Corpus (processed)
Number of downloads 4 Number of views 26
  • Polish
  • Ukrainian
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
Number of downloads 5 Number of views 22
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Number of downloads 40 Number of views 92
  • Albanian
  • Bulgarian
  • Croatian
  • Czech
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Latvian
  • Lithuanian
  • Macedonian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Polish
  • Portuguese
  • Russian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • CC-BY-NC-SA-4.0
 SciPar UK-EN-RU
Number of downloads 16 Number of views 56
  • English
  • Russian
  • Ukrainian
  • CC-BY-NC-SA-4.0