6 Language Resources
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
103
193
- Afrikaans
- Albanian
- Arabic
- Azerbaijani
- Basque
- Belarusian
- Bengali
- Bosnian
- Bulgarian
- Catalan; Valencian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Lithuanian
- Macedonian
- Malay (macrolanguage)
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tagalog
- Tamil
- Telugu
- Thai
- Turkish
- Ukrainian
- Vietnamese
- CC-BY-SA-3.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
17
42
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
5
22
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
40
95
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in Moses format.
5
12
- Bulgarian
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Russian
- Slovak
- Swedish
- Open Under-PSI
Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
4
12
- Bulgarian
- Czech
- Danish
- English
- Estonian
- Finnish
- French
- German
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Maltese
- Russian
- Slovak
- Swedish
- Open Under-PSI