6 Language Resources
COVID-19 Voltaire dataset v1. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
2
15
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
COVID-19 Voltaire dataset v2. Multilingual (EN, AR, CS, DE, EL, ES, FA, FR, IT, NB, NL, NN, PL, PT, RO, RU, TR)
11
29
- Arabic
- Czech
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Persian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Turkish
- CC-BY-NC-ND-4.0
HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, VI, ZH)
17
42
- Albanian
- Bengali
- Bulgarian
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Filipino; Pilipino
- Finnish
- French
- German
- Hungarian
- Indonesian
- Italian
- Korean
- Latvian
- Modern Greek (1453-)
- Norwegian Bokmål
- Persian
- Polish
- Portuguese
- Russian
- Slovak
- Spanish; Castilian
- Swedish
- Tai languages
- Turkish
- Ukrainian
- Urdu
- Vietnamese
- CC-BY-NC-ND-3.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
5
22
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0
SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
40
94
- Albanian
- Bulgarian
- Croatian
- Czech
- English
- Estonian
- Finnish
- French
- German
- Hungarian
- Icelandic
- Italian
- Latvian
- Lithuanian
- Macedonian
- Modern Greek (1453-)
- Norwegian Bokmål
- Norwegian Nynorsk
- Polish
- Portuguese
- Russian
- Slovak
- Slovenian
- Spanish; Castilian
- Swedish
- CC-BY-NC-SA-4.0