Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in TMX format.
Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) corpus generated by processing content of websites related to scientific research (e.g. Research Center and Institutes , Universities, Ministries of Research, etc.). The total number of Tus is 458930.
de-es 182
de-fr 116997
de-it 46877
de-pl 139
de-ru 185
de-uk 170
en-bg 1746
en-cs 22856
en-da 47213
en-de 39824
en-es 172
en-et 2488
en-fr 32635
en-ga 13911
en-hr 633
en-it 13251
en-lt 16175
en-lv 4007
en-nl 8232
en-nb 1416
en-pl 19029
en-pt 1494
en-ru 1522
en-sk 5350
en-sv 6705
en-uk 163
es-fr 2022
es-pl 133
es-ru 183
es-uk 173
et-ru 3010
fr-it 48999
fr-nl 106
fr-pl 127
fr-ru 171
fr-uk 163
pl-ru 146
pl-uk 146
ru-uk 179
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
- SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
- Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part1, v.0).
- Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
- Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
People who downloaded this resource also downloaded the following:
- Web-acquired data related to culture (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, FI, FR, HR, IS, IT, LT, LV, MK, MT, RU, SK, SV) collection of files in TMX format.
- SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
- Web-acquired data related to health/covid-19 (Part I). Multilingual (BG, CS, DA, DE, EL, EN, ET, ES, FI, FR, GA, HR, HU, IS, IT, LT, LV, MK, MT, NL, NB, NN, NO, PL, PT, RO, SK, SL, SQ, SV) collection of files in TMX format.
- Web-acquired data related to Scientific research (Part I). Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.