Web-acquired data related to scientific research (Part I). A multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) collection of files in Moses format.
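Moses format conventionally stores a language pair as two plain-text files that are aligned line by line, so line N of one file is the translation of line N of the other. The sketch below shows one way such a pair could be read; the function name and the file paths passed to it are hypothetical, and the actual file naming in this resource is not stated here.

```python
from itertools import zip_longest


def read_moses_pair(src_path, tgt_path, encoding="utf-8"):
    """Yield (source, target) segments from two line-aligned text files.

    Assumes the usual Moses convention: one segment per line, with the
    same number of lines in both files.
    """
    with open(src_path, encoding=encoding) as src, \
            open(tgt_path, encoding=encoding) as tgt:
        for number, (s, t) in enumerate(zip_longest(src, tgt), start=1):
            # zip_longest pads the shorter file with None, which signals
            # that the two files are not aligned.
            if s is None or t is None:
                raise ValueError(f"files differ in length at line {number}")
            yield s.rstrip("\n"), t.rstrip("\n")
```

Raising on a length mismatch, rather than silently truncating, makes misaligned files visible before they are used for training or evaluation.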

Multilingual (BG, CS, DA, DE, EN, ES, ET, FR, GA, HR, IT, LT, LV, NB, NL, PL, PT, RU, SK, SV, UK) corpus generated by processing the content of websites related to scientific research (e.g. research centres and institutes, universities, ministries of research). The total number of translation units (TUs) is 458930, distributed across language pairs as follows:
de-es 182
de-fr 116997
de-it 46877
de-pl 139
de-ru 185
de-uk 170
en-bg 1746
en-cs 22856
en-da 47213
en-de 39824
en-es 172
en-et 2488
en-fr 32635
en-ga 13911
en-hr 633
en-it 13251
en-lt 16175
en-lv 4007
en-nl 8232
en-nb 1416
en-pl 19029
en-pt 1494
en-ru 1522
en-sk 5350
en-sv 6705
en-uk 163
es-fr 2022
es-pl 133
es-ru 183
es-uk 173
et-ru 3010
fr-it 48999
fr-nl 106
fr-pl 127
fr-ru 171
fr-uk 163
pl-ru 146
pl-uk 146
ru-uk 179
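The per-pair counts above can be parsed and aggregated with a few lines of code, for example to check the total or to see how many TUs involve a given language. This is a minimal sketch: the function names are illustrative, and the sample contains only the first three pairs of the list.

```python
from collections import Counter


def parse_pair_counts(text):
    """Parse lines such as 'de-fr 116997' into {('de', 'fr'): 116997}."""
    counts = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        pair, number = line.split()
        src, tgt = pair.split("-")
        counts[(src, tgt)] = int(number)
    return counts


def units_per_language(counts):
    """Sum TUs per language, counting each pair for both of its languages."""
    totals = Counter()
    for (src, tgt), n in counts.items():
        totals[src] += n
        totals[tgt] += n
    return totals


sample = """\
de-es 182
de-fr 116997
de-it 46877
"""
counts = parse_pair_counts(sample)
print(sum(counts.values()))               # 164056
print(units_per_language(counts)["de"])   # 164056
```

Applied to the full list of pairs, the sum of the counts equals the stated total of 458930.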

DSI Relevance: Europeana























