SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
Collection of 31 bilingual TMX files for EN-X language pairs, where X is BG, CS, DE, EL, EN, ES, ET, FI, FR, HR, HU, IS, IT, LT, LV, MK, NB, NN, PL, PT, RU, SK, SL, SQ, SV. It also contains small collection for a few more language combinations. It was generated by processing abstracts of Bachelor, Master and PhD Theses available at academic repositories and archives. The total number of Tus is 9172462.
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
People who downloaded this resource also downloaded the following: