SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.

Collection of 31 bilingual TMX files for EN-X language pairs, where X is BG, CS, DE, EL, EN, ES, ET, FI, FR, HR, HU, IS, IT, LT, LV, MK, NB, NN, PL, PT, RU, SK, SL, SQ, SV. It also contains small collection for a few more language combinations. It was generated by processing abstracts of Bachelor,Read More