Ministry of State Assets Dataset (Processed)

A collection of parallel Polish-English texts extracted from documents delivered by the Ministry of State Assets (MAP). Translation segments were aligned manually at sentence level and encoded in the XLIFF format. The segment pairs were then merged, filtered, and converted to TMX format. The collection includes 2769 translation units (TUs).
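
As a rough illustration of how the TMX output might be consumed, the sketch below reads Polish-English segment pairs with Python's standard library. It is not the tooling used to build the dataset. The function name `read_tmx_pairs`, the inline `SAMPLE` document, and the language codes `pl` and `en` are assumptions made for this example; the actual file may use different codes (for instance `pl-PL`), which the sketch handles only by comparing the primary language subtag.

```python
import io
import xml.etree.ElementTree as ET

# xml:lang belongs to the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def read_tmx_pairs(source, src_lang="pl", tgt_lang="en"):
    """Return (source, target) segment pairs from a TMX path or binary file object."""
    root = ET.parse(source).getroot()
    pairs = []
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain lang attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Keep only the primary subtag, so "pl-PL" is treated as "pl".
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        # Skip units that lack either language.
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs

# Hypothetical two-unit TMX document, used only to exercise the function.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header srclang="pl" datatype="plaintext" segtype="sentence"
          adminlang="en" o-tmf="xliff" creationtool="example"
          creationtoolversion="0"/>
  <body>
    <tu>
      <tuv xml:lang="pl"><seg>Ministerstwo Aktywów Państwowych</seg></tuv>
      <tuv xml:lang="en"><seg>Ministry of State Assets</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="pl"><seg>Tylko po polsku.</seg></tuv>
    </tu>
  </body>
</tmx>
""".encode("utf-8")

pairs = read_tmx_pairs(io.BytesIO(SAMPLE))
print(len(pairs), "complete translation unit(s)")
```

Passing a path to the real TMX file instead of the in-memory sample should work the same way, and the length of the returned list can then be compared against the stated count of 2769 TUs.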