Polish Ministry of Foreign Affairs Historical Dataset (Processed)
A collection of parallel Polish-English texts published by the Polish Ministry of Polish Affairs. Sentence-level alignment of translation segments was carried out manually and encoded in the XLiFF format.
There are three publications in the collection
a) Nazi Concentration Camps (obozy2014.xlf, 398 segments 14146 words),
b) A Guide to History of Poland (przewodnik_po_historii_polski.xlf, 828 segments, 25572 words) and
c) The Katyn Crime (zbrodnia_katyn_xlf, 1455 segments, 66396 words).
The total size of the collection is 106 114 words in 2681 parallel segments.
It was converted into a 2223-TUs English-Polish resource in TMX format.
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
- Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (Processed)
- English-Swedish parallel corpus from the web site of the Swedish Migration Board - Migrationsverket (Processed)
- Parallel corpus from Estonian Ministry of Foreign Affairs (Processed)
- Legal texts from Estonian Ministry of Justice (Processed)
People who downloaded this resource also downloaded the following: