Public Procurement Dataset 2 (Processed)

A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in the XLiFF format.
There are two publications in the collection:
a) Report on functioning of public procurement system in 2006 (raport_uzp_2006.xlf, 705 segments 31141 words) and
b) Report on functioning of public procurement system in 2011 (raport_uzp_2010.xlf, 1434 segments,73456 words).
The total size of the collection is 104597 words in 2139 parallel segments.
It was converted into a 1842-TUs English-Polish resource in TMX format.

DSI Relevance: eProcurement