Post-editing corpus English to Dutch/French/Portuguese, legal domain 
APE-QUEST postedition tuples

ID:
https://ape-quest.eu
Training data created for building quality estimation and automatic post-editing models (Ive et al. 2020). The data consist of tuples (source sentence, machine translation output, manual post-edit, independent reference translation) for three European language pairs. The data cover the following domains: online dispute resolution, procurement and justice.
Number of tuples per language pair:
- English-Dutch: 11249
- English-French: 9989
- English-Portuguese: 10165
The machine translation output was produced with neural MT systems. All data were anonymized by replacing person names and contact information. The tuples were randomized. The data are distributed as a set of four files per language pair, one for each element of the tuple.
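Because each tuple element is shipped in its own file, the four files have to be read in parallel to recover the tuples. The following is a minimal sketch of one way to do that, not the distributors' own tooling. It assumes that the files are UTF-8 plain text with one segment per line and that line N of each file belongs to the same tuple; neither assumption is confirmed by this description. The names `PostEditTuple` and `read_tuples` are hypothetical, as are any file paths passed in.

```python
from contextlib import ExitStack
from itertools import zip_longest
from typing import Iterator, NamedTuple


class PostEditTuple(NamedTuple):
    """One aligned record; field names are illustrative, not from the corpus."""
    source: str
    mt: str
    post_edit: str
    reference: str


_MISSING = object()  # sentinel marking a file that ran out of lines early


def read_tuples(src_path, mt_path, pe_path, ref_path,
                encoding="utf-8") -> Iterator[PostEditTuple]:
    """Yield tuples from four line-aligned files (assumed layout)."""
    paths = (src_path, mt_path, pe_path, ref_path)
    with ExitStack() as stack:
        # Open all four files; ExitStack closes them when iteration ends.
        files = [stack.enter_context(open(p, encoding=encoding)) for p in paths]
        for line_no, lines in enumerate(
                zip_longest(*files, fillvalue=_MISSING), start=1):
            # A sentinel means the files do not have the same number of lines.
            if any(line is _MISSING for line in lines):
                raise ValueError(f"files differ in length at line {line_no}")
            yield PostEditTuple(*(line.rstrip("\r\n") for line in lines))
```

Raising an error on a length mismatch, rather than silently truncating to the shortest file, makes a misaligned download visible before any model is trained on it.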
DSI Relevance: OnlineDisputeResolution, eJustice, eProcurement