Synthetic multilingual corpus of public services of Germany
Part of a synthetic multilingual dataset of web content from municipalities all over Europe.
This dataset was produced within the CEFAT4Cities project.
The data is scraped and translated from 6 countries: Belgium, Croatia, Germany, Italy, Norway and Slovenia.
This is a fragment of the whole corpus and is limited to data from Germany.
DSI Relevance: OpenDataPortal