Synthetic multilingual corpus of public services of Germany
Part of a synthetic multilingual dataset of web content from municipalities all over Europe.
This dataset was produced within the CEFAT4Cities project.
The data was scraped and translated from six countries: Belgium, Croatia, Germany, Italy, Norway and Slovenia.
This is a fragment of the whole corpus and is limited to data from Germany.
DSI Relevance: OpenDataPortal