Synthetic multilingual corpus of public services of Slovenia
Part of a synthetic multilingual dataset of web content from municipalities all over Europe.
This dataset was produced within the CEFAT4Cities project.
The data was scraped and translated from 6 countries: Belgium, Croatia, Germany, Italy, Norway and Slovenia.
This is a fragment of the whole corpus and is limited to data from Slovenia.
DSI Relevance: OpenDataPortal