Synthetic multilingual corpus of public services of Slovenia
Part of synthetic multilingual dataset of webcontent from municipalities all over Europe.
This dataset was produced within the CEFAT4Cities project.
The data is scraped and translated from 6 countires: Belgium, Croatia, Germany, Italy, Norway and Slovenia.
This is a fragment of the whole corpus and is limited to data from Slovenia.
DSI Relevance: OpenDataPortal
People who looked at this resource also viewed the following:
- Manufactured data based on ParaCrawl release 7 Greek-English
- Compilation of Greek-Slovenian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Latvian-Romanian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of German-Slovak parallel corpora resources used for training of NTEU Machine Translation engines.
People who downloaded this resource also downloaded the following: