ParaCrawl release 4 Slovenian-English 
ParaCrawl 4 sl-en

Slovenian-English parallel from release 4 of the ParaCrawl project, specifically "Provision of Web-Scale Parallel Corpora for Official European Languages". This version is filtered with BiCleaner with a threshold of 0.7. Data was crawled from the web following robots.txt, as is standard practice. The crawl is not targeted to a particular domain, intending to provide broad coverage.
DSI Relevance: BusinessRegistersInterconnectionSystem, Cybersecurity, ElectronicExchangeOfSocialSecurityInformation, Europeana, OnlineDisputeResolution, OpenDataPortal, eHealth, eJustice, eProcurement, saferInternet
People who looked at this resource also viewed the following:
- COVID-19 Health Service Executive of Ireland dataset v1. Bilingual (EN-BG)
- Compilation of Bulgarian-German parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- Compilation of Maltese-Slovenian parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)