EuroPat release 1 English-French

Europat 1 en-fr

Parallel corpora extracted from European and US patents in English-French as release 1 of the Europat project, specifically "EuroPat: Unleashing European Patent Translations". The TMX file combines all domains and all fields (Title, Claim, Abstract, Description) in one file; these are labeled in the metadata so one can extract separate corpora from them if desired. Corpora were cleaned with Bicleaner 0.14 with a threshold of 0.5.

DSI Relevance: BusinessRegistersInterconnectionSystem, Cybersecurity, ElectronicExchangeOfSocialSecurityInformation, Europeana, OnlineDisputeResolution, OpenDataPortal, eHealth, eJustice, eProcurement, saferInternet