English-Portuguese website parallel corpus (Processed) 
Texts crawled from various websites open under PSI regulations : Sligo county council, Direcção Geral da Administração Escolar, Director of Public Prosecutions, Ministério dos Negócios Estrangeiros, Parlamento italiano, Office of the Revenue Commissioners
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 843 TUs.
Manual validation has been performed on a sample of the data.
People who looked at this resource also viewed the following:
- Compilation of Italian-Latvian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of French-Swedish parallel corpora resources used for training of NTEU Machine Translation engines.
- COVID-19 CDC dataset v1. Multilingual (EN, ES, FR, PT, IT)
- Compilation of Finnish-Hungarian parallel corpora resources used for training of NTEU Machine Translation engines.