Compilation of Czech-English parallel corpora resources used for training of NTEU Machine Translation engines.
This corpora compilation is build from select public and private corpora. See ReadMe.txt for more information.
People who looked at this resource also viewed the following:
- Compilation of Estonian-Hungarian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Greek-Slovak parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of English-Lithuanian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Czech-Greek parallel corpora resources used for training of NTEU Machine Translation engines.
People who downloaded this resource also downloaded the following:
- CEF Data Marketplace multilingual benchmark for the evaluation of cleaning and clustering tools
- Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-CS).
- Czech-English Parallel corpus from Tatoeba project
- Parallel Global Voices (English - Czech) (Processed)