Bilinguis Free Books v.1.04. Multilingual (CS, DE, EN, ES, FI, FR, IT, NL, PL, PT) corpus from the http://bilinguis.com/ website.
Multilingual (CS, DE, EN, ES, FI, FR, IT, NL, PL, PT) dataset based on the content of the http://bilinguis.com/ website. It includes 151927 Translation Units in total. It was generated by harvesting the website in October 2021, identifying parallel sentence pairs and filtering the results. The number of TUs are:
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
People who downloaded this resource also downloaded the following:
- ELRC3.0 Multilingual corpus made out of PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu, (February 2020).
- Auslandsgesellschaft.de Dortmund Serviceheft Ukraine (Processed)
- Audioguide for the Military History Museum in Vienna (Processed)
- Compilation of Dutch; Flemish-Polish parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.