Human-reviewed automatic English translations of Europeana metadata
The resource includes human-reviewed or post-edited translations of metadata sourced from the Europeana platform. The human-inspected automatic translations are from 17 European languages to English.
The translations from Bulgarian, Croatian, Czech, Danish, German, Greek, Spanish, Finnish, Hungarian, Polish, Romanian, Slovak, Slovenian and Swedish have been reviewed by a group of linguist experts (2 experts for each language).
The translations from Dutch, French, and Italian are sourced from Europeana collections coming from the fashion, audiovisual and museum heritage domains and have been evaluated by cultural heritage experts.
In both cases, evaluators were solicited to rate the automatic translations on a scale from 0 to 100. The textual segments have been extracted from different metadata properties of the Europeana Data Model, which captures aspects of a CH item, such as the title of a painting or its description.
The TSV files include automatic translations that received a human rating of 90% or above. For Italian, French, and Dutch, we also include in the TSV files automatic translations that have been post-edited by human experts so as to reflect a correct translation. TTSV files have the first row in the source language and the second in English.
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
People who downloaded this resource also downloaded the following:
- HRW dataset v1. Multilingual (EN, AR, BG, BN, CS, DA, DE, EL, ES, FA, FI, FR, HR, HU, IN, IT, KO, LV, NB, NL, PL, PT, RU, SK, SQ, SV, TH, TL, TR, UK, UR, Vi, ZH)
- Multilingual content acquired from advocacy and law associations/firms, conciliation/arbitration/co-operation institutes, dispute prevention and resolution agencies (part 1 , v.1).
- German-French website parallel corpus from the Federal Foreign Office Berlin
- German-French website parallel corpus from the Federal Foreign Office Berlin (Processed)