PRINCIPLE SDURDD Croatian-English Parallel Corpus of international agreements
PRINCIPLE SDURDD Hrvatsko-engleski paralelni korpus međunarodnih ugovora
PRINCIPLE SDURDD Croatian-English Parallel Corpus of international agreements contains 1166 documents (583 in Croatian and 583 in English) in the eJustice domain, totaling 234,500 translation units. Text was extracted automatically from HTML documents, followed by manual consolidation and document checking. Documents were cleaned, and a manual content check was performed on a sample. Translation units (TUs) were aligned automatically, followed by a manual check of the alignment on a sample. It is open and freely available under the PSI licence.
PRINCIPLE SDURDD Hrvatsko-engleski paralelni korpus međunarodnih ugovora sadrži 1166 dokumenata (583 na hrvatskom i 583 na engleskom), sveukupno 234.500 prijevodnih jedinica. Izvršena je automatska ekstrakcija teksta iz HTML dokumenata koji su zatim ručno konsolidirani i pregledani. Dokumenti su očišćeni, a na uzorku je provedena ručna provjera sadržaja. Sravnjivanje prijevodnih jedinica napravljeno je automatski te je naknadno uzorak ručno pregledan. Otvoren je i slobodno dostupan na temelju informacija javnog sektora.
DSI Relevance: eJustice
People who looked at this resource also viewed the following:
- The Icelandic Met Office - Weather forecasts and warnings
- PRINCIPLE MVEP Croatian-English Parallel Corpus of legal documents
- Compilation of French-Romanian parallel corpora resources used for training of NTEU Machine Translation engines.
- Official web-portal of the Parliament of Ukraine, Ukrainian laws in EN