Parallel Global Voices (Greek - Spanish) (Processed)
Parallel Global Voices (Greek - Spanish) was created for the European Language Resources Coordination Action (ELRC) (http://lr-coordination.eu/) by researchers at the NLP group of the Institute for Language and Speech Processing (http://www.ilsp.gr/) with primary data copyrighted by Parallel Global Voices (https://globalvoices.org/) and is licensed under "CC-BY 3.0" (https://creativecommons.org/licenses/by/3.0/).
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news stories in more than 40 languages. The original content from the Global Voices websites is available by the authors and publishers under a Creative Commons Attribution license. The original content was crawled in 2015-2016 and web documents were exported to XML by researchers at the Institute for Language and Speech Processing (http://www.ilsp.gr/). Crawled documents that were translations of each other were paired on the basis of their link information. After document pairing, sentence alignments were generated with the hunalign sentence aligner. This dataset contains one tmx file with alignments from 3161 el-es document pairs.
DSI Relevance: Europeana
People who looked at this resource also viewed the following:
- Polish Ministry of Foreign Affairs reports in EN and PL (Processed)
- Statens Vegvesen Translation Memories
- Press and Information Office (PIO) Publication: "CYPRUS still occupied still divided 1974-2016" (Processed)
- Parallel English-Icelandic corpus from the contents of Icelandic National Debt Management Agency website
People who downloaded this resource also downloaded the following: