PRINCIPLE An tAonad Aistriúcháin agus Ateangaireachta ÓEG/NUIG Translation Unit dataset

Aligned parallel corpus built from translated material from NUI Galway. The source data was supplied in unaligned format and was processed as follows: automatic text extraction from the raw documents, normalization, translation unit (TU) alignment, cleaning, automated error detection, and a manual spot-check for quality.
Languages: English-Irish
Domain: mixed (general-purpose with some eProcurement)
Size: 17949 translation units
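
The normalization and cleaning steps listed in the description can be illustrated with a minimal Python sketch. It is not the pipeline actually used to build this dataset: the function names (`normalize`, `clean_units`), the length-ratio threshold, and the specific filters (empty segments, identical source and target, duplicates) are assumptions chosen only to show what such a step commonly looks like for English-Irish translation units.

```python
import unicodedata


def normalize(text: str) -> str:
    """Apply Unicode NFC normalization and collapse runs of whitespace."""
    text = unicodedata.normalize("NFC", text)
    return " ".join(text.split())


def clean_units(pairs, max_ratio=3.0):
    """Filter (English, Irish) pairs with a few simple heuristics.

    max_ratio is a hypothetical threshold on the character-length ratio
    between the two sides; it is not a value taken from the dataset.
    """
    seen = set()
    kept = []
    for en, ga in pairs:
        en, ga = normalize(en), normalize(ga)
        if not en or not ga:
            continue  # one side is empty
        if en == ga:
            continue  # source equals target, likely untranslated
        ratio = max(len(en), len(ga)) / min(len(en), len(ga))
        if ratio > max_ratio:
            continue  # lengths too different, likely misaligned
        if (en, ga) in seen:
            continue  # exact duplicate after normalization
        seen.add((en, ga))
        kept.append((en, ga))
    return kept


if __name__ == "__main__":
    sample = [
        ("Hello   world", "Dia duit, a dhomhain"),
        ("Hello world", "Dia duit, a dhomhain"),
        ("Tender", "Tender"),
    ]
    for en, ga in clean_units(sample):
        print(f"{en}\t{ga}")
```

In this sketch, normalization runs before the duplicate check so that pairs differing only in whitespace or Unicode composition are treated as the same unit. A real pipeline would typically add language identification and further checks before the manual spot-check.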