PRINCIPLE Foras na Gaeilge parallel translation memory dataset

Aligned parallel corpus based on translation memory data from Foras na Gaeilge. The data was originally supplied in an aligned format and has since been normalized and cleaned. The cleaned content was then checked automatically for obvious errors and spot-checked manually for quality.
Languages: English-Irish
Domain: mixed (general-purpose with some eProcurement)
Size: 60443 translation units
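
The record above does not state the distribution format. Translation memories are commonly exchanged as TMX, so the following is only a minimal sketch, assuming a TMX file with `en` and `ga` language codes, of how the translation units could be read as sentence pairs. The function name `read_translation_units` and its parameters are hypothetical and not part of the dataset.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_translation_units(tmx_source, src_lang="en", tgt_lang="ga"):
    """Yield (source, target) text pairs from a TMX file path or file object.

    Assumption: the corpus is stored as TMX, with one <tuv> per language
    inside each <tu>. Units missing either language are skipped.
    """
    tree = ET.parse(tmx_source)
    for tu in tree.getroot().iter("tu"):
        segments = {}
        for tuv in tu.iter("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain "lang" attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text around inline markup elements.
                text = "".join(seg.itertext()).strip()
                # Reduce regional codes such as "en-IE" to the base language.
                segments[lang.split("-")[0]] = text
        if src_lang in segments and tgt_lang in segments:
            yield segments[src_lang], segments[tgt_lang]
```

Under these assumptions, `sum(1 for _ in read_translation_units(path))` for a hypothetical local `path` gives the number of complete English-Irish pairs, which can be compared with the size stated above.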