PRINCIPLE Foras na Gaeilge parallel translation memory dataset (evaluated) 
Aligned parallel corpus based on translation memory data from Foras na Gaeilge. The data originally came in an aligned format, and was since normalized and cleaned. The cleaned content was subsequently searched (automated) for obvious errors, and spot-checked (manually) for quality. This dataset has been used in the development of an MT system for the EN-GA language pair, and so is considered to be of high quality.
Languages: English-Irish
Domain: mixed (general-purpose with some eProcurement)
Size: 54141 translation units
People who looked at this resource also viewed the following:
- Compilation of Dutch; Flemish-Romanian; Moldavian; Moldovan parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- Irish Primary Stem Cells Lesson Plan Booklet and Presentation
- Montenegrin web corpus MaCoCu-cnr 1.0
- PAeSI : Public Administration and Foreign Immigrants (Processed)