Citizens Information Bilingual Web-Corpus

Contains Irish Public Sector Data licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) licence.

A web corpus crawled from http://www.citizensinformation.ie. It contains 10,297 English/Irish parallel sentences that have been manually cleaned, distributed as 2 .txt files. The corpus may be reproduced and/or re-used free of charge subject to the latest PSI licence, Creative Commons Attribution 4.0 International (CC BY 4.0).
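The description does not say how the 2 .txt files are organised. A minimal loading sketch follows, assuming one file per language, UTF-8 encoding, and line-by-line alignment (line i of the English file corresponds to line i of the Irish file). These are assumptions rather than documented facts about the corpus, and the function name read_parallel is hypothetical.

```python
from typing import List, Tuple


def read_lines(path: str) -> List[str]:
    """Read a text file as a list of lines without trailing newlines."""
    # Assumption: the files are UTF-8 encoded.
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


def read_parallel(en_path: str, ga_path: str) -> List[Tuple[str, str]]:
    """Pair each English line with the Irish line at the same position.

    Assumption: the two files are line-aligned, one sentence per line.
    """
    en_lines = read_lines(en_path)
    ga_lines = read_lines(ga_path)
    if len(en_lines) != len(ga_lines):
        # A length mismatch means the line-alignment assumption does not hold.
        raise ValueError(
            f"line counts differ: {len(en_lines)} English vs {len(ga_lines)} Irish"
        )
    return [(en.strip(), ga.strip()) for en, ga in zip(en_lines, ga_lines)]
```

Calling read_parallel with the paths of the two files returns a list of (English, Irish) tuples. If the alignment assumption holds, the list length should match the 10,297 sentences stated above; a ValueError indicates the files are laid out differently.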