Bilingual hr-en parallel corpus from the Journal of the Croatian Association of Civil Engineers website (Processed)

The dataset was created for the European Language Resources Coordination Action (ELRC) (http://lr-coordination.eu/) by ELRC Consortium partner, ILSP/R.C. "Athena" (https://www.athena-innovation.gr/) from the website of the Journal of the Croatian Association of Civil Engineers website (http://www.casopis-gradjevinar.hr) and is licensed under "CC-BY 4.0" (https://creativecommons.org/licenses/by/4.0/).

Contents of http://casopis-gradjevinar.hr were crawled, aligned on document and sentence level and converted into a parallel corpus