Romanian - English literature corpus (Processed)

Romanian – English literature corpus was created for the European Language Resources Coordination Action (ELRC) (http://lr-coordination.eu/) by Tufis Dan, Institutul de Cercetari pentru Inteligenta Artificiala ”Mihai Draganescu”, Academia Romana (www.racai.ro/) and is licensed under "CC-BY 4.0" (https://creativecommons.org/licenses/by/4.0/). Primary data consist of a small set of freely available literature books (drama, sci-fi, etc.)

Bilingual Romanian – English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on line i in the English text is aligned with the sentence on line i in the Romanian text. Alignment was manually validated.

DSI Relevance: Europeana