Romanian – English literature corpus

The Romanian – English literature corpus was created for the European Language Resources Coordination Action (ELRC) by Tufis Dan, Institutul de Cercetari pentru Inteligenta Artificiala ”Mihai Draganescu”, Academia Romana, and is licensed under "CC-BY 4.0".

Bilingual Romanian – English literature corpus built from a small set of freely available literary works (drama, science fiction, etc.). The texts are positionally aligned: the sentence on line i of the English text corresponds to the sentence on line i of the Romanian text. The alignment was manually validated.
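
Because the alignment is purely positional, sentence pairs can be recovered by reading the two texts in parallel, line by line. The following is a minimal Python sketch of that idea, not part of the corpus distribution. The function name read_aligned_pairs and the UTF-8 encoding are assumptions; the actual file names and encoding should be taken from the downloaded package.

```python
def read_aligned_pairs(en_path, ro_path, encoding="utf-8"):
    """Return (english, romanian) sentence pairs, pairing line i with line i.

    The paths and the UTF-8 default are assumptions for illustration.
    """
    # Iterating over a text file splits on newlines only, so unusual
    # Unicode separators inside a sentence cannot shift the alignment.
    with open(en_path, encoding=encoding) as f:
        en_lines = [line.rstrip("\n") for line in f]
    with open(ro_path, encoding=encoding) as f:
        ro_lines = [line.rstrip("\n") for line in f]

    # Positional alignment only makes sense if both files have the
    # same number of lines; fail loudly rather than pair wrong sentences.
    if len(en_lines) != len(ro_lines):
        raise ValueError(
            f"line count mismatch: {len(en_lines)} English vs "
            f"{len(ro_lines)} Romanian"
        )

    return list(zip(en_lines, ro_lines))
```

Calling read_aligned_pairs with the paths of the English and Romanian files returns a list in which element i holds the English sentence and its Romanian counterpart from line i. The explicit length check is a design choice: a silent zip would truncate to the shorter file and hide a damaged download.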

DSI Relevance: Europeana