Bilingual hr-en parallel corpus from the Hrvatski Telekom website

Contents of http://www.t.ht.hr were crawled, aligned on document and sentence level and converted into a parallel corpus