CURLICAT Croatian corpus

Reference: Tamás Váradi, Bence Nyéki, Svetla Koeva, Marko Tadić, Vanja Štefanec, Maciej Ogrodniczuk, Bartłomiej Nitoń, Piotr Pęzik, Verginica Barbu Mititelu, Elena Irimia, Maria Mitrofan, Dan Tufiș, Radovan Garabík, Simon Krek, and Andraž Repar. 2022. Introducing the CURLICAT Corpora: Seven-language Domain Specific Annotated Corpora from Curated Sources. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 100–108, Marseille, France. European Language Resources Association. (https://aclanthology.org/2022.lrec-1.11/)

The CURLICAT Croatian corpus contains redistributable data from the Croatian National Corpus, data from the Croatian Medical Corpus, and data from several scientific journals of the Croatian portal of open access journals, and data from other sources.

DSI Relevance: Europeana, OpenDataPortal, eHealth, eProcurement