Unofficial Consolidated legislative texts (Slovene)

Unofficial Consolidated legislative texts (Slovene) is an open dataset made available by the Legislation Service, Government of the Republic of Slovenia. In its current form, it was contributed to the European Language Resources Coordination Action (ELRC) by Krek Simon, Jozef Stefan Institut, and is licensed under "CC-BY 4.0".

A collection (corpus in JSON format) of unofficial consolidated texts of the laws, regulations and other general acts of Slovenia. The dataset comprises 21,556 HTML files with approximately 103 million tokens. More information:

DSI Relevance: eJustice