Language identifier for Bosnian, Croatian and Serbian

BS-HR-SR LID

The identifier is a Naive Bayes classifier trained on the tritext of Bosnian, Croatian and Serbian from the SETimes corpus using lowercased tokens as features.


Languages: Serbian (sr), Croatian (hr), Bosnian (bs)