Language identifier for Bosnian, Croatian and Serbian

BS-HR-SR LID

The identifier is a Naive Bayes classifier trained on the tritext of Bosnian, Croatian and Serbian from the SETimes corpus using lowercased tokens as features.


Languages: Serbian (sr), Croatian (hr), Bosnian (bs)
People who looked at this resource also viewed the following:
Resources from the same project