Home
Browse Resources
Help
About
What is ELRC-SHARE
LR Provision
Access to ELRC-SHARE Language Resources
Licensing LRs for the ELRC action
Notice and Takedown Policy
Disclaimers and Limitation of Liability
Log information, cookies and analytics
Data Protection Record
Register
Login
53
Last view: 2024-11-21
2
Last update: 2020-02-14
reldi-tokeniser
http://www.clarin.si/info/k-centre/web-services-documentation/
A tokeniser developed inside the ReLDI project. Supports three languages -- Croatian, Serbian and Slovene, and two modes -- for standard and non-standard text.
Back
Distribution
Availability:
Available
Licences
Apache-2.0
Distribution Details
Download location :
https://github.com/c...
Distribution Medium:
Data Downloadable
Contact Person
Simon Krek
Jozef Stefan Institute
[javascript protected email address]
Slovenia (SI)
Slovenia
[javascript protected email address]
toolService
Tool (Tokenization)
Language Dependent
Input
Media type:
Text
Languages:
Serbian
(sr)
, Croatian
(hr)
, Slovenian
(sl)
Resource Creation
Funding Project
Not Applicable
(N/A)
Funding Type:
Other
Metadata
Created:
21/05/2019
Last Updated:
21/05/2019
Metadata Language:
English (en)
People who looked at this resource also viewed the following:
REDI Diacritic restoration
reldi-tagger
NER system for South Slavic languages
SCStemmers - a collection of stemmers for Serbian and Croatian
Resources from the same project