Filter by:

English (14)
German (14)
French (12)
Italian (9)
Chinese (4)
Czech (4)
Danish (4)
Finnish (4)
Polish (4)
Slovak (4)
Arabic (3)
Basque (3)
Latvian (3)
Russian (3)
Swedish (3)
Hebrew (2)
Hindi (2)
Irish (2)
Kazakh (2)
Korean (2)
Kurdish (2)
Latin (2)
Persian (2)
Serbian (2)
Turkish (2)
Urdu (2)
Amharic (1)
Aymara (1)
Bengali (1)
Bosnian (1)
Breton (1)
Buriat (1)
Burmese (1)
Cornish (1)
Gothic (1)
Hausa (1)
Ido (1)
Kabyle (1)
Marathi (1)
Sindhi (1)
Somali (1)
Tagalog (1)
Tajik (1)
Tamil (1)
Tatar (1)
Telugu (1)
Thai (1)
Uzbek (1)
Text (15)
JSON (1)
N/ A (15)

Resource Type:

Corpus:
Lexical/Conceptual:
Language Description:
Tool/Service:

15 Language Resources

Order by:

 A Massive Spanish Crawling Corpus
Number of downloads 0 Number of views 25
  • Spanish; Castilian
  • CC-BY-NC-ND-4.0
 Apache Tika - a content analysis toolkit
Number of downloads 0 Number of views 112
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hungarian
  • Icelandic
  • Italian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Spanish; Castilian
  • Swedish
  • Apache-2.0
 FreeLing
Number of downloads 0 Number of views 62
  • Asturian; Asturleonese; Bable; Leonese
  • Catalan; Valencian
  • Croatian
  • English
  • French
  • Galician
  • German
  • Italian
  • Norwegian
  • Portuguese
  • Russian
  • Slovenian
  • Spanish; Castilian
  • Welsh
  • AGPL-3.0
 GATE -- a full-lifecycle open source solution for text processing
Number of downloads 0 Number of views 37
  • Dutch; Flemish
  • English
  • French
  • German
  • Hungarian
  • Spanish; Castilian
  • GPL-3.0
 IXA pipes
Number of downloads 0 Number of views 33
  • Basque
  • Dutch; Flemish
  • English
  • French
  • Galician
  • German
  • Italian
  • Spanish; Castilian
  • Apache-2.0
 LASER: Language-Agnostic SEntence Representations
Number of downloads 0 Number of views 144
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Aymara
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Berber languages
  • Bosnian
  • Breton
  • Bulgarian
  • Burmese
  • Catalan; Valencian
  • Central Dusun
  • Central Khmer
  • Chavacano
  • Chinese
  • Coastal Kadazan
  • Cornish
  • Croatian
  • Czech
  • Danish
  • Dhivehi; Divehi; Maldivian
  • Dutch; Flemish
  • Eastern Mari
  • English
  • Esperanto
  • Estonian
  • Finnish
  • French
  • Galician
  • Georgian
  • German
  • Hausa
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Ido
  • Indonesian
  • Interlingua (International Auxiliary Language Association)
  • Interlingue; Occidental
  • Irish
  • Italian
  • Japanese
  • Kabyle
  • Kazakh
  • Korean
  • Kurdish
  • Latin
  • Latvian
  • Lingua Franca Nova
  • Lithuanian
  • Low German; Low Saxon
  • Macedonian
  • Malagasy
  • Malay (individual language)
  • Malayalam
  • Marathi
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Occitan (post 1500)
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Sindhi
  • Sinhala; Sinhalese
  • Slovak
  • Slovenian
  • Somali
  • Spanish; Castilian
  • Swahili (individual language); Kiswahili
  • Swedish
  • Tagalog
  • Tajik
  • Tamil
  • Tatar
  • Telugu
  • Thai
  • Turkish
  • Uighur; Uyghur
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Wu Chinese
  • Yue Chinese
  • BSD-3-Clause
 LEXACC - Lucene-based parallel phrase EXtractor from Comparable Corpora
Number of downloads 0 Number of views 32
  • Croatian
  • English
  • German
  • Latvian
  • Lithuanian
  • Modern Greek (1453-)
  • Romanian; Moldavian; Moldovan
  • Slovenian
  • Spanish; Castilian
  • Non-standard/ Other Licence/ Terms
 NLP-Cube
Number of downloads 0 Number of views 96
  • Afrikaans
  • Ancient Greek (to 1453)
  • Arabic
  • Armenian
  • Basque
  • Bulgarian
  • Buriat
  • Catalan; Valencian
  • Chinese
  • Church Slavic; Church Slavonic; Old Bulgarian; Old Church Slavonic; Old Slavonic
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • Galician
  • German
  • Gothic
  • Hebrew
  • Hindi
  • Hungarian
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Kazakh
  • Korean
  • Kurdish
  • Latin
  • Latvian
  • Modern Greek (1453-)
  • Norwegian Bokmål
  • Norwegian Nynorsk
  • Persian
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Sami languages
  • Serbian
  • Slovak
  • Spanish; Castilian
  • Swedish
  • Turkish
  • Uighur; Uyghur
  • Ukrainian
  • Upper Sorbian
  • Urdu
  • Vietnamese
  • Apache-2.0
 OpeNER suite of tools
Number of downloads 0 Number of views 51
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian
  • Apache-2.0
 RACAI Translation System
Number of downloads 0 Number of views 34
  • English
  • German
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Spanish; Castilian
  • Non-standard/ Other Licence/ Terms
 Shallow Processing with Unification and Typed Feature Structures
Number of downloads 0 Number of views 26
  • Chinese
  • Czech
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Japanese
  • Polish
  • Spanish; Castilian
  • Non-standard/ Other Licence/ Terms
 spaCy
Number of downloads 0 Number of views 46
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Portuguese
  • Spanish; Castilian
  • MIT
 Stanford CoreNLP
Number of downloads 0 Number of views 48
  • Arabic
  • Chinese
  • English
  • French
  • German
  • Spanish; Castilian
  • GPL-3.0
 TreeTagger -- A chunker for English, German, French, and Spanish
Number of downloads 0 Number of views 35
  • English
  • French
  • German
  • Spanish; Castilian
  • Non-standard/ Other Licence/ Terms
 TreeTagger - a part-of-speech tagger for many languages
Number of downloads 0 Number of views 85
  • Bulgarian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Non-standard/ Other Licence/ Terms