LinguA is a state-of-the-art linguistic annotation pipeline which combines rule-based and machine learning algorithms. It includes the following annotation steps: Sentence splitting Tokenization Part-of-speech tagging and lemmatization Dependency parsing