English-Albanian corpus from websites of national Agencies v.1.0 
Bilingual dataset (EN-SQ) based on the content of websites of national agencies. It includes 84747 Translation Units. It was generated by crawling the websites in January 2021, detecting pairs of parallel documents, identifying parallel sentence pairs and filtering the results.
People who looked at this resource also viewed the following:
- SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in MOSES format.
- Multilingual (EN, MK, SQ) corpus from websites of government of North Macedonia v.1.04
- Multilingual (EN, MK, SQ) corpus from websites of government of North Macedonia v.1.0
- Monolingual Albanian corpus from websites of government of Albania (part 1)
People who downloaded this resource also downloaded the following: