English-Albanian corpus from websites of national Agencies v.1.0 
Bilingual (EN-SQ) dataset built from the content of websites of national agencies. It contains 84,747 translation units (TUs). It was generated by crawling the websites in January 2021, detecting pairs of parallel documents, identifying parallel sentence pairs within them, and filtering the results.
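The description does not say which filters were applied in the final step. As a rough illustration only, the sketch below shows the kind of rule-based filtering commonly applied to crawled sentence pairs: dropping empty segments, untranslated copies, pairs with implausible length ratios, and duplicates. The function name `filter_pairs` and the thresholds `min_ratio` and `max_ratio` are assumptions made for this example, not details of the actual pipeline.

```python
def filter_pairs(pairs, min_ratio=0.5, max_ratio=2.0):
    """Keep plausible, unique (en, sq) sentence pairs.

    Hypothetical sketch: the rules and thresholds are illustrative
    assumptions, not the filters used to build this corpus.
    """
    seen = set()
    kept = []
    for en, sq in pairs:
        en, sq = en.strip(), sq.strip()
        # Drop pairs where either side is empty.
        if not en or not sq:
            continue
        # Drop untranslated copies (source identical to target).
        if en == sq:
            continue
        # Drop pairs whose character-length ratio is implausible.
        ratio = len(en) / len(sq)
        if not (min_ratio <= ratio <= max_ratio):
            continue
        # Drop case-insensitive duplicates, keeping the first occurrence.
        key = (en.lower(), sq.lower())
        if key in seen:
            continue
        seen.add(key)
        kept.append((en, sq))
    return kept


candidates = [
    ("Hello world", "Përshëndetje botë"),
    ("Hello world", "Përshëndetje botë"),   # duplicate
    ("", "x"),                              # empty source side
    ("Contact", "Contact"),                 # untranslated copy
    ("Hi", "Një fjali shumë e gjatë që nuk përputhet"),  # length mismatch
]
clean = filter_pairs(candidates)
```

With these sample inputs only the first pair survives. Real pipelines typically add language identification and alignment-score thresholds on top of rules like these.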