Financial English Crawling 
The Financial English Crawling is a 67-million-token corpus of English built from the web by targeting specific in-domain urls that belong to the finance sector such as bank websites, finance resource sites, finance blogs and forums on banking and economy-related issues. It consists of 67,732,749 tokens, 3,672,407 sentences and 167,731 documents.
Documents are separated by single new lines.
The corpus has been developed in the framework of the CEF project MT4ALL (http://ixa2.si.ehu.eus/mt4all/project)
We license the actual packaging of this data under a CC0 1.0 Universal License.
People who looked at this resource also viewed the following:
- Compilation of English-Latvian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Irish-Lithuanian parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of Bulgarian-Maltese parallel corpora resources used for training of NTEU Machine Translation engines.
- Compilation of German-Lithuanian parallel corpora resources used for training of NTEU Machine Translation engines.