Home
Browse Resources
Help
About
What is ELRC-SHARE
LR Provision
Access to ELRC-SHARE Language Resources
Licensing LRs for the ELRC action
Notice and Takedown Policy
Disclaimers and Limitation of Liability
Log information, cookies and analytics
Register
Login
36
Last view: 2022-08-12
4
Last update: 2020-04-13
Bitextor release 7.1
https://github.com/bitextor/bitextor/releases/tag/v7.1
,
https://github.com/bitextor/bitextor/releases/
,
https://github.com/bitextor/bitextor/blob/master/technical_requirements.md
Bitextor is a tool to mine the web for parallel corpora used in the ParaCrawl project. Version 7.1 includes broader document formats.
Back
Distribution
Availability:
Under Review
Licences
GPL-3.0
Distribution Details
Download location :
https://github.com/b...
Distribution Medium:
Data Downloadable
Contact Person
Miquel EsplĂ
https://www.dlsi.ua....
[javascript protected email address]
Spain (ES)
toolService
Suite Of Tools (Alignment, Bilingual Lexicon Induction, Sentence Alignment, Web Crawling)
Language Independent
Resource Creation
Funding Project
Broader Web-Scale Provision of Parallel Corpora for European Languages
(Paracrawl)
URL:
http://paracrawl.eu/
Funding Type:
Eu Funds
Funder:
European Commission
Metadata
Created:
29/06/2019
Last Updated:
29/06/2019
Metadata Language:
English (en)
Version
Version:
7.1
People who looked at this resource also viewed the following:
Bitextor release 7
Domain Adaptation Filter for parallel corpora
Collocation and Term Extractor
Corpus Crawler
Resources from the same project