Home
Browse Resources
Help
About
What is ELRC-SHARE
LR Provision
Access to ELRC-SHARE Language Resources
Licensing LRs for the ELRC action
Notice and Takedown Policy
Disclaimers and Limitation of Liability
Log information, cookies and analytics
Data Protection Record
Register
Login
57
Last view: 2024-11-21
4
Last update: 2020-04-13
Bitextor release 7.1
https://github.com/bitextor/bitextor/releases/tag/v7.1
,
https://github.com/bitextor/bitextor/releases/
,
https://github.com/bitextor/bitextor/blob/master/technical_requirements.md
Bitextor is a tool to mine the web for parallel corpora used in the ParaCrawl project. Version 7.1 includes broader document formats.
Back
Distribution
Availability:
Under Review
Licences
GPL-3.0
Distribution Details
Download location :
https://github.com/b...
Distribution Medium:
Data Downloadable
Contact Person
Miquel EsplĂ
https://www.dlsi.ua....
[javascript protected email address]
Spain (ES)
toolService
Suite Of Tools (Alignment, Bilingual Lexicon Induction, Sentence Alignment, Web Crawling)
Language Independent
Resource Creation
Funding Project
Broader Web-Scale Provision of Parallel Corpora for European Languages
(Paracrawl)
URL:
http://paracrawl.eu/
Funding Type:
Eu Funds
Funder:
European Commission
Metadata
Created:
29/06/2019
Last Updated:
29/06/2019
Metadata Language:
English (en)
Version
Version:
7.1
People who looked at this resource also viewed the following:
Domain Adaptation Filter for parallel corpora
Bitextor release 7
Collocation and Term Extractor
Bilingual Sentence Aligner
Resources from the same project