Last update: 2020-06-18
COVID-19 - HEALTH Wikipedia dataset. Multilingual (53 EN-X language pairs)
Multilingual corpus (53 EN-X language pairs) acquired from Wikipedia in the health and COVID-19 domain (2 May 2020). It contains 81,671 translation units (TUs) in total.
DSI Relevance:
eHealth
Distribution
Availability:
Available
Licences
CC-BY-SA-3.0
Conditions:
Attribution, Share Alike
Distribution Details
Contact Person
Prokopis Prokopidis
Institute for Language and Speech Processing / Athena Research Center
ILSP / ATHENA R.C.
Greece
http://www.ilsp.gr, http://www.athenarc.gr
text
Multilingual text corpus
Languages
Serbian (sr)
Ukrainian (uk)
Language Script:
Cyrillic
Turkish (tr)
Language Script:
Latin
Chinese (zh)
Vietnamese (vi)
Language Script:
Latin
Swedish (sv)
Language Script:
Latin
Albanian (sq)
Language Script:
Latin
Thai (th)
Language Script:
Thai
Tagalog (tl)
Language Script:
Latin
Lithuanian (lt)
Language Script:
Latin
Latvian (lv)
Language Script:
Latin
Hungarian (hu)
Language Script:
Latin
Indonesian (id)
Language Script:
Latin
Italian (it)
Language Script:
Latin
Korean (ko)
Language Script:
Korean (alias for Hangul + Han)
Galician (gl)
Language Script:
Latin
Hebrew (he)
Language Script:
Hebrew
Hindi (hi)
Language Script:
Devanagari; Nagari
Croatian (hr)
Language Script:
Latin
Swahili (macrolanguage) (sw)
Language Script:
Latin
Serbo-Croatian (sh)
English (en)
Language Script:
Latin
Slovenian (sl)
Language Script:
Latin
Russian (ru)
Language Script:
Cyrillic
Romanian; Moldavian; Moldovan (ro)
Language Script:
Latin
Portuguese (pt)
Language Script:
Latin
Slovak (sk)
Language Script:
Latin
Belarusian (be)
Language Script:
Cyrillic
Azerbaijani (az)
Bengali (bn)
Language Script:
Bengali
Bulgarian (bg)
Language Script:
Cyrillic
French (fr)
Language Script:
Latin
Arabic (ar)
Language Script:
Arabic
Afrikaans (af)
Language Script:
Latin
Tamil (ta)
Language Script:
Tamil
Catalan; Valencian (ca)
Language Script:
Latin
Bosnian (bs)
Language Script:
Latin
Telugu (te)
Language Script:
Telugu
Danish (da)
Language Script:
Latin
Czech (cs)
Language Script:
Latin
Modern Greek (1453-) (el)
Language Script:
Greek
German (de)
Language Script:
Latin
Spanish; Castilian (es)
Language Script:
Latin
Esperanto (eo)
Language Script:
Latin
Basque (eu)
Language Script:
Latin
Estonian (et)
Language Script:
Latin
Finnish (fi)
Language Script:
Latin
Persian (fa)
Language Script:
Arabic
Polish (pl)
Language Script:
Latin
Norwegian (no)
Language Script:
Latin
Dutch; Flemish (nl)
Language Script:
Latin
Malay (macrolanguage) (ms)
Language Script:
Latin
Malayalam (ml)
Language Script:
Malayalam
Macedonian (mk)
Language Script:
Cyrillic
Linguality
Linguality type:
Multilingual
Multi-linguality type:
Parallel
Text Format
TMX
Size
81,671 Translation Units
Character encoding
UTF-8
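Since the record lists the format as TMX with UTF-8 encoding, the following is a minimal sketch of how translation units might be read with Python's standard library. It assumes the usual TMX layout (`tu` elements containing `tuv` elements with an `xml:lang` attribute and a `seg` child); the function name `iter_translation_units` and the sample sentences are invented for illustration and are not taken from the corpus.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def iter_translation_units(source):
    """Yield one dict per <tu>, mapping language code -> segment text.

    `source` may be a file path or a binary file object.
    """
    for _, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "tu":
            continue
        unit = {}
        for tuv in elem.iter("tuv"):
            # Older TMX versions use a plain "lang" attribute instead.
            lang = tuv.get(XML_LANG) or tuv.get("lang")
            seg = tuv.find("seg")
            if lang and seg is not None:
                unit[lang.lower()] = "".join(seg.itertext()).strip()
        yield unit
        elem.clear()  # release memory held by the processed <tu>


# Invented sample data, only to show the expected structure.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header creationtool="example" creationtoolversion="0" segtype="sentence"
          o-tmf="none" adminlang="en" srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Wash your hands.</seg></tuv>
      <tuv xml:lang="fr"><seg>Lavez-vous les mains.</seg></tuv>
    </tu>
  </body>
</tmx>"""

units = list(iter_translation_units(io.BytesIO(SAMPLE)))
print(units)
```

For a downloaded file, a path can be passed instead of the in-memory sample, for example `iter_translation_units("corpus.tmx")`, where the file name is hypothetical. Streaming with `iterparse` keeps memory use low on larger files.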
Domains
SOCIAL QUESTIONS
Health (Eurovoc 2841)
EUROVOC
Resource Creation
Created using ELRC Services
Funding Project
COVID-19 Initiative
(COVID-19)
Funding Type:
Other
Funding Country:
European Union (EU)
European Language Resource Coordination 3.0
(ELRC3.0 - SMART 2019/1083 LC-01325001)
URL:
http://www.lr-coordi...
Funding Type:
EU Funds
Funder:
European Commission
Funding Country:
European Union (EU)
Metadata
Created:
06/11/2019
Last Updated:
02/05/2020
Metadata Language:
English (en)
Metadata Creator
Prokopis Prokopidis
Institute for Language and Speech Processing / Athena Research Center
ILSP / ATHENA R.C.
Greece
http://www.ilsp.gr, http://www.athenarc.gr
Version
Version:
1.0
Last Updated:
02/05/2020
Relations
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AF)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-CA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-UK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-NL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TH)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ID)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-MK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SQ)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-GL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-IT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BN)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ZH)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AZ)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BG)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-CS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-DE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-DA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-LT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-KO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-NO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ML)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SW)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-RU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ES)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ET)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-PT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SH)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SV)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-PL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-VI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-RO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-MS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-LV)
Relation Type:
Has Part