Views: 211 (last view: 2024-12-16)
Updates: 4 (last update: 2021-10-01)
Downloads: 112 (last download: 2024-12-11)
COVID-19 - HEALTH Wikipedia dataset. Multilingual (52 EN-X language pairs)
Multilingual corpus (52 EN-X language pairs) acquired from Wikipedia in the health and COVID-19 domains on 2 May 2020. It contains 81,671 translation units (TUs) in total.
DSI Relevance:
eHealth
Distribution
Availability:
Available
Licences
CC-BY-SA-3.0
Conditions:
Attribution, Share Alike
Distribution Details
Contact Person
Prokopis Prokopidis
Institute for Language and Speech Processing / Athena Research Center
ILSP / ATHENA R.C.
Greece
http://www.ilsp.gr, http://www.athenarc.gr
text
Multilingual text corpus
Languages
Swedish (sv)
Language Script:
Latin
Vietnamese (vi)
Language Script:
Latin
Ukrainian (uk)
Language Script:
Cyrillic
Chinese (zh)
Swahili (macrolanguage) (sw)
Language Script:
Latin
Serbian (sr)
Tagalog (tl)
Language Script:
Latin
Turkish (tr)
Language Script:
Latin
Lithuanian (lt)
Language Script:
Latin
Latvian (lv)
Language Script:
Latin
Hungarian (hu)
Language Script:
Latin
Indonesian (id)
Language Script:
Latin
Italian (it)
Language Script:
Latin
Korean (ko)
Language Script:
Korean (alias for Hangul + Han)
Galician (gl)
Language Script:
Latin
Hebrew (he)
Language Script:
Hebrew
Hindi (hi)
Language Script:
Devanagari; Nagari
Croatian (hr)
Language Script:
Latin
Tamil (ta)
Language Script:
Tamil
Slovak (sk)
Language Script:
Latin
English (en)
Language Script:
Latin
Albanian (sq)
Language Script:
Latin
Russian (ru)
Language Script:
Cyrillic
Romanian; Moldavian; Moldovan (ro)
Language Script:
Latin
Portuguese (pt)
Language Script:
Latin
Slovenian (sl)
Language Script:
Latin
Belarusian (be)
Language Script:
Cyrillic
Azerbaijani (az)
Bengali (bn)
Language Script:
Bengali
Bulgarian (bg)
Language Script:
Cyrillic
French (fr)
Language Script:
Latin
Arabic (ar)
Language Script:
Arabic
Afrikaans (af)
Language Script:
Latin
Telugu (te)
Language Script:
Telugu
Catalan; Valencian (ca)
Language Script:
Latin
Bosnian (bs)
Language Script:
Latin
Thai (th)
Language Script:
Thai
Danish (da)
Language Script:
Latin
Czech (cs)
Language Script:
Latin
Modern Greek (1453-) (el)
Language Script:
Greek
German (de)
Language Script:
Latin
Spanish; Castilian (es)
Language Script:
Latin
Esperanto (eo)
Language Script:
Latin
Basque (eu)
Language Script:
Latin
Estonian (et)
Language Script:
Latin
Finnish (fi)
Language Script:
Latin
Persian (fa)
Language Script:
Arabic
Polish (pl)
Language Script:
Latin
Norwegian (no)
Language Script:
Latin
Dutch; Flemish (nl)
Language Script:
Latin
Malay (macrolanguage) (ms)
Language Script:
Latin
Malayalam (ml)
Language Script:
Malayalam
Macedonian (mk)
Language Script:
Cyrillic
Linguality
Linguality type:
Multilingual
Multi-linguality type:
Parallel
Text Format
TMX
Size
81,671 Translation Units
Character encoding
UTF-8
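Because the corpus is distributed as UTF-8 TMX, its translation units could be read along the following lines. This is a minimal sketch, not ELRC tooling: it assumes the usual TMX layout (`tu` elements containing `tuv` variants that carry an `xml:lang` attribute and a `seg` child). The function name `iter_translation_units` and the two-unit inline sample are invented for illustration; the name of the file inside the actual download is not stated in this record.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def iter_translation_units(source):
    """Yield one dict per <tu>, mapping language code -> segment text.

    `source` may be a file path or a binary file object.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "tu":
            continue
        unit = {}
        for tuv in elem.findall("tuv"):
            # Older TMX files use a plain "lang" attribute instead of xml:lang.
            lang = tuv.get(XML_LANG) or tuv.get("lang")
            seg = tuv.find("seg")
            if lang is not None and seg is not None:
                unit[lang.lower()] = "".join(seg.itertext()).strip()
        yield unit
        elem.clear()  # release the finished unit to keep memory use low


# Hypothetical sample, NOT taken from the dataset.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header srclang="en" datatype="plaintext" segtype="sentence"
          adminlang="en" o-tmf="none" creationtool="example"
          creationtoolversion="0"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Wash your hands.</seg></tuv>
      <tuv xml:lang="fr"><seg>Lavez-vous les mains.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en"><seg>Stay at home.</seg></tuv>
      <tuv xml:lang="fr"><seg>Restez chez vous.</seg></tuv>
    </tu>
  </body>
</tmx>"""

units = list(iter_translation_units(io.BytesIO(SAMPLE)))
for unit in units:
    print(unit["en"], "|", unit["fr"])
```

For the real download, the path of the extracted TMX file would be passed in place of the `io.BytesIO` object. Counting the yielded units gives a quick check against the size stated in this record.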
Domains
SOCIAL QUESTIONS
Health (Eurovoc 2841)
EUROVOC
Resource Creation
Created using ELRC Services
Funding Project
COVID-19 Initiative
(COVID-19)
Funding Type:
Other
Funding Country:
European Union (EU)
European Language Resource Coordination 3.0
(ELRC3.0 - SMART 2019/1083 LC-01325001)
URL:
http://www.lr-coordi...
Funding Type:
EU Funds
Funder:
European Commission
Funding Country:
European Union (EU)
Metadata
Created:
06/11/2019
Last Updated:
02/05/2020
Metadata Language:
English (en)
Metadata Creator
Prokopis Prokopidis
Institute for Language and Speech Processing / Athena Research Center
ILSP / ATHENA R.C.
Greece
http://www.ilsp.gr, http://www.athenarc.gr
Version
Version:
1.0
Last Updated:
02/05/2020
Relations
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AF)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-CA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-UK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-NL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TH)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ID)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-MK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SQ)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-GL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-IT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BN)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ZH)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-EL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AZ)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-BG)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-CS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-DE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HE)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-DA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-TA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-LT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-KO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-NO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ML)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SW)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-RU)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SK)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ES)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-FA)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-ET)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-PT)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-HR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SV)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-AR)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-PL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-SL)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-VI)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-RO)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-MS)
Relation Type:
Has Part
Related Resource:
COVID-19 - HEALTH Wikipedia dataset. Bilingual (EN-LV)
Relation Type:
Has Part