Global Voices monolingual collections of COVID-19-related documents.

The "Covid Parallel Global Voices" dataset was created for the European Language Resources Coordination Action (ELRC) by researchers at the NLP group of the Institute for Language and Speech Processing, with primary data copyrighted by Global Voices, and is licensed under "CC-BY 3.0".

The collection was generated from content available at . It includes 2601 documents in total, distributed across the following languages (language code, number of documents):
EN 571
DE 51
ES 595
FR 539
IT 446
EL 328
SV 5
UK 66
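
As a quick consistency check, the per-language counts listed above can be summed and compared with the stated total of 2601 documents. The following is a minimal sketch in Python; the variable names (doc_counts, total, shares) are hypothetical and chosen for illustration only, and the figures are copied by hand from the list rather than read from the dataset itself.

```python
# Per-language document counts, copied by hand from the list above.
# The dictionary name and layout are illustrative assumptions, not part of the dataset.
doc_counts = {
    "EN": 571,
    "DE": 51,
    "ES": 595,
    "FR": 539,
    "IT": 446,
    "EL": 328,
    "SV": 5,
    "UK": 66,
}

# Sum the counts so they can be compared with the stated total.
total = sum(doc_counts.values())

# Compute each language's share of the collection, largest first.
shares = sorted(
    ((lang, count / total) for lang, count in doc_counts.items()),
    key=lambda item: item[1],
    reverse=True,
)

for lang, share in shares:
    print(f"{lang}: {doc_counts[lang]:>4} documents ({share:.1%})")
print(f"Total: {total}")
```

The shares make the imbalance between languages visible at a glance, which may matter when sampling from the collection.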

DSI Relevance: eHealth