DAESO - headlines_paraphrases_automatic_intersects (Processed)
Detecting semantic overlap:A parallel monolingual treebank for Dutch. Erwin Marsi and Emiel Krahmer. CLIN, 2007
From the DAESO corpus: Comparable text/paraphrases from automaticlaly generated clustered headlines of similar news articles, having a "intersects" relation. Contains 27400 TMX pairs (NL-NL), consisting of longer sequences (phrases and paragraphs).
People who looked at this resource also viewed the following:
- Bilingual English-Norwegian parallel corpus from Altinn website
- Dutch laws as Dutch monolingual corpus from www.overheid.nl web site
- Bilingual English-Norwegian parallel corpus from The Norwegian Petroleum Directorate website
- Collection of business legislation terminology from Term Bank of Republic of Lithuania