ANVAN-LS: Lexical Substitution for Evaluating Compositional Distributional Models

Type
Corpus
Author
Sebastian Padó
Description

ANVAN-LS is a lexical substitution dataset for evaluating compositional distributional semantic models (CDSMs). It was sampled from an English-language corpus with manual “all-words” lexical substitution annotation. All sentences follow the traditional adjective-noun-verb-adjective-noun (ANVAN) format.

Reference

Maja Buljan, Sebastian Padó, Jan Šnajder. Lexical Substitution for Evaluating Compositional Distributional Models. Proceedings of NAACL 2018, New Orleans, LA.

Download

Data (.txt)

Readme (.txt)
