Lexical Contrast Dataset for Antonym-Synonym Distinction
- Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu
This dataset contains 600 adjective pairs (300 antonymous pairs and 300 synonymous pairs), 700 noun pairs (350 antonymous pairs and 350 synonymous pairs) and 800 verb pairs (400 antonymous pairs and 400 synonymous pairs). These pairs were drawn from the Database of Paradigmatic Semantic Relation Pairs according to the relations of antonymy and synonyms.
Kim Anh Nguyen, Sabine Schulte im Walde and Ngoc Thang Vu. Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL). Berlin, Germany, August 2016.
The resources are freely available for education, research and other non-commercial purposes. For download, click here.