Vietnamese dataset for similarity and relatedness
- Type
-
ExperimentData
- Authors
-
Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu
-
This resource consists of two datasets. The first, ViCon, comprises pairs of synonyms and antonyms across the noun, verb, and adjective classes, offering data to distinguish between similarity and dissimilarity. The second, ViSim-400, is a dataset of semantic relation pairs that provides degrees of similarity across five semantic relations, as rated by human judges.
- Reference
-
Kim-Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu (2018)
Introducing Two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness
In: Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). New Orleans, LA.
![This image shows Sabine Schulte im Walde](https://www.ims.uni-stuttgart.de/images/team/schulte-im-walde/foto-2016_2.jpg?__scale=w:150,h:150,cx:381,cy:2,cw:681,ch:681)
Sabine Schulte im Walde
Prof. Dr., Akademische Rätin (Academic Councillor)
![This image shows Thang Vu](https://www.ims.uni-stuttgart.de/images/team/vu-thang-2019.jpg?__scale=w:150,h:150,cx:0,cy:83,cw:374,ch:374)
Thang Vu
Prof. Dr., Chair Holder for Digital Phonetics, Endowed Professorship of the Carl-Zeiss-Stiftung