For full functionality of this site it is necessary to enable JavaScript. Here are the instructions how to enable JavaScript in your web browser.

Position within the page tree

Institute for Natural Language Processing
Research
Resources
Experiment-Data
Vietnamese dataset for similarity and relatedness

Vietnamese dataset for similarity and relatedness

This dataset consists of two kinds of datasets: The first dataset, namely ViCon, comprises pairs of synonyms and antonymys across noun, verb, and adjective classes, offerring data to distinguish between similarity and dissimilarity. The second dataset ViSim-400 is a dataset of semantic relation pairs which contains degrees of similarity across five semantic relations, as rated by human judges

Vietnamese dataset for similarity and relatedness

Type: ExperimentData
Author: Kim Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu; This dataset consists of two kinds of datasets: The first dataset, namely ViCon, comprises pairs of synonyms and antonymys across noun, verb, and adjective classes, offerring data to distinguish between similarity and dissimilarity. The second dataset ViSim-400 is a dataset of semantic relation pairs which contains degrees of similarity across five semantic relations, as rated by human judges.
Reference: Kim-Anh Nguyen, Sabine Schulte im Walde, Ngoc Thang Vu (2018)
Introducing Two Vietnamese Datasets for Evaluating Semantic Models of (Dis-)Similarity and Relatedness
In: Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). New Orleans, LA.
Download: The resources are available per download under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA).

Vietnamese dataset for similarity and relatedness

Vietnamese dataset for similarity and relatedness

Sabine Schulte im Walde

Thang Vu

Audience

Formalities

Services

Organization

Vietnamese dataset for similarity and relatedness

Vietnamese dataset for similarity and relatedness

Sabine Schulte im Walde

Thang Vu

Here you can reach us

Audience

Formalities

Services

Organization