Domain-Specific Dataset of Difficulty Ratings for German Noun Compounds in the Domains DIY, Cooking and Automotive
- Type
-
ExperimentData
- Authors
-
Julia Bettinger, Anna Hätty, Michael Dorna, Sabine Schulte im Walde
- Description
-
The dataset contains difficulty ratings for 1,030 German closed noun compounds extracted from domain-specific texts in three domains: do-it-yourself (DIY), cooking and automotive. It includes two-part compounds for cooking and DIY, and two- to four-part compounds for automotive. The compounds were identified in text using the Simple Compound Splitter (Weller-Di Marco, 2017); a subset was then filtered and balanced by frequency and productivity criteria to serve as the basis for manual annotation and fine-grained interpretation. The final dataset includes ratings from 20 annotators.
- Download
-
Please contact the SemRel group to obtain the data.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA).
Research Group SemRel
- Research Group Sabine Schulte im Walde
Sabine Schulte im Walde
Prof. Dr., Akademische Rätin (Associate Professor)