Domain-Specific Dataset of Difficulty Ratings for German Noun Compounds

Domains: DIY, Cooking and Automotive

Domain-Specific Dataset of Difficulty Ratings for German Noun Compounds in the Domains DIY, Cooking and Automotive

Typ

ExperimentData

Autor

Julia Bettinger, Anna Hätty, Michael Dorna, Sabine Schulte im Walde

Beschreibung

The dataset contains difficulty ratings for 1,030 German closed noun compounds extracted from domain-specific texts for do-it-yourself (DIY), cooking and automotive. It includes two-part compounds for cooking and DIY, and two- to four-part compounds for automotive. The compounds were identified in text using the Simple Compound Splitter (Weller-Di Marco, 2017); a subset was filtered and balanced for frequency and productivity criteria as basis for manual annotation and fine-grained interpretation. The final dataset was annotated with ratings from 20 annotators.

Download

Der Datensatz ist frei verfügbar im Rahmen von Creative Commons Share Alike.

Sabine Schulte im Walde
Apl. Prof. Dr.

Sabine Schulte im Walde

Akademische Rätin

Zum Seitenanfang