DErivBase

Type: Lexicon
Title: DErivBase
Author: Sebastian Padó

Description

DErivBase is a large-coverage derivational lexicon for German (Zeller et al., 2013). It consists of derivational families, i.e., groups of lemmas that are derivationally related to one another. Since v2.0, these derivational families are automatically split into semantically consistent clusters (Zeller et al., 2014). Version 2.0 covers 280,336 lemmas; 65,420 of them are grouped into 20,371 non-singleton families, and the remaining 214,916 each form a singleton family. The lexicon was built with HOFM, a rule-based framework written in Haskell.
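The relationship between these counts can be illustrated with a minimal sketch. It assumes a lexicon held in memory as a mapping from lemma to family ID; the toy lemmas, the IDs, and the `family_stats` helper are hypothetical and do not reflect the actual DErivBase file format, which is described in the documentation.

```python
from collections import Counter

# Toy data, NOT taken from the DErivBase files: each lemma maps to a
# made-up family ID. Lemmas with the same ID form one derivational family.
toy_family_of = {
    "lesen": 1, "Leser": 1, "lesbar": 1,
    "trinken": 2, "Trinker": 2,
    "Haus": 3,
    "und": 4,
}

def family_stats(family_of):
    """Return (total lemmas, lemmas in non-singleton families,
    number of non-singleton families, number of singleton families)."""
    sizes = Counter(family_of.values())  # family ID -> number of lemmas
    non_singleton = {fid for fid, n in sizes.items() if n > 1}
    grouped = sum(sizes[fid] for fid in non_singleton)
    singletons = len(sizes) - len(non_singleton)
    return len(family_of), grouped, len(non_singleton), singletons

print(family_stats(toy_family_of))

# Figures reported for v2.0: every lemma outside a non-singleton family
# forms a singleton family of its own.
TOTAL_LEMMAS = 280_336
GROUPED_LEMMAS = 65_420
print(TOTAL_LEMMAS - GROUPED_LEMMAS)
```

On the toy data, two families have more than one member and two are singletons; the final subtraction reproduces the reported number of singleton families.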

 

 


References

@InProceedings{zellerEtAl:13,
  author = {Zeller, Britta and \v{S}najder, Jan and Pad{\'o}, Sebastian},
  title = {{DE}riv{B}ase: Inducing and Evaluating a Derivational Morphology Resource for {G}erman},
  booktitle = {{Proceedings of ACL 2013}},
  year = {2013},
  address = {Sofia, Bulgaria},
  pages = {1201--1211},
  url = {www.aclweb.org/anthology/P13-1118.pdf},
}
@InProceedings{padoEtAl:13,
  author = {Pad{\'o}, Sebastian and \v{S}najder, Jan and Zeller, Britta},
  title = {Derivational Smoothing for Syntactic Distributional Semantics},
  booktitle = {{Proceedings of ACL 2013}},
  year = {2013},
  address = {Sofia, Bulgaria},
  pages = {731--735},
  url = {www.aclweb.org/anthology/P13-2128},
}
@InProceedings{zeller-pado-vsnajder:2014:Coling,
  author = {Zeller, Britta  and  Pad\'{o}, Sebastian  and  \v{S}najder, Jan},
  title = {Towards Semantic Validation of a Derivational Lexicon},
  booktitle = {Proceedings of COLING 2014, the 25th International Conference on
    Computational Linguistics: Technical Papers},
  month = {August},
  year = {2014},
  address = {Dublin, Ireland},
  publisher = {Dublin City University and Association for Computational Linguistics},
  pages = {1728--1739},
  url = {http://www.aclweb.org/anthology/C14-1163}
}

Download

Choose the DErivBase version you need from the table below. For earlier versions, please send a request to zeller at cl dot uni-heidelberg dot de.

DErivBase version | Features | Download link
v2.0 | Morphological families are split into semantically coherent subclusters | DErivBase-v2.0.zip
v1.4.1 | Morphological families built with 267 derivation rules (incl. meaning-changing prefixations) + bugfix | DErivBase-v1.4.1.zip
v1.4 | Morphological families built with 267 derivation rules (incl. meaning-changing prefixations) | DErivBase-v1.4.zip



The gold-annotated evaluation datasets described in the paper are available here: test-samples.tar

For further details on DErivBase's build process, versions, and data format, please consult the documentation.

If you use the lexicon, please cite the paper "DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German" (Zeller et al., 2013), listed in the reference section above.



DErivBase is made available under the Creative Commons license CC BY-SA 3.0. By downloading the software and/or lexicon, you acknowledge the terms and conditions of the CC BY-SA license.