Lemma Correction Script

Scripts for converting TreeTagger POS tags into MulText and for correcting lemmas output by the RFTagger and the TreeTagger

Type: Tool
Author: Anita Ramm
Description

The tool was created in the context of the TTC project and contains scripts for:

  1. converting TreeTagger POS tags into MulText for DE, EN, ES and FR
    (EN, ES and FR: only nouns, adjectives and verbs)
  2. correcting lemmas output by the RFTagger for DE
    and by the TreeTagger for the other languages
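
The two steps above can be illustrated with a minimal Python sketch. It is not the code shipped in the download. It assumes the usual TreeTagger output of one token per line with tab-separated token, tag and lemma, and the prefixes of the English TreeTagger tagset (NN, JJ, VV). The function names, the coarse target categories (N, A, V) and the correction rules (fall back to the word form for "<unknown>", keep the first of several "|"-separated lemmas) are assumptions made for illustration only. The actual scripts may map to finer-grained MulText tags and apply different corrections.

```python
# Illustrative sketch only: the mapping and correction rules are assumptions,
# not the tables or rules used by the actual tool.

# Hypothetical prefix table: English TreeTagger tag prefix -> coarse category.
COARSE_CATEGORY = {"NN": "N", "JJ": "A", "VV": "V"}


def convert_tag(tt_tag):
    """Return a coarse MulText-style category for a TreeTagger tag, or None."""
    for prefix, category in COARSE_CATEGORY.items():
        if tt_tag.startswith(prefix):
            return category
    return None  # tag is outside the noun/adjective/verb subset


def correct_lemma(token, lemma):
    """Apply two simple, assumed lemma corrections."""
    if lemma == "<unknown>":
        # TreeTagger marks unknown words this way; fall back to the word form.
        return token
    # Ambiguous lemmas are joined with "|"; keep the first alternative.
    return lemma.split("|")[0]


def process_line(line):
    """Convert one tab-separated TreeTagger output line."""
    token, tag, lemma = line.rstrip("\n").split("\t")
    return token, convert_tag(tag), correct_lemma(token, lemma)


if __name__ == "__main__":
    for sample in ["houses\tNNS\thouse", "blorfs\tNNS\t<unknown>"]:
        print(process_line(sample))
```

Returning None for tags outside the covered subset keeps the restriction to nouns, adjectives and verbs explicit instead of guessing a category.
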
Reference

Anita Gojun, Ulrich Heid, Bernd Weissbach, Carola Loth, Insa Mingers (2012). Adapting and evaluating a generic term extraction tool. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC). Istanbul, Turkey.

Download

.zip (version 1.0)

Ulrich Heid

Apl. Prof. PD Dr.