TIGER

Das TIGER-Korpus (Versionen 2.1 und 2.2) besteht aus ca. 900.000 Token (50.000 Sätze) deutscher Zeitungstexte entnommen aus der Frankfurter Rundschau

The TIGER Project

Laufzeit
1999-2004
PI
Peter Eisenberg (Potsdam), Christian Rohrer (Stuttgart), Hans Uszkoreit (Saarbrücken)
Geldgeber
Deutsche Forschungsgemeinschaft (DFG)
Langbeschreibung

TIGER was a joint project of

The project was funded by the Deutsche Forschungsgemeinschaft (DFG) from 1999 to 2004.

Tasks

The aim of the project was the creation of a large syntactically annotated corpus of German newspaper text. It comprises the following tasks:

    • Development of a scheme for the syntactic annotation of German newspaper texts. 
      The scheme should be as theory-independent as possible in order to ensure a high degree of acceptance and re-usability.

 

    • Development of new techniques for the automation of corpus annotation, 
      aiming at very fast but still very reliable and accurate annotation.

 

    • Syntactic annotation of newspaper texts
      creating the treebank TIGER Corpus based on the annotation scheme and tools for automation.

 

  • Phenomenon-based retrieval of sentences from the annotated corpus. 
    The query tool TIGERSearch for syntactically annotated text was developed and implemented.

 

 

Projekt CLARIN-D

Zum Seitenanfang