TIGER

The TIGER Corpus (versions 2.1 and 2.2) consists of approximately 900,000 tokens (50,000 sentences) of German newspaper text taken from the Frankfurter Rundschau.

The TIGER Project

Duration
1999-2004
PI
Peter Eisenberg (Potsdam), Christian Rohrer (Stuttgart), Hans Uszkoreit (Saarbrücken)
Funding body
Deutsche Forschungsgemeinschaft (DFG)
Detailed description

TIGER was a joint project of the groups led by the principal investigators in Potsdam, Stuttgart, and Saarbrücken.

The project was funded by the Deutsche Forschungsgemeinschaft (DFG) from 1999 to 2004.

Tasks

The aim of the project was to create a large, syntactically annotated corpus of German newspaper text. It comprised the following tasks:

  • Development of a scheme for the syntactic annotation of German newspaper texts. 
    The scheme should be as theory-independent as possible in order to ensure a high degree of acceptance and re-usability.

  • Development of new techniques for the automation of corpus annotation, 
    aiming at very fast but still very reliable and accurate annotation.

  • Syntactic annotation of newspaper texts,
    creating the TIGER Corpus treebank based on the annotation scheme and the automation tools.

  • Phenomenon-based retrieval of sentences from the annotated corpus. 
    The query tool TIGERSearch for syntactically annotated text was developed and implemented.
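
To make the idea of phenomenon-based retrieval concrete, the following is a minimal sketch in Python. It does not use TIGERSearch itself. It assumes a small, invented fragment written in the style of the TIGER-XML format (terminals with a `pos` attribute, nonterminals with a `cat` attribute, labelled `edge` elements), and the helper name `find_sentences` is hypothetical. It retrieves every sentence in which a node of one category directly dominates a node of another category via a given edge label.

```python
import xml.etree.ElementTree as ET

# Invented toy fragment in the style of TIGER-XML (assumption: not real corpus data).
TOY_XML = """
<corpus>
  <body>
    <s id="s1">
      <graph root="s1_500">
        <terminals>
          <t id="s1_1" word="Der" pos="ART"/>
          <t id="s1_2" word="Hund" pos="NN"/>
          <t id="s1_3" word="bellt" pos="VVFIN"/>
        </terminals>
        <nonterminals>
          <nt id="s1_501" cat="NP">
            <edge label="NK" idref="s1_1"/>
            <edge label="NK" idref="s1_2"/>
          </nt>
          <nt id="s1_500" cat="S">
            <edge label="SB" idref="s1_501"/>
            <edge label="HD" idref="s1_3"/>
          </nt>
        </nonterminals>
      </graph>
    </s>
    <s id="s2">
      <graph root="s2_500">
        <terminals>
          <t id="s2_1" word="Komm" pos="VVIMP"/>
          <t id="s2_2" word="her" pos="ADV"/>
        </terminals>
        <nonterminals>
          <nt id="s2_500" cat="S">
            <edge label="HD" idref="s2_1"/>
            <edge label="MO" idref="s2_2"/>
          </nt>
        </nonterminals>
      </graph>
    </s>
  </body>
</corpus>
"""


def find_sentences(corpus, parent_cat, edge_label, child_label):
    """Return IDs of sentences where a nonterminal of category `parent_cat`
    has an edge labelled `edge_label` to a node whose category (nonterminal)
    or part-of-speech tag (terminal) equals `child_label`."""
    hits = []
    for s in corpus.iter("s"):
        # Map every node ID in this sentence to its tag: pos for terminals, cat for nonterminals.
        labels = {t.get("id"): t.get("pos") for t in s.iter("t")}
        labels.update({nt.get("id"): nt.get("cat") for nt in s.iter("nt")})
        for nt in s.iter("nt"):
            if nt.get("cat") != parent_cat:
                continue
            if any(
                e.get("label") == edge_label
                and labels.get(e.get("idref")) == child_label
                for e in nt.findall("edge")
            ):
                hits.append(s.get("id"))
                break  # one match per sentence is enough
    return hits


corpus = ET.fromstring(TOY_XML)
# Sentences whose S node has a subject (SB) realised as an NP.
print(find_sentences(corpus, "S", "SB", "NP"))
```

A dedicated query tool adds indexing and a richer constraint language on top of this kind of lookup; the sketch only illustrates the underlying notion of retrieving sentences by a structural pattern rather than by word form.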

 

 
