TIGER

The TIGER Corpus (versions 2.1 and 2.2) consists of approximately 900,000 tokens (50,000 sentences) of German newspaper text taken from the Frankfurter Rundschau.

The TIGER Project

Term: 1999-2004
PI: Peter Eisenberg (Potsdam), Christian Rohrer (Stuttgart), Hans Uszkoreit (Saarbrücken)
Sponsor: Deutsche Forschungsgemeinschaft (DFG)
Long description

TIGER was a joint project of research groups in Potsdam, Stuttgart, and Saarbrücken.

The project was funded by the Deutsche Forschungsgemeinschaft (DFG) from 1999 to 2004.

Tasks

The aim of the project was to create a large, syntactically annotated corpus of German newspaper text. The work comprised the following tasks:

    • Development of a scheme for the syntactic annotation of German newspaper texts.
      The scheme should be as theory-independent as possible in order to ensure a high degree of acceptance and re-usability.

    • Development of new techniques for automating corpus annotation,
      aiming at very fast yet reliable and accurate annotation.

    • Syntactic annotation of newspaper texts,
      creating the TIGER Corpus treebank on the basis of the annotation scheme and the automation tools.

    • Phenomenon-based retrieval of sentences from the annotated corpus.
      The query tool TIGERSearch for syntactically annotated text was developed and implemented.
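
To give a rough idea of what phenomenon-based retrieval means, the following is a minimal sketch in Python. It does not use TIGERSearch, its query language, or the corpus's file format. The `Node` class, the `retrieve` function, the two toy sentences, and the node labels are all assumptions made up for illustration. The "phenomenon" here is simply a node with one label directly dominating a node with another label.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class Node:
    """A node in a toy syntax tree (hypothetical structure, not the TIGER format)."""
    label: str                      # category or part-of-speech label, e.g. "NP"
    word: str = ""                  # surface form; empty for non-terminal nodes
    children: List["Node"] = field(default_factory=list)

    def walk(self) -> Iterator["Node"]:
        """Yield this node and all of its descendants, depth-first."""
        yield self
        for child in self.children:
            yield from child.walk()


def has_phenomenon(tree: Node, parent_label: str, child_label: str) -> bool:
    """True if a node labelled parent_label directly dominates one labelled child_label."""
    return any(
        node.label == parent_label
        and any(child.label == child_label for child in node.children)
        for node in tree.walk()
    )


def retrieve(corpus: List[Node], parent_label: str, child_label: str) -> List[int]:
    """Return the indices of all sentences whose tree contains the pattern."""
    return [
        i for i, tree in enumerate(corpus)
        if has_phenomenon(tree, parent_label, child_label)
    ]


# Two invented sentences with illustrative labels (not taken from the corpus).
TOY_CORPUS = [
    # "Der Hund schläft"
    Node("S", children=[
        Node("NP", children=[Node("ART", "Der"), Node("NN", "Hund")]),
        Node("VVFIN", "schläft"),
    ]),
    # "Er schläft"
    Node("S", children=[Node("PPER", "Er"), Node("VVFIN", "schläft")]),
]

# Which sentences contain an NP that directly dominates a noun?
matches = retrieve(TOY_CORPUS, "NP", "NN")
```

With these two toy sentences, only the first one contains an NP node that directly dominates an NN node, so `matches` holds its index. A real query tool would offer far richer patterns (precedence, indirect dominance, feature constraints) and indexing for speed; the sketch only shows the basic idea of selecting sentences by structural criteria rather than by word strings.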

 

 

Project CLARIN-D
