TIGER

The TIGER Corpus (versions 2.1 and 2.2) consists of approximately 900,000 tokens (50,000 sentences) of German newspaper text taken from the Frankfurter Rundschau.

The TIGER Project

Term: 1999-2004

PI: Peter Eisenberg (Potsdam), Christian Rohrer (Stuttgart), Hans Uszkoreit (Saarbrücken)

Sponsor: Deutsche Forschungsgemeinschaft (DFG)

Long description

TIGER was a joint project of

the Department of Computational Linguistics and Phonetics in Saarbrücken,
the Institute for Natural Language Processing (IMS) in Stuttgart,
and the Institut für Germanistik in Potsdam.

The project was funded by the Deutsche Forschungsgemeinschaft (DFG) from 1999 to 2004.

Tasks

The aim of the project was the creation of a large syntactically annotated corpus of German newspaper text. The project comprised the following tasks:

Development of a scheme for the syntactic annotation of German newspaper texts.
The scheme should be as theory-independent as possible in order to ensure a high degree of acceptance and re-usability.

Development of new techniques for the automation of corpus annotation,
aiming at very fast but still very reliable and accurate annotation.

Syntactic annotation of newspaper texts.
The TIGER Corpus treebank was created using the annotation scheme and the tools for automation.

Phenomenon-based retrieval of sentences from the annotated corpus.
The query tool TIGERSearch for syntactically annotated text was developed and implemented.
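
To make the idea of phenomenon-based retrieval concrete, here is a minimal sketch in Python. It does not use TIGERSearch, its query language, or the TIGER Corpus file format; the `Node` class, the `has_pattern` function, the two toy sentences, and their labels are all assumptions made for illustration. The sketch shows only the general principle: sentences are stored as trees, and a query selects the sentences whose tree contains a given structural pattern.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class Node:
    """A tree node: phrases carry a category, tokens carry a POS tag and a word."""
    label: str                      # phrase category (e.g. "NP") or POS tag (e.g. "NN")
    word: Optional[str] = None      # set only for tokens (leaves)
    children: List["Node"] = field(default_factory=list)

    def walk(self) -> Iterator["Node"]:
        """Yield this node and all of its descendants, depth-first."""
        yield self
        for child in self.children:
            yield from child.walk()


def has_pattern(tree: Node, parent_label: str, child_label: str) -> bool:
    """Return True if a node labelled parent_label has a direct child labelled child_label."""
    return any(
        node.label == parent_label
        and any(child.label == child_label for child in node.children)
        for node in tree.walk()
    )


# Two hand-built toy sentences (hypothetical, not taken from the TIGER Corpus).
s1 = Node("S", children=[
    Node("NP", children=[Node("ART", "Der"), Node("NN", "Hund")]),
    Node("VVFIN", "schläft"),
])
s2 = Node("S", children=[
    Node("PPER", "Er"),
    Node("VVFIN", "schläft"),
])

corpus = [s1, s2]

# Retrieve the indices of all sentences containing an NP that directly dominates an article.
hits = [i for i, tree in enumerate(corpus) if has_pattern(tree, "NP", "ART")]
print(hits)  # prints [0]
```

A real query tool supports far richer patterns than a single parent-child relation; the sketch only illustrates why a syntactically annotated corpus allows searches that plain text search cannot express.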
