TIGER Corpus (and derivatives)

License agreement

 

 
Universität Stuttgart, Institut für Maschinelle Sprachverarbeitung
Lehrstuhl Prof. Dr. Jonas Kuhn
Pfaffenwaldring 5b, D-70569 Stuttgart, Germany

 

Universität Potsdam, Institut für Germanistik
Postfach 601553, D-14415 Potsdam, Germany

 

Universität des Saarlandes, FR 4.7 Computerlinguistik
Lehrstuhl Prof. Dr. Hans Uszkoreit
Postfach 151150, D-66041 Saarbrücken, Germany

("Licenser")
and "Licensee"

agree as follows:


1. Status of Licensee

Licensee confirms that Licensee is (part of) an academic or educational institution.

2. Product

The TIGER Treebank consists of approximately 900,000 tokens (50,000 sentences) of German newspaper text taken from the Frankfurter Rundschau. The corpus was annotated with parts of speech and syntactic structures in the TIGER project (DFG) in Potsdam, Saarbrücken and Stuttgart.

Copyright in all part-of-speech and structural annotations and in the corpus documentation is held by:

  • Universität Stuttgart, Institut für Maschinelle Sprachverarbeitung
  • Universität Potsdam, Institut für Germanistik
  • Universität des Saarlandes, FR 4.7 Computerlinguistik

 

The TIGER Treebank includes third party data as listed in the Appendix. Licenser and Licensee recognize that the copyrights of these data belong to the respective organisation, as mentioned in the Appendix.

3. License

Licenser grants Licensee a non-exclusive license to use the TIGER Treebank. Licensee agrees

  • to use the corpus only for non-commercial, non-profit research purposes;
  • to make no changes to the corpus;
  • and to acknowledge the use of the corpus in all publications reporting on results produced with the help of the TIGER Treebank.

 

Use of the corpus or use of data derived from the corpus for any commercial purposes requires explicit written agreement of Licenser.

Licenser and Licensee recognize that the copyrights of the third party data belong to the respective organisation, as mentioned in the Appendix.

4. Non-Disclosure

The TIGER Treebank and its documentation will be held in confidence by Licensee and will not be disclosed by Licensee to third parties, except for those parts of the corpus or its documentation that are explicitly marked for public use, and except for example sentences in scientific publications.

Licensee shall take all necessary precautions to ensure that no persons or institutions, other than persons employed by Licensee or working in the same research project as Licensee, gain access to the TIGER Treebank or parts thereof. Other persons or institutions desiring access to the corpus should be directed to Licenser to obtain separate license agreements.

5. Commencement and Duration

This Agreement will take effect from the date of signature and will continue until terminated either by Licensee or Licenser in accordance with Clause 6 below.

6. Termination

Licenser may terminate the agreement forthwith if the Licensee commits any material breach of this agreement or if the copyright holder raises an objection against the terms of the license agreement.

Licensee may terminate this agreement with 30 days' written notice.

On termination howsoever caused, the Licensee shall erase all copies of the TIGER Treebank.

7. Fee

This license is granted by Licenser to Licensee free of charge.

8. Disclaimer

This license is granted under the premise that the copyright holder of the raw data accepts the terms of the license agreement. If the copyright holder raises an objection against the terms of the license agreement, Licenser may terminate the agreement and Licensee shall then erase all copies of the TIGER Treebank.

The TIGER Treebank and its documentation are provided on an "as is" basis, with no guarantee of their veracity or accuracy. No liability is accepted for any damage caused by their use except in cases of mandatory liability.

Licenser hereby informs Licensee that, although considerable effort was spent on removing annotation errors, some errors still remain in the corpus.

The License Agreement shall be governed by German law.

Appendix: Acknowledgements

The raw text of the Frankfurter Rundschau as used in the corpus is copyrighted by the Frankfurter Rundschau:

Druck- und Verlagshaus Frankfurt am Main GmbH
Verlag der Frankfurter Rundschau
Große Eschenheimer Straße 16-18
D-60313 Frankfurt am Main