VII. Appendix

1. References

[Abeille2003] Abeillé, Anne; Clement, Lionel and Kinyon, Alexandra (2003): Building and using syntactically annotated corpora. Kluwer Academic Publishers, Dordrecht.

[BlackburnEtAl1993] Blackburn, Patrick; Gardent, Claire and Meyer-Viol, Wilfried (1993): Talking About Trees. In: Proceedings of the 6th Conference of the European Chapter of the Association for Computational Linguistics, pp. 21-29, Utrecht.

[Christ1994] Christ, Oliver (1994): A modular and flexible architecture for an integrated corpus query system. In: Proceedings of COMPLEX'94: 3rd Conference on Computational Lexicography and Text Research, pp. 23-32, Budapest.

[ChristEtAl1999] Christ, Oliver; Schulze, Bruno M. and König, Esther (1999): Corpus Query Processor (CQP). User's Manual. Institut für Maschinelle Sprachverarbeitung, University of Stuttgart.

[Doerre1996] Dörre, Jochen (1996): Feature-Logik und Semiunifikation. Dissertationen zur Künstlichen Intelligenz, infix-Verlag, Sankt Augustin.

[DoerreDorna1993] Dörre, Jochen and Dorna, Michael (1993): CUF - A Formalism for Linguistic Knowledge Representation. Deliverable R.1.2A, DYANA2.

[DoerreEtAl1996] Dörre, Jochen; Gabbay, Dov M. and König, Esther (1996): Fibred Semantics for Feature-based Grammar Logic. Journal of Logic, Language, and Information. Special Issue on Language and Proof Theory, vol. 5, pp. 387-422.

[DuchierNieren1999] Duchier, Denys and Niehren, Joachim (1999): Solving Dominance Constraints with Finite Set Constraint Programming. Technical report, Universität des Saarlandes, Programming Systems Lab.

[Emele1997] Emele, Martin (1997): Der TFS-Repräsentationsformalismus. Ph.D. thesis, Institut für Maschinelle Sprachverarbeitung, University of Stuttgart. Arbeitspapiere des IMS, vol. 3, no. 6.

[EmeleZajac1990] Emele, Martin and Zajac, Rémi (1990): A Fixed-Point Semantics for Feature Type Systems. In: Proceedings of the 2nd International Workshop on Conditional and Typed Rewriting Systems, Montreal.

[HoehfeldSmolka1988] Höhfeld, Markus and Smolka, Gert (1988): Definite Relations over Constraint Languages. LILOG-Report 53, IBM Deutschland, Stuttgart.

[KoenigLezius2002] König, Esther and Lezius, Wolfgang (2003): The TIGER language - A Description Language for Syntax Graphs. Formal Definition. Technical report, IMS, University of Stuttgart.

[Lezius2002] Lezius, Wolfgang (2002): Ein Suchwerkzeug für syntaktisch annotierte Textkorpora. Ph.D. thesis, IMS, University of Stuttgart. Arbeitspapiere des IMS, vol. 8, no. 4. http://www.ims.uni-stuttgart.de/projekte/corplex/paper/lezius/diss/

[Lezius2002b] Lezius, Wolfgang (2002): TIGERSearch - Ein Suchwerkzeug für Baumbanken. In: Stephan Busemann, editor: Proceedings der 6. Konferenz zur Verarbeitung natürlicher Sprache (KONVENS 2002), pp. 107-114, Saarbrücken.

[MarcusEtAl1993] Marcus, Mitchell; Santorini, Beatrice and Marcinkiewicz, Mary Ann (1993): Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, vol. 19, pp. 313-330.

[RogersEtAl1992] Rogers, James and Vijay-Shanker, K. (1992): Reasoning with Descriptions of Trees. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics. Newark, Delaware.

[Schieber1986] Shieber, Stuart M. (1986): An Introduction to Unification-Based Approaches to Grammar. Lecture Notes, Center for the Study of Language and Information, Stanford.

[Schmid1999] Schmid, Helmut (1999): YAP: Parsing and Disambiguation With Feature-Based Grammar. Ph.D. thesis, Institut für Maschinelle Sprachverarbeitung, University of Stuttgart.

[SkutEtAl1997] Skut, Wojciech; Krenn, Brigitte; Brants, Thorsten and Uszkoreit, Hans (1997): An Annotation Scheme for Free Word Order Languages. In: Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP), Washington, D.C.

[Smith2002] Smith, George (2002): A brief introduction to the TIGER Corpus Sampler. University of Potsdam.

[SteinerKallmeyer2002] Steiner, Ilona and Kallmeyer, Laura (2002): VIQTORYA - A Visual Query Tool for Syntactically Annotated Corpora. In: Proceedings of LREC 2002, Las Palmas, Gran Canaria.

[SterlingShapiro1986] Sterling, Leon and Shapiro, Ehud (1986): The Art of Prolog: Advanced Programming Techniques. MIT Press, Cambridge, Mass.

[WallEtAl1996] Wall, Larry; Christiansen, Tom; Schwartz, Randal L. and Potter, Stephen (1996): Programming Perl. O'Reilly, Cambridge.

[Voormann2002] Voormann, Holger (2002): Grafische Eingabe von Suchanfragen in TIGERSearch. Diploma thesis, Fakultät für Informatik, Universität Stuttgart.

2. Acknowledgements

2.1 Project background

The TIGERSearch software suite has been developed in the context of the following projects:

DEREKO project funded by the Land Baden-Württemberg

TIGER project funded by the Deutsche Forschungsgemeinschaft

Esther König's post-doc research

Wolfgang Lezius' PhD and post-doc research

Holger Voormann's diploma thesis and PhD research

2.2 Third party software

The following software is part of the TIGERSearch distribution:

Software                   Organization                                        URL
batik.jar                  Apache Software Foundation, Apache XML Project      http://xml.apache.org/batik/
fop.jar                    Apache Software Foundation, Apache XML Project      http://xml.apache.org/fop/
jakarta-oro.jar            Apache Software Foundation, Apache Jakarta Project  http://jakarta.apache.org/oro/
jh.jar                     Sun Microsystems, Inc.                              http://java.sun.com/products/javahelp/
jdom.jar                   JDOM Project                                        http://www.jdom.org
log4j.jar                  Apache Software Foundation, Apache Jakarta Project  http://jakarta.apache.org/log4j/
poi.jar                    Apache Software Foundation, Apache Jakarta Project  http://jakarta.apache.org/poi/
xalan.jar                  Apache Software Foundation, Apache XML Project      http://xml.apache.org/xalan-j/
xercesImpl.jar             Apache Software Foundation, Apache XML Project      http://xml.apache.org/xerces-j/
xml-apis.jar               Apache Software Foundation, Apache XML Project      http://xml.apache.org/xerces-j/
Java Runtime Environments  Sun Microsystems, Inc.                              http://java.sun.com/j2se/

The following software has been used to generate parts of the functionality of TIGERSearch:

Software  Organization                   URL
JavaCC    Sun Microsystems Laboratories  http://www.experimentalstuff.com

The following software has been used to develop the TIGERSearch software:

Software  Organization                URL
Ant       Apache Software Foundation  http://ant.apache.org
Eclipse   The Eclipse Consortium      http://www.eclipse.org

2.3 Third party copyright statements

"This product includes software developed by the Apache Software Foundation (http://www.apache.org/)."

"This product includes software developed by the JDOM Project (http://www.jdom.org/)."

"This product includes code licensed from RSA Security, Inc." (JRE)

"Some portions licensed from IBM are available at http://oss.software.ibm.com/icu4j/" (JRE)

2.4 Trademarks

Adobe is a registered trademark of Adobe Systems, Inc.

Excel is a registered trademark of Microsoft Corporation.

InstallAnywhere is a registered trademark of Zero G Software, Inc.

Mac OS is a registered trademark of Apple Computer, Inc.

PowerPoint is a registered trademark of Microsoft Corporation.

Solaris and Java are trademarks of Sun Microsystems, Inc.

StuffIt Expander is a registered trademark of Aladdin Systems, Inc.

Windows is a registered trademark of Microsoft Corporation.

All other marks are the property of their respective owners.

2.5 Third party corpus samplers

A number of institutions have kindly agreed that excerpts from their text corpora may be distributed with TIGERSearch. The current version of TIGERSearch includes the following corpus samplers (languages in alphabetical order):

Chinese

Chinese Treebank sampler

105 corpus graphs, University of Pennsylvania, distributed by LDC

English

Penn Treebank: Brown Corpus and Switchboard Corpus samplers

200 sentences each, University of Pennsylvania, distributed by LDC

Penn-Helsinki Parsed Corpus of Middle English (PPCME2 Corpus) sampler

200 sentences, University of Pennsylvania / PPCME2 Project

Susanne and Christine Corpus samplers

200 sentences each, Sussex University / Susanne and Christine projects

VerbMobil Corpus sampler

250 sentences, see German VerbMobil sampler

German

DEREKO Corpus sampler

250 sentences, SfS, University of Tübingen and IMS, University of Stuttgart / DEREKO project

IMS chunking and parsing tools

The tools LoPar, TreeTagger, and YAC processed the same technical text (about 250 sentences). IMS, University of Stuttgart

Negra Corpus sampler

250 sentences, Department of Computational Linguistics, Universität des Saarlandes / Negra project

TIGER Corpus sampler

200 sentences, Institut für Germanistik, University of Potsdam / Department of Computational Linguistics, Universität des Saarlandes / IMS, University of Stuttgart / TIGER project

VerbMobil Corpus sampler

250 sentences, SfS, University of Tübingen / VerbMobil Project, distributed by IPSK, Ludwig-Maximilians-Universität München

Japanese

VerbMobil Corpus sampler

250 sentences, see German VerbMobil sampler

Korean

Korean Treebank sampler

125 corpus graphs, University of Pennsylvania, distributed by LDC