Heike Zinsmeister

Prof. Dr. Heike Zinsmeister

Heike Zinsmeister
University of Hamburg
Department for German Language and Literature
Von-Melle-Park 6
20146 Hamburg

I was a senior researcher at the CLARIN-D center in Stuttgart from April 2012 to August 2013. The main aim of CLARIN-D is to create an infrastructure of language resources and language processing tools for researchers in the humanities and social sciences.

Since September 2013, I have been professor for linguistics of German / corpus linguistics at the University of Hamburg. You can reach my new homepage by following this link.


Will be added soon.

For the time being, see my German webpage.


 See my German webpage. Some of the courses are linked to slides and other teaching material.



  • Dipper, Stefanie and Heike Zinsmeister. Syntactic and semantic categories in annotating abstract anaphora -- a survey. 35 pages.


  • Zinsmeister, Heike. Syntax and Corpora. To appear in: Artemis Alexiadou and Tibor Kiss (eds.) Syntax. An International Handbook (series Handbücher zur Sprach- und Kommunikationswissenschaft). Berlin: Mouton de Gruyter. 20 pages.


  • Breckle, Margit and Heike Zinsmeister. 2013. L1 Transfer versus fixed chunks: A learner corpus-based study on L2 German. In: Sylviane Granger, Gaëtanelle Gilquin and Fanny Meunier (eds.) Twenty Years of Learner Corpus Research. Looking Back, Moving Ahead. Corpora and Language in Use – Proceedings 1. Louvain-la-Neuve: Presses universitaires de Louvain, 25–35.
  • Zinsmeister, Heike. 2013. Corpus-based modeling of the semantic transparency of noun-noun compounds. In: Holden Härtl (ed.) Interfaces of Morphology. A Festschrift for Susan Olsen. Berlin: Akademie Verlag. 303–321.
  • Dipper, Stefanie, Heike Zinsmeister and Bonnie Webber (eds.). 2013. Beyond Semantics: the challenges of annotating pragmatic and discourse phenomena. Dialogue and Discourse 4 (2) (special issue).


  • Zinsmeister, Heike and Margit Breckle. 2012. The ALeSKo learner corpus: design – annotation – quantitative analyses. In: Thomas Schmidt and Kai Wörner (eds.) Multilingual Corpora and Multilingual Corpus Analysis. Hamburg Studies in Multilingualism. Benjamins. 71-96. [draft version]
  • Zinsmeister, Heike and Eva Smolka. 2012. Corpus-based evidence for approximating semantic transparency of complex verbs. In: Kay-Michael Würzner and Edmund Pohl (eds.): Lexical Resources in Psycholinguistic Research. Potsdam Cognitive Science Series (vol. 3). Universitätsverlag Potsdam, 45-59. [pdf]
  • Zinsmeister, Heike, Stefanie Dipper and Melanie Seiss. 2012. Abstract pronominal anaphors and label nouns in German and English: Selected case studies and quantitative investigations. In TC3. Translation: Computation, Corpora, Cognition. (2) 1, 47-80 [pdf].
  • Dipper, Stefanie, Melanie Seiss, and Heike Zinsmeister. 2012. The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), 138-145. Istanbul, Turkey. [pdf]
  • Dipper, Stefanie and Heike Zinsmeister. 2012. Annotating Abstract Anaphora. Language Resources and Evaluation 46 (1), 37-52. (Published Online First on 3 September 2011, see Dipper and Zinsmeister 2011b).
  • Breckle, Margit and Heike Zinsmeister. 2012. A corpus-based contrastive analysis of local coherence in L1 and L2 German. In: Vladimir Karabalić, Melita Aleksa Varga and Leonard Pon (eds.) Discourse and Dialogue / Diskurs- und Dialog. Frankfurt am Main [et al.]: Peter Lang Verlag, 235-250. [draft version]


  • Dipper, Stefanie, Christine Rieger, Melanie Seiss and Heike Zinsmeister. 2011. Abstract Anaphors in German and English. In: Antonio Branco, Sobha L., Iris Hendrickx and Ruslan Mitkov (eds.). Selected Papers from the 8th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2011. Lecture Notes in Computer Science. Springer, 96-107. [Pre-print, [pdf], final version at

  • Zinsmeister, Heike. 2011. Chancen und Probleme der Nutzung von Korpora, Taggern und anderen Sprachressourcen in Seminaren. In: Maja Bärenfänger, Frank Binder, Henning Lobin, Harald Lüngen & Maik Stührenberg (eds.): Language Resources and Technologies in Learning and Teaching / Sprachressourcen in der Lehre (Journal for Language Technology and Computational Linguistics 1/2011, special issue), 67-80. [pdf, bib]

  • Dipper, Stefanie and Heike Zinsmeister. 2011b. Annotating Abstract Anaphora. Language Resources and Evaluation. [pre-print version] Online First, 3 September 2011. (Revised and extended version of Dipper and Zinsmeister 2009a).

  • Dipper, Stefanie and Heike Zinsmeister (eds.). 2011a. Proceedings of the Workshop Beyond Semantics: Corpus-based Investigations of Pragmatic and Discourse Phenomena, Göttingen, Germany, 23-25 February 2011. Bochumer Linguistische Arbeitsberichte (3). [pdf]

  • Stefanie Dipper, Maike Müller, Christine Rieger, Melanie Seiss and Heike Zinsmeister. 2011. How to Refer to Abstract Objects - A Contrastive Analysis of Abstract Anaphora in English and German. Poster at the poster session of the Computational Linguistics section, DGfS Annual Meeting 2011, Göttingen. [pdf]


  • Zinsmeister, Heike and Margit Breckle. 2010c. Starting a sentence in L2 German – Discourse annotation of a learner corpus. In: Manfred Pinkal, Ines Rehbein, Sabine Schulte im Walde and Angelika Storrer (eds.), Semantic Approaches in Natural Language Processing: Proceedings of the Conference on Natural Language Processing 2010. Saarbrücken: universaar. 181–185. [link to the volume]

  • Breckle, Margit and Heike Zinsmeister. 2010b. Zur lernersprachlichen Generierung referierender Ausdrücke in argumentativen Texten. In: Dirk Skiba (ed.) "Textmuster: schulisch-universitär-kulturkontrastiv." Frankfurt/Main: Peter Lang. 79-101. [pre-print version]

  • Dipper, Stefanie and Heike Zinsmeister. 2010. Towards a standard for annotating abstract anaphora. "Proceedings of the LREC 2010 workshop on Language Resources and Language Technology Standards", 54-59, Valletta, Malta. [pdf]
    (NOTE: this is a fixed version! --- by mistake, the study by Hedberg et al. 2007 is missing from the survey table in the workshop proceedings)

  • Lemnitzer, Lothar and Heike Zinsmeister. 2010. "Korpuslinguistik – Eine Einführung." narr studienbücher. 2nd updated edition. Tübingen: Gunter Narr Verlag. (webpage)

  • Zinsmeister, Heike and Margit Breckle. 2010a. "ALeSKo - an annotated learner corpus". Poster at the poster session of the Computational Linguistics section of the Deutsche Gesellschaft für Sprachwissenschaft (DGfS). Berlin. [pdf]

  • Zinsmeister, Heike. 2010b. "Korpora". In: K.-U. Carstensen, Ch. Ebert, C. Ebert, S. Jekat, R. Klabunde and H. Langer (eds.) Computerlinguistik und Sprachtechnologie. Eine Einführung. 3rd edition. Heidelberg: Spektrum Akademischer Verlag. 482-491. [pre-print version]

  • Zinsmeister, Heike. 2010a. Review of: Karl-Heinz Best. Quantitative Linguistik. Eine Annäherung. 3rd, substantially revised and expanded edition. Göttingen: Peust & Gutschmidt Verlag 2006. Zeitschrift für Rezensionen zur germanistischen Sprachwissenschaft (ZRS) 2 (1). 25-31.


  • Dipper, Stefanie and Heike Zinsmeister. 2009b. "The Role of the German Vorfeld for Local Coherence". In: Christian Chiarcos, Richard Eckart de Castilho and Manfred Stede (eds.) Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically. Tübingen: Narr. 69–79. Pre-print draft: [pdf]

  • Dipper, Stefanie and Heike Zinsmeister. 2009a. "Annotating Discourse Anaphora". In Proceedings of the Third Linguistic Annotation Workshop, Association for Computational Linguistics, Suntec, Singapore. 166–169. [pdf, bib]


  • Heike Zinsmeister, Andreas Witt, Sandra Kübler, Erhard Hinrichs. 2008. Linguistically Annotated Corpora: Quality Assurance, Reusability and Sustainability. In: Anke Lüdeling and Merja Kytö (eds.) "Corpus Linguistics. An International Handbook" vol. 1 (series Handbücher zur Sprach- und Kommunikationswissenschaft). Berlin: Mouton de Gruyter. 759-776. Pre-print draft: [pdf]

  • Heike Zinsmeister. 2008. Freshmen's CL Curriculum: The Benefits of Redundancy. In: "Proceedings of The Third Workshop on Issues on Teaching Computational Linguistics", held in conjunction with ACL 2008: HLT. June 19-20, Columbus, OH. [pdf]

  • Jan-Philipp Söhn, Heike Zinsmeister and Georg Rehm. 2008. "Requirements of a User-Friendly, General-Purpose Corpus Query Interface." In: Proceedings of the LREC 2008 Workshop Sustainability of Language Resources and Tools for Natural Language Processing, May 31, Marrakech, Morocco, 2008. [pdf]

  • Heike Zinsmeister. 2008. Improving syntactic analysis by parse reranking. In: "Proceedings of International Conference on Linguistic Evidence 2008", Jan 30-Feb 1, Tübingen, Germany, 2008. [pdf]

  • Lemnitzer, Lothar and Heike Zinsmeister. 2008. Review of: Tony McEnery, Richard Xiao, Yukio Tono: Corpus-Based Language Studies. Routledge, London and New York 2006 (= Routledge Applied Linguistics). "Deutsch als Fremdsprache" 45 (4): 243–244.


  • Georg Rehm, Andreas Witt, Heike Zinsmeister, Johannes Dellert. 2007. Masking Treebanks for the Free Distribution of Linguistic Resources and Other Applications. In "Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories (TLT 2007)". December 7–8, Bergen, Norway, 2007, 127–138. (proceedings)

  • Yannick Versley, Holger Wunsch, Heike Zinsmeister. 2007. A Pilot Study on Computer-aided Coreference Annotation. In "RANLP 2007 Workshop on Computer Aided Language Processing", Borovets, Bulgaria. [pdf]

  • Georg Rehm, Andreas Witt, Heike Zinsmeister, Johannes Dellert. 2007. Corpus Masking: Legally Bypassing Licensing Restrictions for the Free Distribution of Text Collections. In "Proceedings of Digital Humanities 2007", University of Illinois, 166-169. (proceedings)
  • Heike Zinsmeister. 2007. Parsing of Coordinate Structures -- A Preliminary Study. Poster at DGFS-CL, Siegen.

  • Heike Zinsmeister. 2007. Kompetenzen aktiv fördern. In: Manfred Künzel et al.: "Aktive Studierende, kompetenzorientierte Ausbildung, lernende Lehrende: Fallbeispiele aus Tübingen". Tübinger Beiträge zur Hochschuldidaktik 3,2. (link to the volume)


  • Yannick Versley and Heike Zinsmeister. 2006. From Surface Dependencies towards Deeper Semantic Representations. In "Proceedings of the Fifth Workshop on Treebanks and Linguistic Theories" (TLT 2006), 115-126. [pdf] Due to technical issues, the title in the conference proceedings has been changed to "Semantic Representations".

  • A. Meyers, A. C. Fang, L. Ferro, S. Kübler, T. Jia-Lin, M. Palmer, M. Poesio, A. Dolbey, K. K. Schuler, E. Loper, H. Zinsmeister, G. Penn, N. Xue, E. Hinrichs, J. Wiebe, J. Pustejovsky, D. Farwell, E. Hajicova, B. Dorr, E. Hovy, B. A. Onyshkevych, L. Levin. 2006. Annotation Compatibility Working Group Report. In "Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora" 2006, 38-53, Sydney, July 2006. [pdf]

  • Heike Zinsmeister. 2006. Treebank Data as Linguistic Evidence? Coordination in TüBa–D/Z. In "Pre-Proceedings of the International Conference on Linguistic Evidence", Tübingen, February 2006. [pdf]

  • Lemnitzer, Lothar and Heike Zinsmeister. 2006. "Korpuslinguistik – Eine Einführung." narr studienbücher. Tübingen: Gunter Narr Verlag. (webpage)

  • Heike Telljohann, Erhard Hinrichs, Sandra Kübler and Heike Zinsmeister. 2006. "Stylebook for the Tübingen Treebank of Written German (TüBa–D/Z)". Revised version. Technical report, Seminar für Sprachwissenschaft, Universität Tübingen. [pdf]

  • Contributions to the entries of "Korpuslinguistik and Perl". In Irene Cramer and Sabine Schulte im Walde (eds.), "Studienbibliographie Computerlinguistik und Sprachtechnologie". By order of the Institut für Deutsche Sprache, Mannheim. Julius Groos Verlag Brigitte Narr GmbH, Tübingen.


  • Erhard Hinrichs, Sandra Kübler, Karin Naumann, Heike Telljohann, Julia Trushkina and Heike Zinsmeister. 2005. "Recent Developments in Linguistic Annotations of the TüBa–D/Z Treebank (Poster)." 27th Annual Meeting of the Deutsche Gesellschaft für Sprachwissenschaft, Cologne, February 2005.

  • Heike Telljohann, Erhard Hinrichs, Sandra Kübler and Heike Zinsmeister. 2005. "Stylebook for the Tübingen Treebank of Written German (TüBa–D/Z)". Revised version. Technical report, Seminar für Sprachwissenschaft, Universität Tübingen.


  • Krenn, Brigitte, Stefan Evert, and Heike Zinsmeister. 2004. Determining intercoder agreement for a collocation identification task. In "Proceedings of KONVENS 2004", Vienna, Austria. [pdf, .ps.gz]

  • Zinsmeister, Heike and Ulrich Heid. 2004. "Collocations of Complex Nouns: Evidence for Lexicalisation." In "Proceedings of KONVENS 2004". Vienna, Austria. [pdf]


  • Zinsmeister, Heike and Ulrich Heid. 2003. Significant Triples: Adjective+Noun+Verb Combinations. In "Proceedings of Complex 2003", Budapest, Hungary. [postscript, pdf]

  • Zinsmeister, Heike and Ulrich Heid. 2003. Identifying predicatively used adverbs by means of a Statistical Grammar Model. In Dawn Archer, Paul Rayson, Andrew Wilson and Tony McEnery (eds.), "Proceedings of the Corpus Linguistics 2003 Conference". UCREL, Lancaster University, 932-939. [pdf]

  • Brandner, Ellen and Heike Zinsmeister (eds.). 2003. "New Perspectives on Case and Case Theory", CSLI publications, Stanford (distributed by The University of Chicago Press).


  • Reyle, Uwe, Jasmin Saric, Philipp Cimiano and Heike Zinsmeister. 2002. Ontology-driven disambiguation of syntactic and semantic ambiguities in GenIE. Poster at "Workshop on Ontology for Biology", European Media Laboratory, Heidelberg.

  • Zinsmeister, Heike; Kuhn, Jonas and Dipper, Stefanie. 2002. TIGER TRANSFER -- Utilizing LFG Parses for Treebank Annotations. In "Proceedings of the LFG02 Conference", CSLI publications. [url, postscript, pdf]

  • Zinsmeister, Heike and Heid, Ulrich. 2002. Collocations of Complex Words: Implications for the Acquisition with a Stochastic Grammar. In "Proceedings of the International Workshop on 'Computational Approaches to Collocations'", Vienna. [postscript, pdf]


  • Zinsmeister, Heike, Jonas Kuhn, Bettina Schrader and Stefanie Dipper. 2001. TIGER Transfer -- From LFG Structures to the TIGER Treebank. Technical report IMS, University of Stuttgart. [html, postscript, pdf]


  • Emele, Martin C., Michael Dorna, Anke Lüdeling, Heike Zinsmeister and Christian Rohrer. 2000. Semantic-Based Transfer. In Wolfgang Wahlster (ed.), "Verbmobil: Foundations of Speech-to-Speech Translation". Series: Artificial Intelligence. 359-376. Springer Verlag, Berlin [a.o.]. [html, postscript]

Recent events