Foundational Course
Departament de Traducció i Filologia
Universitat Pompeu Fabra
April 16-20, 2007

 

Introduction to Corpus Resources, Annotation and Access

 

Sabine Schulte im Walde
Institut für Maschinelle Sprachverarbeitung
Universität Stuttgart


Course Description

This course presents an introduction to corpus resources, combining the theoretical background of corpora, resource examples, annotation levels, and tools for exploitation. First, we motivate corpus resources for empirical linguistics, and describe the properties/problems of corpus data, the levels of annotation, and standardisation efforts. We then relate the annotation levels to appropriate tools and uses for exploitation:

Schedule (April 16-20)


Course Material


Acknowledgements

Most of the course material was adopted from an earlier version of this course at ESSLLI 2006. Thanks to my collegue Heike Zinsmeister who prepared the introduction and the lectures on syntactic annotation and the web as corpus!