
Sarah Schulz

Phone +49 711 685-81394
Email
Address
Universität Stuttgart
Institut für Maschinelle Sprachverarbeitung
Pfaffenwaldring 5 b
70569 Stuttgart
Germany

Office hours

by appointment

I am a computational linguist with a humanities background in theater and media studies and German studies. Digital Humanities lets me combine my interest in computer science with my curiosity about questions in the humanities.

I am a doctoral candidate at the Centrum für reflektierte Textanalyse (CRETA). CRETA website

I can also be found on GitHub.


Research
  • non-standard text processing
  • Digital Humanities
  • historical language processing
Teaching
Publications

2017

  • Sarah Schulz and Jonas Kuhn. 2017. Multi-modular domain-tailored OCR post-correction. Empirical Methods in Natural Language Processing (EMNLP) 2017. Copenhagen, 2017.
  • Nora Echelmeyer, Nils Reiter, Sarah Schulz. 2017. PoS-Tagger für „das“ Mittelhochdeutsche. In Book of Abstracts of DHd 2017, Bern, Switzerland, 2017.
  • Nils Reiter, Sarah Schulz, Gerhard Kremer, Roman Klinger, Gabriel Viehhauser, Jonas Kuhn. 2017. Teaching Computational Aspects in the Digital Humanities Program at University of Stuttgart – Intentions and Experiences. Teaching NLP for Digital Humanities, Workshop at GSCL, 43-48.
  • Derek Doran, Sarah Schulz, and Tarek R. Besold. 2017. What Does Explainable AI Really Mean? A New Conceptualization of Perspectives. Comprehensibility and Explanation in AI and ML, Workshop at AI*IA.

2016

  • Sarah Schulz, Guy De Pauw, Orphée De Clercq, Bart Desmet, Véronique Hoste, Walter Daelemans, and Lieve Macken. 2016. Multimodular Text Normalization of Dutch User-Generated Content. ACM Trans. Intell. Syst. Technol. 7, 4, Article 61 (July 2016), 22 pages. DOI: http://dx.doi.org/10.1145/2850422
  • Schulz, S. & Kuhn, J. (2016). Learning from Within? Comparing PoS Tagging Approaches for Historical Text. In N. Calzolari (Conference Chair), K. Choukri, T. Declerck, M. Grobelnik, B. Maegaard, J. Mariani, A. Moreno, J. Odijk & S. Piperidis (eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), May, Slovenia: European Language Resources Association (ELRA).
  • Schulz, S. & Reiter, N. (2016). Authorship Attribution of Mediaeval German Text: Style and Contents in Apollonius von Tyrland. Proceedings of Digital Humanities 2016 (pp. 883-885), July, Kraków.
  • Schulz, S. & Keller, M. (2016). Code-Switching Ubique Est - Language Identification and Part-of-Speech Tagging for Historical Mixed Text. LaTeCH@ACL, August, Berlin: Association for Computational Linguistics.
  • Çetinoğlu, Ö., Schulz, S. & Vu, N. T. (2016). Challenges of Computational Processing of Code-Switching. Proceedings of EMNLP Workshop on Computational Approaches to Linguistic Code Switching (CALCS 2016) @EMNLP, November, Austin, Texas, USA.

2014

  • Orphée De Clercq, Sarah Schulz, Bart Desmet, and Véronique Hoste. Towards Shared Datasets for Normalization Research. In Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, May 2014. European Language Resources Association (ELRA).
  • Bart Desmet, Orphée De Clercq, Marjan Van de Kauter, Sarah Schulz, Cynthia Van Hee, and Véronique Hoste. Taaltechnologie 2.0: sentimentanalyse en normalisatie, pages 157–161. Beschouwingen uit een talenhuis: opstellen over onderwijs en onderzoek in de vakgroep Vertalen, Tolken en Communicatie aangeboden aan Rita Godyns. Academia Press, 2014.
  • Sarah Schulz. Named-Entity Recognition for User-Generated Content. In Proceedings of European Summer School in Logic, Language and Information 2014 Student Session. Springer, 2014.

2013

  • Sarah Schulz, Verena Lyding, and Lionel Nicolas. Compiling a diverse web corpus for South Tyrolean German - STirWaC. In Proceedings of the 8th Web as Corpus Workshop, pages 37–45, Lancaster, UK, 2013.
  • Orphée De Clercq, Sarah Schulz, Bart Desmet, Els Lefever, and Véronique Hoste. Normalization of Dutch User-Generated Content. In Proceedings of the 9th International Conference on Recent Advances in Natural Language Processing, Hissar, Bulgaria, 2013.

2012

  • Marisa Delz, Benjamin Layer, Sarah Schulz, and Johannes Wahle. Overgeneralization of verbs: The change of the German verb system. In Proceedings of the 9th International Conference on the Evolution of Language, Evolang IX, pages 96–103, Kyoto, Japan, March 2012.
Resources
  • Middle High German POS Tagger Model: MHG Pos
  • Language identification and POS tagging for mixed Middle English - Latin text: web application
  • OCR and OCR post-correction: web application (login required; please contact me for access)
Curriculum vitae