University of Stuttgart - Parsing and Machine Translation Reading Group

Invitation

Are you interested in parsing and/or machine translation?

Would you like to learn about integrating deep linguistic representations into statistical machine translation?

Do you want to discuss and critique published research with computational linguists, linguists and computer scientists?

We are organizing a reading group on (statistical) parsing and machine translation. Please join us whether you call what you do computational linguistics, linguistics, natural language processing, artificial intelligence or machine learning. Everyone is invited. The language of the reading group is English.

After several introductory lectures by Helmut and Alex we will alternate informal presentations of research papers by members of the group. Our initial goal is to reach the point where we are able to read about and discuss new ideas in statistical machine translation research involving the integration of deep linguistic representations

NEW: Our new meeting time is Wednesday, 15:45. We will meet in the IMS phonetics lab (top floor, last room on the right in the back of the building).

Schedule

Future and Present


Past

Wed July 22nd, 15:45, 3.11 (IMS Phonetik Labor) Alex Fraser: David Chiang. A hierarchical phrase-based model for statistical machine translation. ACL 2005 (best paper) paper
Wed July 8th, 15:45, 3.11 (IMS Phonetik Labor) Alex Fraser: Christoph Tillmann. A Unigram Orientation Model for Statistical Machine Translation. HLT-NAACL 2004 short paper. paper
June 4th, 10:30, Office Hans Kamp Fabienne Braune: Dekai Wu. A polynomial-time algorithm for statistical machine translation. ACL 1996. paper (ps) (pdf)
May 28th, 9:45-11:15, 12.21 Im Rahmen des Hauptseminars Maschinelle Übersetzung I (Heid), spricht PD Dr. Kurt Eberle (Heidelberg/Stuttgart): "Aktuelle Architekturfragen in der Maschinellen Übersetzung: semantischer Transfer und Integration statistischer Information in 'translate'"
May 14th, 10:30 Alex Balabanov: Kenji Yamada and Kevin Knight. A syntax-based statistical translation model. ACL 2001. paper
May 7th, 10:30 Hassan Sajjad: Yaser Al-Onaizan and Kevin Knight. Translating Named Entities Using Monolingual and Bilingual Resources. ACL 2002. paper
April 30th, 10:30 Hassan Sajjad, Alex Fraser: EACL 2009 report (interesting papers), organizational meeting
March 26th, 10:30 Aoife Cahill, Alex Fraser, Hassan Sajjad: Practice talks for EACL Papers: Cahill Fraser1 Fraser2 Sajjad
March 19th, 10:30 Helmut Schmid: Liang Huang. Forest Reranking: Discriminative Parsing with Non-Local Features. ACL 2008 (1 of 2 outstanding paper awards). paper
March 5th, 10:30 Alex Fraser: Chris Quirk, Arul Menezes, Colin Cherry. Dependency Treelet Translation: Syntactically Informed Phrasal SMT. ACL 2005.
Part II: decoding, experiments, discussion.
Feb 26th, 10:30 Our first paper on a non-preprocessing approach to syntactic SMT!
Alex Fraser: Chris Quirk, Arul Menezes, Colin Cherry. Dependency Treelet Translation: Syntactically Informed Phrasal SMT. ACL 2005.
Part I: model and training.
paper
Feb 19th, 10:30 Two papers on preprocessing approaches for coping with composita and rich inflection:
Fabienne Fritzinger: Empirical Methods for Compound Splitting. Philipp Koehn and Kevin Knight. EACL 2003.
Alex Fraser: Improving Statistical MT Through Morphological Analysis. Sharon Goldwater and David McClosky. EMNLP 2005.
composita paper inflection paper
Feb 12th, 10:30 Hassan Sajjad: Michael Collins and Philipp Koehn and Ivona Kucerova. Clause Restructuring for Statistical Machine Translation. ACL 2005. paper
Feb 5th, 10:30 Alex Fraser: Franz Josef Och. Minimum Error Rate Training for Statistical Machine Translation. ACL 2003. paper
Jan 22nd, 10:30 Amit Dubey: Franz Josef Och, Hermann Ney. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation. ACL 2002 (best paper). paper
Dec 18th, 10:30 Amit Dubey: Hoifung Poon and Pedro Domingos. EMNLP 2008. Joint Unsupervised Coreference Resolution with Markov Logic. paper
Dec 11th, 10:30 Amit Dubey: Richardson and Domingos. Machine Learning, 62, 107-136, 2006. Markov Logic Networks. paper
Dec 4th, 10:30, IMS Mitarbeiter Zimmer Amit Dubey: Agirre, Baldwin and Martinez. ACL 2008. Improving Parsing and PP Attachment Performance with Sense Information Discriminative Reranking for Natural Language Parsing. paper
Nov 27th, 10:30, IMS Mitarbeiter Zimmer Aoife Cahill: Michael Collins. Discriminative Reranking for Natural Language Parsing. ICML 2000. paper
Nov 20th, 10:30, IMS Mitarbeiter Zimmer Alex Balabanov: Michael Collins. Three Generative, Lexicalised Models for Statistical Parsing. ACL/EACL 1997. You might also be interested in the slides for this paper or the longer Computational Linguistics journal paper (see Michael Collins' homepage) paper
Nov 13th, 10:30, IMS Mitarbeiter Zimmer Nadir Durrani: Statistical Phrase-Based Translation (HLT-NAACL 2003). Philipp Koehn, Franz Josef Och, Daniel Marcu Statistical Phrase-Based Translation
Nov 6th, 10:30, IMS Mitarbeiter Zimmer Alex Fraser: BLEU: a Method for Automatic Evaluation of Machine Translation (ACL 2002). Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu BLEU paper
October 23rd, 10:30, 3.11 (IMS Phonetik Labor) Christian Scheible: Introduction to Language Modeling. For slides and reference list, please click here. Chen and Goodman LM tutorial (focus on interpolation and Kneser/Ney smoothing)
October 9th, 10:30, 12.21 Martin Forst: Grammatical Machine Translation II. Martin Forst will discuss recent work on hybrid MT using LFG. For a full abstract, click here.
October 2nd, 10:30, 12.21 Helmut Schmid: PCFG parsing algorithms continued No required reading.
September 25th, 10:00, 12.21 Helmut Schmid: PCFG parsing algorithms No required reading.
August 14th, 10:00, 12.21 (IMS lecture hall) Helmut Schmid: Introduction to CFG parsing algorithms No required reading.
August 7th, 10:00, 12.21 (IMS lecture hall) Helmut Schmid: Introduction to HMM tagging No required reading. Manning and Schuetze HMM Chapter recommended.
July 31st, 10:00, 12.21 (IMS lecture hall) Alex Fraser: Introduction to statistical machine translation - Part 3, phrase-based modeling and decoding no required reading
July 21st to July 25th EMA Summer School, website is here. First two SMT lectures will be repeated on Tuesday, along with a practice assignment (implementing IBM Model 1). The lecture from the next reading group meeting (phrase-based modeling and decoding) will be on Wed at 14:00, followed by a practice assignment on decoding. Thursday morning's lecture will consist of a discussion of the assignments and a brief overview of some more advanced topics.
July 17th, 10:00, 3.11 (IMS Phonetik Labor) Alex Fraser: Introduction to statistical machine translation - Part 2. Bitext alignment (extracting lexical knowledge from parallel corpora) Kevin Knight's SMT Tutorial
July 10th, 10:00, 3.11 (IMS Phonetik Labor) Alex Fraser: Introduction to statistical machine translation - Part 1. I will define the MT problem and talk about evaluation. I will also discuss parallel corpora and sentence alignment and give a brief overview of statistical machine translation (SMT). Kevin Knight's tutorial is recommended, but not necessary until next week. Kevin Knight's SMT Tutorial

Organizers

Organizers: Alex Fraser and Helmut Schmid

Email Addresses: SubstituteLastName@ims.uni-stuttgart.de

University of Stuttgart

SFB 732 - Incremental Specification in Context

Institute for Natural Language Processing (IMS/IfNLP)