Are you interested in parsing and/or machine translation?
Would you like to learn about integrating deep linguistic representations into statistical machine translation?
Do you want to discuss and critique published research with computational linguists, linguists and computer scientists?
We are organizing a reading group on (statistical) parsing and machine translation. Please join us whether you call what you do computational linguistics, linguistics, natural language processing, artificial intelligence or machine learning. Everyone is invited. The language of the reading group is English.
After several introductory lectures by Helmut and Alex we will alternate informal presentations of research papers by members of the group. Our initial goal is to reach the point where we are able to read about and discuss new ideas in statistical machine translation research involving the integration of deep linguistic representations
NEW: Our new meeting time is Wednesday, 15:45. We will meet in the IMS phonetics lab (top floor, last room on the right in the back of the building).
Past
| Wed July 22nd, 15:45, 3.11 (IMS Phonetik Labor) | Alex Fraser: David Chiang. A hierarchical phrase-based model for statistical machine translation. ACL 2005 (best paper) | paper |
| Wed July 8th, 15:45, 3.11 (IMS Phonetik Labor) | Alex Fraser: Christoph Tillmann. A Unigram Orientation Model for Statistical Machine Translation. HLT-NAACL 2004 short paper. | paper |
| June 4th, 10:30, Office Hans Kamp | Fabienne Braune: Dekai Wu. A polynomial-time algorithm for statistical machine translation. ACL 1996. | paper (ps) (pdf) |
| May 28th, 9:45-11:15, 12.21 | Im Rahmen des Hauptseminars Maschinelle Übersetzung I (Heid), spricht PD Dr. Kurt Eberle (Heidelberg/Stuttgart): "Aktuelle Architekturfragen in der Maschinellen Übersetzung: semantischer Transfer und Integration statistischer Information in 'translate'" | |
| May 14th, 10:30 | Alex Balabanov: Kenji Yamada and Kevin Knight. A syntax-based statistical translation model. ACL 2001. | paper |
| May 7th, 10:30 | Hassan Sajjad: Yaser Al-Onaizan and Kevin Knight. Translating Named Entities Using Monolingual and Bilingual Resources. ACL 2002. | paper |
| April 30th, 10:30 | Hassan Sajjad, Alex Fraser: EACL 2009 report (interesting papers), organizational meeting | |
| March 26th, 10:30 | Aoife Cahill, Alex Fraser, Hassan Sajjad: Practice talks for EACL | Papers: Cahill Fraser1 Fraser2 Sajjad |
| March 19th, 10:30 | Helmut Schmid: Liang Huang. Forest Reranking: Discriminative Parsing with Non-Local Features. ACL 2008 (1 of 2 outstanding paper awards). | paper |
| March 5th, 10:30 |
Alex Fraser: Chris Quirk, Arul Menezes, Colin Cherry. Dependency Treelet Translation: Syntactically Informed Phrasal SMT. ACL 2005. Part II: decoding, experiments, discussion. | |
| Feb 26th, 10:30 |
Our first paper on a non-preprocessing approach to syntactic SMT! Alex Fraser: Chris Quirk, Arul Menezes, Colin Cherry. Dependency Treelet Translation: Syntactically Informed Phrasal SMT. ACL 2005. Part I: model and training. | paper |
| Feb 19th, 10:30 |
Two papers on preprocessing approaches for coping with composita and rich inflection: Fabienne Fritzinger: Empirical Methods for Compound Splitting. Philipp Koehn and Kevin Knight. EACL 2003. Alex Fraser: Improving Statistical MT Through Morphological Analysis. Sharon Goldwater and David McClosky. EMNLP 2005. | composita paper inflection paper |
| Feb 12th, 10:30 | Hassan Sajjad: Michael Collins and Philipp Koehn and Ivona Kucerova. Clause Restructuring for Statistical Machine Translation. ACL 2005. | paper |
| Feb 5th, 10:30 | Alex Fraser: Franz Josef Och. Minimum Error Rate Training for Statistical Machine Translation. ACL 2003. | paper |
| Jan 22nd, 10:30 | Amit Dubey: Franz Josef Och, Hermann Ney. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation. ACL 2002 (best paper). | paper |
| Dec 18th, 10:30 | Amit Dubey: Hoifung Poon and Pedro Domingos. EMNLP 2008. Joint Unsupervised Coreference Resolution with Markov Logic. | paper |
| Dec 11th, 10:30 | Amit Dubey: Richardson and Domingos. Machine Learning, 62, 107-136, 2006. Markov Logic Networks. | paper |
| Dec 4th, 10:30, IMS Mitarbeiter Zimmer | Amit Dubey: Agirre, Baldwin and Martinez. ACL 2008. Improving Parsing and PP Attachment Performance with Sense Information Discriminative Reranking for Natural Language Parsing. | paper |
| Nov 27th, 10:30, IMS Mitarbeiter Zimmer | Aoife Cahill: Michael Collins. Discriminative Reranking for Natural Language Parsing. ICML 2000. | paper |
| Nov 20th, 10:30, IMS Mitarbeiter Zimmer | Alex Balabanov: Michael Collins. Three Generative, Lexicalised Models for Statistical Parsing. ACL/EACL 1997. You might also be interested in the slides for this paper or the longer Computational Linguistics journal paper (see Michael Collins' homepage) | paper |
| Nov 13th, 10:30, IMS Mitarbeiter Zimmer | Nadir Durrani: Statistical Phrase-Based Translation (HLT-NAACL 2003). Philipp Koehn, Franz Josef Och, Daniel Marcu | Statistical Phrase-Based Translation |
| Nov 6th, 10:30, IMS Mitarbeiter Zimmer | Alex Fraser: BLEU: a Method for Automatic Evaluation of Machine Translation (ACL 2002). Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu | BLEU paper |
| October 23rd, 10:30, 3.11 (IMS Phonetik Labor) | Christian Scheible: Introduction to Language Modeling. For slides and reference list, please click here. | Chen and Goodman LM tutorial (focus on interpolation and Kneser/Ney smoothing) |
| October 9th, 10:30, 12.21 | Martin Forst: Grammatical Machine Translation II. Martin Forst will discuss recent work on hybrid MT using LFG. For a full abstract, click here. | |
| October 2nd, 10:30, 12.21 | Helmut Schmid: PCFG parsing algorithms continued | No required reading. |
| September 25th, 10:00, 12.21 | Helmut Schmid: PCFG parsing algorithms | No required reading. |
| August 14th, 10:00, 12.21 (IMS lecture hall) | Helmut Schmid: Introduction to CFG parsing algorithms | No required reading. |
| August 7th, 10:00, 12.21 (IMS lecture hall) | Helmut Schmid: Introduction to HMM tagging | No required reading. Manning and Schuetze HMM Chapter recommended. |
| July 31st, 10:00, 12.21 (IMS lecture hall) | Alex Fraser: Introduction to statistical machine translation - Part 3, phrase-based modeling and decoding | no required reading |
| July 21st to July 25th | EMA Summer School, website is here. First two SMT lectures will be repeated on Tuesday, along with a practice assignment (implementing IBM Model 1). The lecture from the next reading group meeting (phrase-based modeling and decoding) will be on Wed at 14:00, followed by a practice assignment on decoding. Thursday morning's lecture will consist of a discussion of the assignments and a brief overview of some more advanced topics. | |
| July 17th, 10:00, 3.11 (IMS Phonetik Labor) | Alex Fraser: Introduction to statistical machine translation - Part 2. Bitext alignment (extracting lexical knowledge from parallel corpora) | Kevin Knight's SMT Tutorial |
| July 10th, 10:00, 3.11 (IMS Phonetik Labor) | Alex Fraser: Introduction to statistical machine translation - Part 1. I will define the MT problem and talk about evaluation. I will also discuss parallel corpora and sentence alignment and give a brief overview of statistical machine translation (SMT). Kevin Knight's tutorial is recommended, but not necessary until next week. | Kevin Knight's SMT Tutorial |
Organizers: Alex Fraser and Helmut Schmid
Email Addresses: SubstituteLastName@ims.uni-stuttgart.de