Multilingual TED Talks

A small corpus of parallel TED talks together with models for topic and gender classification.

Multilingual TED Talks



FInd the corpus here


Erenay Dayanik and Sebastian Padó


Erenay Dayanik and Sebastian Padó. Disentangling Document Topic and Author Gender in Multiple Languages: Lessons for Adversarial Debiasing. Proceedings of the EACL WASSA workshop, 2021. To appear.

Erenay Dayanik


Former Doctoral Researcher

This image shows Sebastian Padó

Sebastian Padó

Prof. Dr.

Chair of Theoretical Computational Linguistics, Managing Director of the IMS

To the top of the page