Multilingual TED Talks

A small corpus of parallel TED talks together with models for topic and gender classification.

Multilingual TED Talks

Type

Corpus

FInd the corpus here

Author

Erenay Dayanik and Sebastian Padó

Reference

Erenay Dayanik and Sebastian Padó. Disentangling Document Topic and Author Gender in Multiple Languages: Lessons for Adversarial Debiasing. Proceedings of the EACL WASSA workshop, 2021. To appear.

Erenay Dayanik

 

Former Doctoral Researcher

This image shows Sebastian Padó

Sebastian Padó

Prof. Dr.

Chair of Theoretical Computational Linguistics, Managing Director of the IMS

To the top of the page