Dieses Bild zeigt  Pavel Denisov


Pavel Denisov

Institut für maschinelle Sprachverarbeitung
Digital Phonetics


+49 711 685-81396

Pfaffenwaldring 5 b
70569 Stuttgart
Raum: 02.014


As needed, write me an e-mail.


  • D. Raj, P. Denisov, Z. Chen, H. Erdogan, Z. Huang, M. He, S. Watanabe, J. Du, T. Yoshioka, Y. Luo, N. Kanda, J. Li, S. Wisdom, J. R. Hershey. Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. In Proceedings of the 8th IEEE Spoken Language Technology Workshop (SLT), 2021.


  • P. Denisov and N. T. Vu. Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning. In Proceedings of Interspeech, 2020.
  • C. Y. Li, D. Ortega, D. Väth, F. Lux, L. Vanderlyn, M. Schmidt, M. Neumann, M. Völkel, P. Denisov, S. Jenne, Z. Kacarevic and N. T. Vu. ADVISER: A Toolkit for Developing Multimodal, Multi-domain and Socially-engaged Conversational Agents. In Proceedings of ACL - Systems Demonstration, 2020.


  • P. Denisov and N. T. Vu. End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning. In Proceedings of Interspeech, 2019.
  • D. Ortega, C. Y. Li, G. Vallejo, P. Denisov, N. T. Vu. Context-aware Neural-based Dialog Act Classification On Automatically Generated Transcriptions. In Proceedings of the 44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.
  • P. Denisov and N. T. Vu. IMS-Speech: A Speech to Text Tool. In Proceedings of the 30th Conference on Electronic Speech Signal Processing (ESSV), 2019.


  • P. Denisov, N. T. Vu and M. Ferras. Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition. In Proceedings of the 13th ITG Conference on Speech Communication, 2018.

WS 2020/21 Speech Recognition
SS 2020
Team Laboratory Phonetics
SS 2019
Spoken Language Processing

Since 08.2018:
PhD candidate in Digital Phonetics, Institute for Natural Language Processing (IMS),University of Stuttgart

10.2017 - 07.2018:
Master Thesis Student, Working Student, Sony Stuttgart Technology Center

10.2016 - 07.2018:
MSc. in Computational Linguistics, Institute for Natural Language Processing (IMS),University of Stuttgart
Thesis: Transfer Learning for Robust Acoustic Modeling in Automatic Speech Recognition

11.2006 - 10.2016:
Software Engineer, Scoros International Inc.
Document Processing, Content Extraction, Release Engineering

09.2003 - 02.2009:
Diploma of Engineer, Saint Petersburg State University of Aerospace Instrumentation
Computer Aided Engineering

Zum Seitenanfang