Speech Synthesis at the IMS

festival logo

TTS demo

We provide two demo pages. On the first one you can synthesize any given text online. The second is a speaking metro information system for the region of Stuttgart.

Project description

The speech synthesis activities at the IMS (Experimental Phonetics) concentrates on various linguistic and application oriented aspects of speech synthesis. Our goal is to achieve naturally sounding and linguistically motivated speech synthesis.
The speech synthesis system developed at the IMS is the IMSGermanFestival system (download IMS German Festival). It is based on the original externer Link Festival speech synthesis framework developed at CSTR, University of Edinburgh. The current voice of our system uses diphones taken from the externer Link MBROLA project.
We are currently working on various aspects that will improve our system. One main aspect is prosody generation. We are exploiting methods that take into account linguistic information (like part-of-speech or syntactic parsing) to predict accents and phrasing. A sentence's pitch contour is generated using statistic or rule-based methods. Our speech synthesis system has a refined text-preprocessing module that deals with all kinds of abbreviations and various different number formats (cardinal and ordinal numbers, dates, currency, ...).
Our group is participating in the SmartKom project, a large German-wide project on multimodal human-machine interaction founded by the German gouvernment. In this project we are responsible for the speech output modality. Additional funding comes from the externer Link Deutsche Forschungsgemeinschaft (DFG) and the Sony Research Center Germany.

Members of the IMS speech synthesis group

Former members of the IMS speech synthesis groups

Download

You can download a version of our German Festival synthesis system.

Interesting local links

hoch top