

This file provides the module Ling_Preprocessing that deals with linguistic text preprocessing. To include it in the synthesis process, the function redefine_UttTypes_for_ling_preproc, as defined in ims_german_scm, must be called (usually in ims_german_voices).

The linguistic analysis carried out within the module creates an additional relation NP for each utterance, containing all noun chunks of the utterance. These can be used for the prediction of accents and phrase boundaries, together with a number of other linguistic features and functions defined in ims_german_ling_preproc. The most important functions are the identification of the so-called 'Vorfeld' (the position before the finite verb in a German main clause), of enumerations and focus particles, and of the 'weight' and 'head' of noun chunks, as well as the analysis of sequences of verbs.
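As a rough illustration, the NP relation can be inspected from the Festival prompt with the standard utterance functions. This is only a sketch: it assumes an IMS German voice with Ling_Preprocessing enabled is already selected, the example sentence is arbitrary, and what each NP item's name holds is not specified here.

```scheme
;; Sketch only: assumes a German voice that runs Ling_Preprocessing is active.
;; The sentence is an arbitrary example, not taken from the module.
(set! utt (utt.synth (Utterance Text "Der alte Mann liest ein Buch.")))

;; Print the NP relation created by the linguistic preprocessing.
(utt.relation.print utt 'NP)

;; Collect the names of all items in the NP relation.
(mapcar
 (lambda (np) (item.name np))
 (utt.relation.items utt 'NP))
```

If the voice does not run Ling_Preprocessing, the NP relation will not exist and these calls will fail.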

CART trees for the prediction of phrase breaks (german_rule_based_phrase_cart_tree and german_data_based_phrase_cart_tree) and accents (german_rule_based_accent_cart_tree and german_data_based_accent_cart_tree) using the linguistic features introduced by Ling_Preprocessing are included in ims_german_ling_preproc. To use them, phrase_cart_tree and int_accent_cart_tree must be set accordingly (e.g. in ims_german_voices).
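A minimal sketch of selecting the rule-based trees, for example in a voice definition. It assumes ims_german_ling_preproc has already been loaded so that the tree variables are bound. The Parameter.set line is standard Festival usage for CART-based phrasing; whether the IMS voices need it set explicitly is an assumption.

```scheme
;; Sketch only: assumes ims_german_ling_preproc is loaded,
;; so the tree variables below are defined.
(set! phrase_cart_tree german_rule_based_phrase_cart_tree)
(set! int_accent_cart_tree german_rule_based_accent_cart_tree)

;; Standard Festival parameter for CART-based phrase break prediction
;; (assumption: the voice may already set this elsewhere).
(Parameter.set 'Phrase_Method 'cart_tree)
```

The data-based variants (german_data_based_phrase_cart_tree and german_data_based_accent_cart_tree) would be assigned in the same way.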

(Based on tobi_f0.scm by Alan W Black)

Martin Barbisch