5. Phonological Representation
of Intonation - ToBI (Tone Layer) Coding Module
-
name: ToBI (Tone Layer) Annotation
-
coding purpose: phonological annotation
of the intonation curve, distinguishing pitch accents, phrase accents and
boundary tones, according to the ToBI scheme
-
data sources: spoken corpora (speech
files, f0 files, orthographic transcription, phonetic transcription)
-
module references: orthographic
transcription module, phonetic transcription module, f0 coding module
-
description: In our XML adaptation
of ToBI, four elements have been defined:
-
<tobitone>, for the
tones, distinguished according to their function as
pitch accents,
phrase accents or boundary tones and labeled according to
a classification of their linguistically admissible types
-
<target>,
to mark peak location when it occurs outside the scope of the accented
syllable
-
<f0range>, to mark
the highest f0 value in the curve
-
<repair>, to mark the
restart of the intonation contour after a disfluency
The four elements are not
hierarchically ordered. All may refer to the f0 curve, while only
the two accessory element <target>
and <f0range> are necessarily
linked to <f0>. The <tobitone>
and <repair> elements can
be linked to prosodic units and/or to phonetic descriptions of intonation,
rather than raw f0.
-
example: The following example
shows the ToBI annotation of the English utterance "Show me the cheapest
fare from Philadelphia to Dallas excluding restriction VU slash one" (obtained
from the TOBI-TRAINING material), using the elements <tobitone>
and <repair>.
tobitone.xml
<tobitone
id="tbtn_001" type="H*" class="pitaccent" href="word.xml# id(wrd_001)"
start="2052" end="2052"/>
<tobitone
id="tbtn_002" type="L+H*" class="pitaccent" href="word.xml# id(wrd_004)"
start="2579" end="2579"/>
<tobitone
id="tbtn_003" type="!H*" class="pitaccent" href="word.xml# id(wrd_005)"
start="3065" end="3065"/>
<tobitone
id="tbtn_004" type="L-" class="phraccent" href="word.xml# id(wrd_005)"
start="3315" end="3315"/>
<tobitone
id="tbtn_005" type="L%" class="boundtone" href="word.xml# id(wrd_005)"
start="3315" end="3315"/>
<tobitone
id="tbtn_006" type="L+H*" class="pitaccent" href="word.xml# id(wrd_009)"
start="4470" end="4470"/>
<tobitone
id="tbtn_007" type="!H*" class="pitaccent" href="word.xml# id(wrd_009)"
start="4771" end="4771"/>
<tobitone
id="tbtn_008" type="L-" class="phraccent" href="word.xml# id(wrd_009)"
start="5015" end="5015"/>
<tobitone
id="tbtn_009" type="H*" class="pitaccent" href="word.xml# id(wrd_011)"
start="5388" end="5388"/>
<tobitone
id="tbtn_010" type="L-" class="phraccent" href="word.xml# id(wrd_011)"
start="5855" end="5855"/>
<tobitone
id="tbtn_011" type="L%" class="boundtone" href="word.xml# id(wrd_011)"
start="5855" end="5855"/>
<tobitone
id="tbtn_012" type="L+H*" class="pitaccent" href="word.xml# id(wrd_012)"
start="6984" end="6984"/>
<tobitone
id="tbtn_013" type="L-" class="phraccent" href="word.xml# id(wrd_012)"
start="7399" end="7399"/>
<tobitone
id="tbtn_014" type="L%" class="boundtone" href="word.xml# id(wrd_012)"
start="7399" end="7399"/>
<tobitone
id="tbtn_015" type="H*" class="pitaccent" href="word.xml# id(wrd_013)"
start="8154" start="8154"/>
<tobitone
id="tbtn_016" type="L-" class="phraccent" href="word.xml# id(wrd_013)"
start="8585" end="8585"/>
<tobitone
id="tbtn_017" type="L%" class="boundtone" href="word.xml# id(wrd_013)"
start="8585" end="8585"/>
<tobitone
id="tbtn_018" type="H*" class="pitaccent" href="word.xml# id(wrd_014)"
start="8711" end="8711"/>
<tobitone
id="tbtn_019" type="!H*" class="pitaccent" href="word.xml# id(wrd_015)"
start="8928" end="8928"/>
<tobitone
id="tbtn_020" type="L-" class="phraccent" href="word.xml# id(wrd_015)"
start="9114" end="9114"/>
<tobitone
id="tbtn_021" type="H*" class="pitaccent" href="word.xml# id(wrd_016)"
start="9353" end="9353"/>
<tobitone
id="tbtn_022" type="H*" class="pitaccent" href="word.xml# id(wrd_017)"
start="9694" end="9694"/>
<tobitone
id="tbtn_023" type="L-" class="phraccent" href="word.xml# id(wrd_017)"
start="9880" end="9880"/>
<tobitone
id="tbtn_024" type="L%" class="boundtone" href="word.xml# id(wrd_017)"
start="9880" end="9880"/>
repair.xml
<repair
id="rpr_001" type="%r" start="4149" end="4149"/>
ELEMENT
<tobitone>
ATTRIBUTES:
id
[ASCII]
type
L-, H-, !H-, -, -?, X-?, L-L%, L-H%, H-H%, H-L%, %, %?, X%?, H*, !H*, L*,
L*+H, L*+!H, L+H*, L+!H*, H+!H*, *, *?, X*?
class
pitaccent, phraccent, boundtone
href
<f0> or <closecopy> or <momel> or <intone> or <syllable>
or <word>
start
[FLOAT]
end
[FLOAT]
The set of symbols
defined for the attribute `type' includes the allowable combination of
pitch accents, phrase accents, boundary tones and/or uncertainty symbols,
as defined in the ToBI guidelines. The value for the attribute 'type' should
be consistent with the attribute 'class', according to the semantics of
the different labels described in the tables in 6.2.1.
ELEMENT
<target>
ATTRIBUTES:
id
[ASCII]
type
EarlyF0, LateF0
href
<f0> or
<closecopy>
or
<momel>
or
<intone>
start
[FLOAT]
end
[FLOAT]
ELEMENT <f0range>
ATTRIBUTES:
id [ASCII]
type HiF0
href <f0> or
<closecopy> or
<momel> or
<intone>
start [FLOAT]
end [FLOAT]
ELEMENT <repair>
ATTRIBUTES:
id [ASCII]
type %r
href <f0> or
<closecopy> or
<momel> or
<intone> or
<syllable>
start [FLOAT]
end [FLOAT]
-
coding procedure: Different
procedures may be followed to obtain a ToBI annotation of intonation. A
possible procedure, assuming that the <f0> and <syllable>
elements are available, is:
-
open the following synchronized
windows: signal window, <f0>
(with its graphical display), <word>,
<syllable> by listening to
the signal and inspecting the f0 curve and the aligned words and syllables,
find out pitch accents, phrase boundaries, boundary tones and repairs (according
to ToBI definitions and guidelines)
-
for each detected intonation event,
select the <syllable> on which it occurs, create the corresponding (linked)
<tobitone> or <repair>
element and assign it the proper class attribute;
time values will be inherited from <syllable>
(or, if desired, can be set explicitly in correspondence of the f0 peak
or valley)
-
in case the peak or valley of the
event is outside the syllable, choose its exact <f0>
point and create the linked <target>
element
-
for each phrase, find out its f0
maximum, select its <f0> point
and create the linked <f0range>
element
-
creation notes:
-
Authors: Silvia Quazza, Juan María
Garrido
-
Version: 1., October 1999
-
Comments: none
-
Literature:
Prosodic Phrasing - ToBI (Break Index) Coding
Module