3. Phonetic Representation of
Intonation - IPO coding module
-
coding purpose: symbolic annotation
of stylized f0 contours as a sequence of pitch movements, according to
the the IPO scheme
-
data sources: spoken corpora (speech
files, f0 stylised contour files, phonetic transcription)
-
module references: f0 coding module
(optional), phonetic transcription module
-
description: Here we consider two
hierarchically ordered elements:
-
<closecopy>, representing
the inflection points in the stylized curve
-
<pitmove>, representing
the classified movements from one inflection point to the next one.
In principle, <closecopy>
could be directly imported from f0 stylised files or linked to one <f0>
element, and <pitmove> should
be linked to two consecutive <closecopy>
elements.
-
example: The following example
shows the coding of the Italian sentence
"quell'artificio contabile
sara` scoperto facilmente" read by a female speaker.
closecopy.xml
<closecopy
id="clscpy_001" value="207" start="130" end="130"/>
<closecopy
id="clscpy_002" value="243" start="540" end="540"/>
<closecopy
id="clscpy_003" value="285" start="690" end="690"/>
<closecopy
id="clscpy_004" value="212" start="860" end="860"/>
<closecopy
id="clscpy_005" value="189" start="1110" end="1110"/>
<closecopy
id="clscpy_006" value="159" start="1290" end="1290"/>
<closecopy
id="clscpy_007" value="209" start="1500" end="1500"/>
<closecopy
id="clscpy_008" value="206" start="1750" end="1750"/>
<closecopy
id="clscpy_009" value="246" start="2070" end="2070"/>
<closecopy
id="clscpy_010" value="226" start="2600" end="2600"/>
<closecopy
id="clscpy_011" value="148" start="2780" end="2780"/>
<closecopy
id="clscpy_012" value="144" start="3070" end="3070"/>
pitmove.xml
<pitmove
id="pitm_001" type="4" href="closecopy.xml# id(clscpy_001).. id(clscpy_002)"/>
<pitmove
id="pitm_001" type="1" href="closecopy.xml# id(clscpy_002).. id(clscpy_003)"/>
<pitmove
id="pitm_001" type="B" href="closecopy.xml# id(clscpy_003).. id(clscpy_004)"/>
<pitmove
id="pitm_001" type="Ø" href="closecopy.xml# id(clscpy_004).. id(clscpy_005)"/>
<pitmove
id="pitm_001" type="B" href="closecopy.xml# id(clscpy_005).. id(clscpy_006)"/>
<pitmove
id="pitm_001" type="4" href="closecopy.xml# id(clscpy_006).. id(clscpy_007)"/>
<pitmove
id="pitm_001" type="Ø" href="closecopy.xml# id(clscpy_007).. id(clscpy_008)"/>
<pitmove
id="pitm_001" type="4" href="closecopy.xml# id(clscpy_008).. id(clscpy_009)"/>
<pitmove
id="pitm_001" type="0" href="closecopy.xml# id(clscpy_009).. id(clscpy_010)"/>
<pitmove
id="pitm_001" type="B" href="closecopy.xml# id(clscpy_010).. id(clscpy_011)"/>
<pitmove
id="pitm_001" type="Ø" href="closecopy.xml# id(clscpy_011).. id(clscpy_012)"/>
ELEMENT <closecopy>
ATTRIBUTES
id [ASCII]
value [FLOAT]
href <f0> ***< (optional) >***
start [FLOAT]
end [FLOAT]
ELEMENT <pitmove>
ATTRIBUTES
id [ASCII]
type 0, Ø, 1, 2, 3, 4, 5, A, B, C, D, E, &2,
&3, &4, &A, &C, &D
href <closecopy>..<closecopy>
start [FLOAT]
end [FLOAT]
-
coding procedure: The most IPO-conformant
coding procedure will directly import the stylized f0 curve, obtained with
the help of a proper external environment for perceptual, using the <closecopy>
element with no need of the <f0>
element, and will consist in the following steps:
-
open the speech file in order to
listen to its intonation
-
open the corresponding phonetic
segmentation (<phone> and
<syllable>)
-
import the close copy and display
it as a curve, aligned with phonetic segmentation
-
define <pitmove>
elements by selecting the segments of the stylized curve (delimited by
two consecutive <closecopy> elements) and labeling each of them according
to the following criteria:
-
if it can be considered to coincide
with the ideal baseline or topline, by a global look at the curve, label
it 0 or Ø respectively
-
otherwise choose the proper label
on the basis of movement direction and size and of its position in the
syllable, judged by looking at its phonetic alignment
If the close copy is not
available, the third step may be replaced by the following steps (a very
simplified approximation of the correct stylization procedure):
-
import or generate automatically
the raw f0 curve and display it
-
obtain a closecopy by selecting
the 'relevant' <f0> points
on the raw curve; base such stylization on the shape of the curve, the
perceived intonation of the sound file and the alignment with syllables
(accents, boundaries...)
-
creation notes:
-
Authors: Silvia Quazza, Juan María
Garrido
-
Version: 1., October 1999
-
Comments: none
-
Literature:
Phonetic Representation of Intonation -
INTSINT Coding Module