27.12.2012 Views

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

32 H. W. Strube<br />

3.5.1 Glottis pulse parameters by inverse filtering<br />

For describing the glottal oscillation, it is hardly feasible to estimate the many mechanical<br />

parameters of the vocal-fold system, but characteristic quantities can be<br />

used as occurring in parametric models of the pulse shape of the glottal volume velocity,<br />

for instance, the well-known LF-model [29]. Such a model can be fitted to a<br />

measured pulse shape. For the measurement to be feasible during normal speaking,<br />

only inverse-filtering techniques are applicable. That is, the filtering by the vocal<br />

tract is computationally undone in order to obtain the glottal volume velocity. This<br />

leads to the circular problem that the vocal-tract transfer function cannot be estimated<br />

exactly enough without knowledge of the input signal. Thus, a simultaneous<br />

estimation of the transfer function <strong>and</strong> the glottal pulse (as LF-model) was carried<br />

out iteratively with a multidimensional optimization method, see Fig. 4.<br />

The transfer function was computed in a pitch-related way with a modified DAP algorithm<br />

[32] <strong>and</strong> the Itakura-Saito error occurring therein was also used as optimality<br />

criterion for the total iteration. The method was verified with synthetic speech, where<br />

beside the LF parameters also derived quantities such as Open-Quotient, Closed-<br />

Quotient, Speed-Quotient, Parabolic Spectral Parameter were compared with the<br />

given data. For natural speech, the energies of 200-ms intervals <strong>and</strong> the electroglottographically<br />

measured open-quotient were employed for comparison with the model.<br />

A detailed description can be found in Refs. [30,31].<br />

Choice of<br />

parameter set<br />

Winner<br />

set<br />

Multi−dimensional<br />

optimizer<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

-0.1<br />

-0.2<br />

-0.3<br />

-0.4<br />

-0.5<br />

Glottis model<br />

Spectrum<br />

without glottis<br />

Error<br />

Speech signal<br />

2.1 2.105 2.11 2.115 2.12 2.125<br />

Time [s]<br />

Inverse filtering,<br />

DAP<br />

Inverse filtered<br />

signal<br />

Figure 4. Top: Scheme of estimating glottal pulse <strong>and</strong> transfer function, cf. [30,31]. Bottom:<br />

Example of estimated glottal flow derivative <strong>and</strong> fitted LF-model (gray, smooth).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!