27.12.2012 Views

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

noise component [GNE]<br />

5<br />

4<br />

3<br />

2<br />

1<br />

cancer groups<br />

normal (n=93)<br />

Speech research 27<br />

whispering (aphon) (n=60)<br />

pseudo-gl. phon. (n=9) aryepigl. phon. (n=6)<br />

glottic phon. (n=15)<br />

ventr. phon. (n=10)<br />

0 1 2 3 4 5 6 7 8 9 10<br />

irregularity component [jitter, shimmer, period correlation]<br />

Figure 1. Göttinger Hoarseness Diagram; distribution of some cancer groups; cf. Ref. [13].<br />

as jitter (period-length fluctuation) <strong>and</strong> shimmer (amplitude or energy fluctuation);<br />

on the other h<strong>and</strong>, measures of the noise component relative to glottal excitation.<br />

These are coarse correlates of subjective roughness <strong>and</strong> breathiness, respectively.<br />

There are many different definitions of such measures; e. g., for irregularity [15]:<br />

�<br />

�<br />

�<br />

�<br />

�<br />

�<br />

xn − xn−1<br />

xn<br />

�<br />

�<br />

�<br />

� or<br />

�<br />

�<br />

�<br />

�<br />

�<br />

xn − (1/M) � (M−1)/2<br />

m=−(M−1)/2 xn+m<br />

(1/M) � (M−1)/2<br />

m=−(M−1)/2 xn+m<br />

�<br />

�<br />

�<br />

� , M = 3, 5, . . . ,<br />

�<br />

each averaged over n, where xn is the length or amplitude or energy of the nth period.<br />

The period energy is more robustly measurable than the amplitude. For the period<br />

length, a method proved to be especially reliable which was based on the correlation<br />

coefficient of subsequent signal intervals [t, t + T ) <strong>and</strong> [t + T, t + 2T ) <strong>and</strong> maximizing<br />

with respect to T [16,17] (with interpolation between the signal samples). The average<br />

correlation coefficient of subsequent periods served as another irregularity measure<br />

(Mean Waveform matching Coefficient, MWC).<br />

Traditional measures of the noise component (e. g., NNE [18], CHNR [19]) are<br />

unfortunately dependent on the irregularity measures <strong>and</strong> the choice of the analysis<br />

window. With strongly irregular voices, they are often not applicable, since they usually<br />

require a harmonic spectral structure. Therefore, we started from the following<br />

assumption [20]: with glottal excitation – regardless how irregular – the excitation in<br />

different b<strong>and</strong>s should be nearly synchronous, with noise excitation, however, asynchronous.<br />

Hereon the following construction is based. After downsampling to 10<br />

kHz <strong>and</strong> linear-predictive inverse filtering for removing the formant structure, the<br />

signal is decomposed into partial b<strong>and</strong>s using Hann-window shaped filters. For each<br />

b<strong>and</strong>, the Hilbert envelope is formed <strong>and</strong> its mean removed. For all pairs of b<strong>and</strong>s

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!