15.06.2013 Views

Teza doctorat (pdf) - Universitatea Tehnică

Teza doctorat (pdf) - Universitatea Tehnică

Teza doctorat (pdf) - Universitatea Tehnică

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

O. Buza, G. Toderean, J. Domokos, A. Zs. Bodo, Construction of a Syllable-Based Text-To-Speech System for Romanian<br />

3.1. S/U/V SEGMENTATION<br />

Our segmentation method uses time domain analysis of speech signal. After low-pass<br />

filtering of signal, zero-crossing waveform points ( Zi) are detected. Then minimum ( mi) and<br />

maximum (Mi) values between two adjacent zero points are computed.<br />

Separation between silence and speech segments is realized by using a threshold value Ts<br />

on signal amplitude. In silence segments, all mi and Mi points must be lower than this threshold:<br />

|<br />

M i | T<br />

s<br />

<br />

|<br />

mi<br />

| T<br />

s<br />

, i = s… s+n, (5)<br />

where s is the segment sample index and n is the number of samples in that segment.<br />

For speech segments, distance Di between two adjacent zero points is computed.<br />

Decision of voiced segment is assumed if distance is greater than a threshold distance V:<br />

Di V , i = s,… , s+n (6)<br />

Z1<br />

Figure 3. A voiced segment of speech<br />

For the zero points between A and B from figure 3 to be included in the voiced segment, a<br />

look-ahead technique has been applied. A number of maximum Nk zero points between Zi and Zi+k<br />

can be inserted in voiced region if Di-1>V and Di+k >V :<br />

D<br />

<br />

D<br />

i<br />

<br />

D<br />

i<br />

j<br />

1<br />

k<br />

V<br />

V<br />

V<br />

A<br />

B<br />

305<br />

Zn<br />

, j = i..k; k

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!