Teza doctorat (pdf) - Universitatea Tehnică
Teza doctorat (pdf) - Universitatea Tehnică Teza doctorat (pdf) - Universitatea Tehnică
O. Buza, G. Toderean, J. Domokos, A. Zs. Bodo, Construction of a Syllable-Based Text-To-Speech System for Romanian Also a region of voiced discontinuity corresponding to intermediate frequencies associated with plosive consonants like /C/ or /G/, occurring when these consonants are splitting a sequence of vowels. 6. Voiced Jump (VJ) Is a region similar with voiced region V, but it has no periodicity. It is due to the balance of vocal signal only above or underlying median zero line. This region has no vowel or other phoneme correspondence in speech signal, but a transition or co-articulation. 7. Irregular Region (IR) This is a region in which one can find plosive consonants like /C/, /G/ or /P/. Usually it comes after a silence region, it has a short duration and a frequency band intermediate between vowels and fricative consonants. 8. High Density transition (HD) Is a transition region with high frequency values, which could indicate emergency of fricative consonants. Signal is not integrated in C or US classes because of positive or negative balance relative to median zero value. 9. Transition (T) This is an intermediate region between voiced and unvoiced and which has no the characteristics of IR or HD classes. 10. Unvoiced Consonant (C) For Romanian language, this class contains fricative consonants /S/, /Ş/, /Ţ/, /F/, /Z/, /J/, /H/, and non-fricatives /Ce/, /Ci/, /Ge/, /Gi/ . For detecting these classes, median amplitude MA(n), number of zero-cross points NTZ, signal energy E(n) have been used as section 2 describes in (1)-(4), and also short-term Fourier coefficients for detection of VP and HD special cases. 3.4. PHONETIC SEGMENTATION Phonetic segmentation is the process of associating phonetic symbols with the speech signal. This process is very usefull when we want to develop an acoustic database from a large speech corpus. Phonetic segmentation gives the capability of detecting and separating phonetic units from speech, units that will be used in achieving the output acoustic chain sequence through concatenation. 308
O. Buza, G. Toderean, J. Domokos, A. Zs. Bodo, Construction of a Syllable-Based Text-To-Speech System for Romanian State of the art on this field exposes some different methods for automated or semi- automated phonetic segmentation of speech signal: iterative methods with training stages (as HMM segmentation or region frontiers refinement), methods based on association rules, statistical methods (like segmentation based on generalized likelihoods GLR), a.o. Phonetic sgmentation method proposed by the authors is a method based on association rules, which realises a correspondence between phonetic groups taken from an input stream and distinct types of regions detected from the speech signal. Segmentation algorithm parses the input text and tries to find the best match for each phonetic group with one or more regions of speech signal. Input text is first written into a special phonetic transcription, using a simple look-after table, which includes phoneme and word transitions. Transcripted text is splitted into a sequence of phonetic groups. Special association rules will establish a correspondence with specific regions detected from speech signal. For associating phonetic groups with sequences of regions, as further in our approach, we have used LEX parser generator. We have written a set of association rules, each rule specifying a phonetic pattern for associating a particular group with a sequence of regions, and also specifying a condition to be verified in order to make that association. Each condition outlines: type of region, minimal and maximal duration, type of association: unique region or sequence of regions. Figure 7 presents association between a phonetic group i i i G F F ,..., F 309 i , where Fk i are 1 , 2 phonetic symbols, and a sequence of regions i i i SR REG REG ,..., REG correspondence rule Ri : Gi COND_REGi . Text Rules Signal … … … REG1 i F1 i N , by the meaning of a 1 , 2 F2 i … Fk i … … R1: G1 Cond_Reg1 Ri: Gi Cond_Regi Rn: Gn Cond_Regn REG2 i Figure 7. Association between phonetic groups and signal regions Gi … REGN i … N k
- Page 276 and 277: 1 Procesare Separator Procesare Cuv
- Page 278 and 279: Cap. 7. Proiectarea sistemului de s
- Page 280 and 281: Cap. 7. Proiectarea sistemului de s
- Page 282 and 283: Cap. 7. Proiectarea sistemului de s
- Page 284 and 285: 8. Concluzii finale Cercetările ef
- Page 286 and 287: 268 Cap. 8. Concluzii finale 11. A
- Page 288 and 289: 270 Cap. 8. Concluzii finale percep
- Page 290 and 291: 272 Cap. 8. Concluzii finale b) pen
- Page 292 and 293: 274 Cap. 8. Concluzii finale - nive
- Page 294 and 295: Bibliografie [And88] André-Obrecht
- Page 296 and 297: 278 Bibliografie Quality and Testin
- Page 298 and 299: 280 Bibliografie [Giu06] Giurgiu M.
- Page 300 and 301: 282 Bibliografie [Nag05] Nageshwara
- Page 302 and 303: 284 Bibliografie Research Institute
- Page 304 and 305: Anexa 2. Silabele din setul S2 dup
- Page 306 and 307: Anexa 2. Silabele din setul S2 dup
- Page 308 and 309: Anexa 3. Silabe din setul S3 după
- Page 310 and 311: Anexa 4. Silabe din setul S4 după
- Page 312 and 313: Anexa 4. Silabe din setul S4 după
- Page 314 and 315: 296 Anexa 5. Activitatea ştiinţif
- Page 316 and 317: 298 Anexa 5. Activitatea ştiinţif
- Page 318 and 319: Anexa 6. Lucrări ştiinţifice ale
- Page 320 and 321: O. Buza, G. Toderean, J. Domokos, A
- Page 322 and 323: O. Buza, G. Toderean, J. Domokos, A
- Page 324 and 325: O. Buza, G. Toderean, J. Domokos, A
- Page 328 and 329: O. Buza, G. Toderean, J. Domokos, A
- Page 330 and 331: Level 3 O. Buza, G. Toderean, J. Do
- Page 332 and 333: O. Buza, G. Toderean, J. Domokos, A
- Page 334 and 335: O. Buza, G. Toderean, J. Domokos, A
- Page 336 and 337: O. Buza, G. Toderean, J. Domokos, A
- Page 338 and 339: About Construction of a Syllable-Ba
- Page 340 and 341: WSEAS TRANSACTIONS ON COMMUNICATION
- Page 342 and 343: WSEAS TRANSACTIONS ON COMMUNICATION
- Page 344 and 345: WSEAS TRANSACTIONS ON COMMUNICATION
O. Buza, G. Toderean, J. Domokos, A. Zs. Bodo, Construction of a Syllable-Based Text-To-Speech System for Romanian<br />
Also a region of voiced discontinuity corresponding to intermediate frequencies associated<br />
with plosive consonants like /C/ or /G/, occurring when these consonants are splitting a<br />
sequence of vowels.<br />
6. Voiced Jump (VJ)<br />
Is a region similar with voiced region V, but it has no periodicity. It is due to the balance of<br />
vocal signal only above or underlying median zero line. This region has no vowel or other<br />
phoneme correspondence in speech signal, but a transition or co-articulation.<br />
7. Irregular Region (IR)<br />
This is a region in which one can find plosive consonants like /C/, /G/ or /P/. Usually it<br />
comes after a silence region, it has a short duration and a frequency band intermediate<br />
between vowels and fricative consonants.<br />
8. High Density transition (HD)<br />
Is a transition region with high frequency values, which could indicate emergency of<br />
fricative consonants. Signal is not integrated in C or US classes because of positive or<br />
negative balance relative to median zero value.<br />
9. Transition (T)<br />
This is an intermediate region between voiced and unvoiced and which has no the<br />
characteristics of IR or HD classes.<br />
10. Unvoiced Consonant (C)<br />
For Romanian language, this class contains fricative consonants /S/, /Ş/, /Ţ/, /F/, /Z/, /J/,<br />
/H/, and non-fricatives /Ce/, /Ci/, /Ge/, /Gi/ .<br />
For detecting these classes, median amplitude MA(n), number of zero-cross points NTZ,<br />
signal energy E(n) have been used as section 2 describes in (1)-(4), and also short-term Fourier<br />
coefficients for detection of VP and HD special cases.<br />
3.4. PHONETIC SEGMENTATION<br />
Phonetic segmentation is the process of associating phonetic symbols with the speech<br />
signal. This process is very usefull when we want to develop an acoustic database from a large<br />
speech corpus. Phonetic segmentation gives the capability of detecting and separating phonetic<br />
units from speech, units that will be used in achieving the output acoustic chain sequence through<br />
concatenation.<br />
308