27.12.2012 Views

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

Oscillations, Waves, and Interactions - GWDG

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

26 H. W. Strube<br />

3 Acoustic analysis of pathologic voices, extended to running speech 1<br />

3.1 Introduction<br />

For diagnostics <strong>and</strong> treatment of voice disorders the evaluation of voice quality is<br />

essential. Apart from the auditive judgment by the phoniatrist, it is desirable to find<br />

objective criteria for the rating of voice disorders, especially, to determine physical<br />

quantities appropriate for diagnostic description <strong>and</strong> classification of speech pathologies.<br />

For this purpose, acoustic analysis methods have been developed. Starting<br />

from known quantities, such as jitter, shimmer, <strong>and</strong> measures of additive noise, novel<br />

quantities were investigated that allow a better separation of noise from glottal periodicity<br />

disturbances. According to clinical usage, first stationary vowels were considered.<br />

But an analysis of running speech is also desirable, in order to assess the<br />

voice under more realistic conditions. So an important goal of the project was the<br />

extension of the methods to running speech.<br />

Further, in phoniatric diagnostics stroboscopic video recordings of the vibrating<br />

glottis are usual. These were supplemented by high-speed recordings, <strong>and</strong> automatic<br />

methods of image segmentation were developed (e. g., determination of the glottal<br />

opening area). Besides all this, an extensive data bank with acoustic <strong>and</strong> optical<br />

recordings as well as diagnostic findings was built up.<br />

3.2 Equipment <strong>and</strong> data bank<br />

For recording <strong>and</strong> processing, st<strong>and</strong>ard PCs were used with good sound cards (e. g.,<br />

Soundblaster PCI 128) under Linux in an ethernet LAN. The voice recordings were<br />

done in an anechoic <strong>and</strong> insulated recording room, not containing any noisy devices.<br />

The microphone, a Beyerdynamic HEM 191.15 with spherical characteristic, was<br />

head-fixed about 10 cm in front of the chin. A special graphic interface for recording,<br />

cutting <strong>and</strong> marking of the voice recordings was programmed.<br />

Presently there are about 77000 voice recordings as WAV files with 48 kHz sampling<br />

frequency: vowels [ε: a: e: i: o: u:] with pitches normal, low, high; running<br />

speech (phonetically balanced st<strong>and</strong>ard texts “Nordwind und Sonne” [north wind<br />

<strong>and</strong> sun] <strong>and</strong> “Buttergeschichte” [butter story]); spontaneous speech. For archiving<br />

<strong>and</strong> automatic administration of the voice <strong>and</strong> video recordings as well as the medical<br />

diagnoses (about 70 different ones), a large MySQL data base was built up. It runs<br />

with a PHP web frontend on a Linux PC, is connected to the patient information<br />

system SAP of the university hospital <strong>and</strong> has an interface to the video stroboscopy<br />

workplaces. For each patient-related voice analysis, a PDF file with color print can<br />

be generated.<br />

3.3 The Göttinger Hoarseness Diagram<br />

Especially fruitful was the voice characterization by the “Göttinger Hoarseness Diagram”<br />

(GHD) [11–14], Fig. 1. It is based on traditionally important quantities in<br />

phoniatric diagnostics: on one h<strong>and</strong>, irregularity measures of glottal oscillation, such<br />

1 extended <strong>and</strong> updated from earlier German papers [9,10]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!