Publication | Open Access
Spatio-temporal quantification of vocal fold vibrations using high-speed videoendoscopy and a biomechanical model
40
Citations
47
References
2008
Year
ElectroglottographyVoice SurgeryAcoustic ModelingSpeech RecognitionKinesiologyData ScienceVocal Tract ImagingBiostatisticsVoice RecognitionHigh-speed VideoendoscopyHealth SciencesSpeech PerceptionVocal Fold OscillationsVocal Fold VibrationsVocal FoldsSpatio-temporal QuantificationSpeech TechnologyVocal Fold DynamicsSpeech ProcessingArts
Pathologic changes within the organic constitution of vocal folds or a functional impairment of the larynx may result in disturbed or even irregular vocal fold vibrations. The consequences are perturbations of the acoustic speech signal which are perceived as a hoarse voice. By means of appropriate image processing techniques, the vocal fold dynamics are extracted from digital high-speed videos. This study addresses the approach to obtain a parametric description of the spatio-temporal characteristics of the vocal fold oscillations for the aim of classification. For this purpose a biomechanical vocal fold model is introduced. An automatic optimization procedure is developed for fitting the model dynamics to the observed vocal fold oscillations. Thus, the resulting parameter values represent a specific vibration pattern and serve as an objective quantification measure. Performance and reliability of the optimization procedure are validated with synthetically generated data sets. The high-speed videos of two normal voice subjects and six patients suffering from different voice disorders are processed. The resulting model parameters represent a rough approximation of physiological parameters along the entire vocal folds.
| Year | Citations | |
|---|---|---|
Page 1
Page 1