Publication | Closed Access

Phonetic-class based correlation analysis for severity of dysphonia

Citations: 12

References: 13

Year: 2017

Abstract

The main purpose of the research is to model the cognitive processes that occur when a physician determines the severity of dysphonia, and to build an IT system that can substitute for the subjective severity assessment made by a clinician. This preliminary study investigates the relationship between acoustic parameters and the severity of the speech defect as determined by a clinician. Because the number of pathological speech samples is limited, it is very important to choose effective parameters. After phoneme-level segmentation, acoustic parameters were measured at predetermined fixed points in continuous speech. The parameters were grouped by phonetic class (classes defined by manner of articulation), and the correlation of the grouped parameters with the severity of dysphonia on the RBH scale was examined, where R stands for roughness, B for breathiness, and H for overall hoarseness. The analysis was carried out on a database containing several types of pathology, the most frequent being recurrent paresis and functional dysphonia. It was found that, beyond the initial acoustic parameters measured on vowels, namely jitter (ddp), shimmer (dda), Harmonics-to-Noise Ratio (HNR) and mel-frequency cepstral coefficients (MFCC), it is worth measuring the Soft Phonation Index (SPI) and Empirical Mode Decomposition (EMD) based frequency band ratios on different phonetic classes. These measures were found to correlate with the severity of dysphonia as determined by the clinician (RBH). They provide useful information and could help differentiate types of dysphonia such as functional dysphonia and recurrent paresis.
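
The grouping and correlation step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which correlation coefficient was used, so Pearson's r is an assumption here, and the function names, the per-speaker averaging, and the toy values are all hypothetical.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # correlation is undefined for a constant series
    return cov / sqrt(var_x * var_y)


def correlate_by_class(measurements, severity):
    """Correlate one acoustic parameter with severity, per phonetic class.

    measurements: iterable of (speaker_id, phonetic_class, value) tuples,
                  one per measured phoneme (hypothetical layout).
    severity:     dict mapping speaker_id to a clinician rating, e.g. the
                  H score of the RBH scale.
    """
    # Group the measured values by phonetic class, then by speaker.
    grouped = defaultdict(lambda: defaultdict(list))
    for speaker, phon_class, value in measurements:
        grouped[phon_class][speaker].append(value)

    result = {}
    for phon_class, by_speaker in grouped.items():
        speakers = sorted(by_speaker)
        # Assumption: each speaker is represented by the mean over phonemes.
        xs = [sum(by_speaker[s]) / len(by_speaker[s]) for s in speakers]
        ys = [severity[s] for s in speakers]
        result[phon_class] = pearson(xs, ys)
    return result


# Toy values for illustration only; they are not data from the study.
measurements = [
    ("s1", "vowel", 0.2), ("s1", "vowel", 0.4),
    ("s2", "vowel", 0.6), ("s3", "vowel", 0.9),
    ("s1", "nasal", 3.0), ("s2", "nasal", 2.0), ("s3", "nasal", 1.0),
]
severity = {"s1": 0, "s2": 1, "s3": 2}

for phon_class, r in correlate_by_class(measurements, severity).items():
    print(phon_class, round(r, 3))
```

Because RBH ratings are ordinal, a rank-based coefficient such as Spearman's might be a more natural choice in practice; Pearson is used here only to keep the sketch free of dependencies.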
