Publication | Open Access
Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy
92
Citations
34
References
2020
Year
Voice ChangesHealthy SubjectsDiagnosisPathologyPathological SpeechLaryngeal CancerVoice AnalysisSpeech RecognitionVoice RecognitionRadiation OncologyAcoustic AnalysisCancer ResearchRadiologyHealth SciencesMedicineLarynxDeep LearningSpeech CommunicationSpeech TechnologySpeech AnalysisHigh AccuracyVoiceSpeech AcousticsSpeech ProcessingExtreme Gradient BoostingOncology
Voice changes may be the earliest signs in laryngeal cancer. We investigated whether automated voice signal analysis can be used to distinguish patients with laryngeal cancer from healthy subjects. We extracted features using the software package for speech analysis in phonetics (PRAAT) and calculated the Mel-frequency cepstral coefficients (MFCCs) from voice samples of a vowel sound of /a:/. The proposed method was tested with six algorithms: support vector machine (SVM), extreme gradient boosting (XGBoost), light gradient boosted machine (LGBM), artificial neural network (ANN), one-dimensional convolutional neural network (1D-CNN) and two-dimensional convolutional neural network (2D-CNN). Their performances were evaluated in terms of accuracy, sensitivity, and specificity. The result was compared with human performance. A total of four volunteers, two of whom were trained laryngologists, rated the same files. The 1D-CNN showed the highest accuracy of 85% and sensitivity and sensitivity and specificity levels of 78% and 93%. The two laryngologists achieved accuracy of 69.9% but sensitivity levels of 44%. Automated analysis of voice signals could differentiate subjects with laryngeal cancer from those of healthy subjects with higher diagnostic properties than those performed by the four volunteers.
| Year | Citations | |
|---|---|---|
Page 1
Page 1