Publication | Closed Access
Detecting Emotions in Mandarin Speech
19
Citations
17
References
2004
Year
EngineeringMfcc ComponentsSpeech CorpusSpoken Language ProcessingMultimodal Sentiment AnalysisSpeech RecognitionNatural Language ProcessingApplied LinguisticsPattern RecognitionComputational LinguisticsAffective ComputingLanguage StudiesSpeech Emotion RecognitionSpeech CommunicationSpeech AnalysisMandarin SpeechSpeech ProcessingSpeech PerceptionEmotionLinguisticsEmotion Recognition
The importance of automatically recognizing emotions in human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. In this paper, a Mandarin speech based emotion classification method is presented. Five primary human emotions, including anger, boredom, happiness, neutral and sadness, are investigated. Combining different feature streams to obtain a more accurate result is a well-known statistical technique. For speech emotion recognition, we combined 16 LPC coefficients, 12 LPCC components, 16 LFPC components, 16 PLP coefficients, 20 MFCC components and jitter as the basic features to form the feature vector. Two corpora were employed. The recognizer presented in this paper is based on three classification techniques: LDA, K-NN and HMMs. Results show that the selected features are robust and effective for the emotion recognition in the valence and arousal dimensions of the two corpora. Using the HMMs emotion classification method, an average accuracy of 88.7% was achieved.
| Year | Citations | |
|---|---|---|
Page 1
Page 1