Publication | Closed Access
Long short term memory recurrent neural network based encoding method for emotion recognition in video
31
Citations
24
References
2016
Year
Unknown Venue
EngineeringMachine LearningMultimodal Sentiment AnalysisRecurrent Neural NetworkSocial SciencesVideo InterpretationSpeech RecognitionNatural Language ProcessingEmotion ChallengePattern RecognitionAffective ComputingVideo TransformerCategory Emotion RecognitionHuman EmotionVideo UnderstandingDeep LearningFacial Expression RecognitionEmotionEmotion Recognition
Human emotion is a temporally dynamic event which can be inferred from both audio and video feature sequences. In this paper we investigate the long short term memory recurrent neural network (LSTM-RNN) based encoding method for category emotion recognition in the video. LSTM-RNN is able to incorporate knowledge about how emotion evolves over long range successive frames and emotion clues from isolated frame. After encoding, each video clip can be represented by a vector for each input feature sequence. The vectors contain both frame level and sequence level emotion information. These vectors are then concatenated and fed into support vector machine (SVM) to get the final prediction result. Extensive evaluations on Emotion Challenge in the Wild (EmotiW2015) dataset show the efficiency of the proposed encoding method and competitive results are obtained. The final recognition accuracy achieves 46.38% for audio-video emotion recognition sub-challenge, where the challenge baseline is 39.33%.
| Year | Citations | |
|---|---|---|
Page 1
Page 1