Publication | Closed Access
DBLSTM-based multi-scale fusion for dynamic emotion prediction in music
22
Citations
9
References
2016
Year
Unknown Venue
MusicEngineeringMachine LearningDynamic Emotion PredictionMultimodal Sentiment AnalysisSocial SciencesSpeech RecognitionNatural Language ProcessingData ScienceDblstm ModelsAffective ComputingMusic RetrievalPredictive AnalyticsAudio RetrievalComputer ScienceDeep LearningMusic ClassificationTemporal ContextEmotion Recognition
Dynamic Music Emotion Prediction is crucial to the emerging applications of music retrieval and recommendation. Considering the influence of temporal context and hierarchical structure on emotion in music, we propose a Deep Bidirectional Long Short-Term Memory (DBLSTM) based multi-scale regression method. In this method, a post-processing component is utilised for individual DBSLTM output to further enhance the ability of temporal context processing and a fusion component is to integrate the output of all DBLSTM models with different scales. In addition, we investigate how the difference of sequence length between the training and predicting phase affects the performance of DBLSTM. We conduct our experiments on a public database of Emotion in Music task at MediaEval 2015, and the result shows that our method achieves significant improvement when compared with the state-of-art methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1